September 18, 2025
3 min read
New AI Tool Predicts Which of 1,000 Diseases Someone May Develop in 20 Years
A large language model called Delphi-2M analyzes a person’s medical records and lifestyle to provide risk estimates for more than 1,000 diseases
Boris Zhitkov/Getty Images
A new artificial intelligence (AI) tool can forecast a person’s risk of developing more than 1,000 diseases, in some cases providing a prediction decades in advance.
The model, called Delphi-2M, uses health records and lifestyle factors to estimate the likelihood that a person will develop diseases such as cancer, skin diseases and immune conditions up to 20 years ahead of time. Although Delphi-2M was trained only on one data set from the United Kingdom, its multi-disease modelling could one day help clinicians to identify high-risk people, allowing for the early roll-out of preventive measures. The model is described in a study published today in Nature.
The tool’s ability to model multiple diseases in one go is “astonishing,” says Stefan Feuerriegel, a computer scientist at the Ludwig Maximilian University of Munich in Germany, who has developed AI models for medical applications. “It can generate entire future health trajectories,” he says.
On supporting science journalism
If you’re enjoying this article, consider supporting our award-winning journalism by subscribing. By purchasing a subscription you are helping to ensure the future of impactful stories about the discoveries and ideas shaping our world today.
Oracle of health
Researchers have already developed AI-based tools to predict a person’s risk of developing certain conditions, including some cancers and cardiovascular disease. But most of these tools estimate the risk of only one disease, says study co-author Moritz Gerstung, a data scientist at the German Cancer Research Center in Heidelberg. “A health-care professional would have to run dozens of them to deliver a comprehensive answer,” he says.
To address this, Gerstung and his colleagues modified a type of large language model (LLM) called a generative pre-trained transformer (GPT), that forms the underpinning of AI chatbots such as ChatGPT. When asked a question, GPTs provide outputs that, according to their training on vast volumes of data, are statistically probable.
The authors designed their modified LLM to forecast a person’s likelihood of developing 1,258 diseases on the basis of their past medical history. The model also incorporates the person’s age, sex, body mass index and health-related habits, such as tobacco use and alcohol consumption. The researchers trained Delphi-2M on data from 400,000 participants of the UK Biobank, a long-term biomedical monitoring study.
For most diseases, Delphi-2M’s predictions matched or exceeded the accuracy of those of current models that estimate the risk of developing a single illness. The tool also performed better than a machine-learning algorithm that uses biomarkers — levels of specific molecules or compounds in the body — to predict the risk of several diseases. “It worked astonishingly well,” says Gerstung.
Delphi-2M worked best when forecasting the trajectories of conditions that follow predictable patterns of progression, such as some types of cancer. The model calculated the probability of a person developing each illness for a time period of up to two decades, depending on the information included in their medical records.
Early-warning system
Gerstung and his colleagues tested Delphi-2M on health data from 1.9 million people in the Danish National Patient Registry, a national database that has tracked hospital admissions for almost half a century. The authors found that the model’s predictions for people in the registry were only slightly less accurate than they were for participants in the UK Biobank. This demonstrates that the model could still make somewhat reliable predictions when it’s applied to data sets from national health systems other than the one it trained on, says Gerstung.
Delphi-2M is an “intriguing” contribution to the burgeoning field of modelling multiple diseases at once, but it has its limitations, says Degui Zhi, a bioinformatics researcher who develops AI models at the University of Texas Health Science Center at Houston. For instance, the UK Biobank data only captured participants’ first brush with a disease. The number of times someone has had an illness is “important for the modelling of personal health trajectories,” says Zhi.
Gerstung and his colleagues will evaluate Delphi-2M’s accuracy on data sets from several countries to expand its scope. “Thinking about how this information can be combined for developing even more precise algorithms will be important,” he says.
This article is reproduced with permission and was first published on September 17, 2025.
It’s Time to Stand Up for Science
If you enjoyed this article, I’d like to ask for your support. Scientific American has served as an advocate for science and industry for 180 years, and right now may be the most critical moment in that two-century history.
I’ve been a Scientific American subscriber since I was 12 years old, and it helped shape the way I look at the world. SciAm always educates and delights me, and inspires a sense of awe for our vast, beautiful universe. I hope it does that for you, too.
If you subscribe to Scientific American, you help ensure that our coverage is centered on meaningful research and discovery; that we have the resources to report on the decisions that threaten labs across the U.S.; and that we support both budding and working scientists at a time when the value of science itself too often goes unrecognized.
In return, you get essential news, captivating podcasts, brilliant infographics, can’t-miss newsletters, must-watch videos, challenging games, and the science world’s best writing and reporting. You can even gift someone a subscription.
There has never been a more important time for us to stand up and show why science matters. I hope you’ll support us in that mission.