
With the advent of artificial intelligence (AI), predictive medicine is becoming an important part of health care, especially in cancer treatment. Predictive medicine uses algorithms and data to help doctors understand how a cancer might continue to grow or react to specific drugs—making it easier to target precision treatment for individual patients.
While AI is important in this work, researchers from University of Maryland School of Medicine (UMSOM) say that it should not be relied on exclusively. Instead, AI should be combined with other methods, such as traditional mathematical modeling, for the best outcomes.
In a commentary published April 14 in Nature Biotechnology, Elana Fertig, Ph.D., Director of the Institute for Genome Sciences (IGS) and Professor of Medicine at UMSOM and Daniel Bergman, Ph.D., an IGS scientist argue that mathematical modeling has been underestimated and under-used in precision medicine to date.
All health computational models need three key components to work: datasets, equations, and software. Then, after generating data, comes leveraging it to improve early diagnoses, discover new treatments, and aid understanding of the diseases.
In a second commentary, out April 15 in Cell Reports Medicine, Dr. Fertig and IGS colleagues Dmitrijs Lvovs, Ph.D., Anup Mahurkar, Ph.D., and Owen White, Ph.D., address how to ethically share health data and methods to create reproducible science.
Taken together, the two commentaries set a foundational approach to generating, analyzing, and ethically sharing data to benefit both patients and science.
Explaining the argument of the Nature Biotechnology commentary Dr. Fertig said, “AI and mathematical models differ dramatically in how they arrive at an outcome.AI models first must be trained with existing data to make an outcome prediction, while mathematical models are directed to answer a specific question using both data and biological knowledge.”
That means that when data is sparse—as it often is in newer cancer treatments such as immunotherapy—AI can over generalize, resulting in biased or inaccurate outcomes that cannot be reproduced by other scientists. Mathematical modeling, on the other hand, uses known biological mechanisms, learned from scientific experiments, to explain how it arrived at an outcome.
“For example, with a mathematical model, we could create virtual cancer cells and healthy cells and write a program that would mimic how those cells interact and evolve inside of a tumor with different types of treatments,” said Dr. Bergman, Assistant Professor at IGS and UMSOM’s Department of Pharmacology, Physiology, and Drug Development. “At this time, AI cannot give us that type of specificity.”
The authors state that, in addition to using both types of models in “computational immunotherapy,” using a breadth of populations, and making datasets publicly available are critical for the most accurate outcomes.
“Data breadth and accuracy are key. Artifacts in the dataset, or even a simple typo in computer code, can throw off the accuracy of either type of model,” added Dr. Fertig. “Therefore, for any analysis pipeline to work correctly, it must be reproducible and that can only be assured by open science—giving access to other researchers whose work can confirm the models will get the right treatment to the right patient.”
However, reproducibility remains a critical challenge in science. In a 2016 article in Nature surveying more than 1,500 scientists, more than 70% of researchers said they have tried and failed to reproduce another scientist’s experiments, and more than half have failed to reproduce their own experiments.
“Reproducible research enables investigators to verify that the findings are accurate, reduce biases, promote scientific integrity, and build trust,” explained Dmitrijs Lvovs, Ph.D., Research Associate at IGS and first author on the Cell Reports Medicine commentary. “Because data science is computationally driven, all results should be transparent and automatically reproducible from the same dataset if the analysis code is readily available through open science.”
While that sounds simple enough—and there are best practices in place—the challenge, the authors argue, is how to share data while protecting patient privacy and blocking unauthorized data breaches. Genomic data, when combined with personal health information (PHI), could lead to re-identification of patients, a privacy violation.
The authors say that creating ethical open science data sharing means: 1. Getting detailed informed consent from patients; 2. Ensuring data quality when collecting and processing data by mitigating errors; 3. Harmonizing and standardizing data collected from disparate sources; 4. Using and creating resources and platforms, such as multiomic, clinical, public health, and drug discovery repositories; and 5. Working with vetted pipelines, such as open-source analysis tools and software platforms.
“Ethical and responsible data sharing democratizes research, supports the advancement of AI, and informs public health policies,” said Dr. Lvovs. “With ethical and responsible data sharing, the biomedical research community can maximize the benefits of shared data, accelerate discovery, and improve human health.”
More information:
Daniel R. Bergman et al, Virtual cells for predictive immunotherapy, Nature Biotechnology (2025). DOI: 10.1038/s41587-025-02583-2
Dmitrijs Lvovs et al, Balancing ethical data sharing and open science for reproducible research in biomedical data science, Cell Reports Medicine (2025). DOI: 10.1016/j.xcrm.2025.102080
Citation:
AI in health care is not a standalone solution, researchers caution (2025, April 15)
retrieved 15 April 2025
from https://medicalxpress.com/news/2025-04-ai-health-standalone-solution-caution.html
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
part may be reproduced without the written permission. The content is provided for information purposes only.

With the advent of artificial intelligence (AI), predictive medicine is becoming an important part of health care, especially in cancer treatment. Predictive medicine uses algorithms and data to help doctors understand how a cancer might continue to grow or react to specific drugs—making it easier to target precision treatment for individual patients.
While AI is important in this work, researchers from University of Maryland School of Medicine (UMSOM) say that it should not be relied on exclusively. Instead, AI should be combined with other methods, such as traditional mathematical modeling, for the best outcomes.
In a commentary published April 14 in Nature Biotechnology, Elana Fertig, Ph.D., Director of the Institute for Genome Sciences (IGS) and Professor of Medicine at UMSOM and Daniel Bergman, Ph.D., an IGS scientist argue that mathematical modeling has been underestimated and under-used in precision medicine to date.
All health computational models need three key components to work: datasets, equations, and software. Then, after generating data, comes leveraging it to improve early diagnoses, discover new treatments, and aid understanding of the diseases.
In a second commentary, out April 15 in Cell Reports Medicine, Dr. Fertig and IGS colleagues Dmitrijs Lvovs, Ph.D., Anup Mahurkar, Ph.D., and Owen White, Ph.D., address how to ethically share health data and methods to create reproducible science.
Taken together, the two commentaries set a foundational approach to generating, analyzing, and ethically sharing data to benefit both patients and science.
Explaining the argument of the Nature Biotechnology commentary Dr. Fertig said, “AI and mathematical models differ dramatically in how they arrive at an outcome.AI models first must be trained with existing data to make an outcome prediction, while mathematical models are directed to answer a specific question using both data and biological knowledge.”
That means that when data is sparse—as it often is in newer cancer treatments such as immunotherapy—AI can over generalize, resulting in biased or inaccurate outcomes that cannot be reproduced by other scientists. Mathematical modeling, on the other hand, uses known biological mechanisms, learned from scientific experiments, to explain how it arrived at an outcome.
“For example, with a mathematical model, we could create virtual cancer cells and healthy cells and write a program that would mimic how those cells interact and evolve inside of a tumor with different types of treatments,” said Dr. Bergman, Assistant Professor at IGS and UMSOM’s Department of Pharmacology, Physiology, and Drug Development. “At this time, AI cannot give us that type of specificity.”
The authors state that, in addition to using both types of models in “computational immunotherapy,” using a breadth of populations, and making datasets publicly available are critical for the most accurate outcomes.
“Data breadth and accuracy are key. Artifacts in the dataset, or even a simple typo in computer code, can throw off the accuracy of either type of model,” added Dr. Fertig. “Therefore, for any analysis pipeline to work correctly, it must be reproducible and that can only be assured by open science—giving access to other researchers whose work can confirm the models will get the right treatment to the right patient.”
However, reproducibility remains a critical challenge in science. In a 2016 article in Nature surveying more than 1,500 scientists, more than 70% of researchers said they have tried and failed to reproduce another scientist’s experiments, and more than half have failed to reproduce their own experiments.
“Reproducible research enables investigators to verify that the findings are accurate, reduce biases, promote scientific integrity, and build trust,” explained Dmitrijs Lvovs, Ph.D., Research Associate at IGS and first author on the Cell Reports Medicine commentary. “Because data science is computationally driven, all results should be transparent and automatically reproducible from the same dataset if the analysis code is readily available through open science.”
While that sounds simple enough—and there are best practices in place—the challenge, the authors argue, is how to share data while protecting patient privacy and blocking unauthorized data breaches. Genomic data, when combined with personal health information (PHI), could lead to re-identification of patients, a privacy violation.
The authors say that creating ethical open science data sharing means: 1. Getting detailed informed consent from patients; 2. Ensuring data quality when collecting and processing data by mitigating errors; 3. Harmonizing and standardizing data collected from disparate sources; 4. Using and creating resources and platforms, such as multiomic, clinical, public health, and drug discovery repositories; and 5. Working with vetted pipelines, such as open-source analysis tools and software platforms.
“Ethical and responsible data sharing democratizes research, supports the advancement of AI, and informs public health policies,” said Dr. Lvovs. “With ethical and responsible data sharing, the biomedical research community can maximize the benefits of shared data, accelerate discovery, and improve human health.”
More information:
Daniel R. Bergman et al, Virtual cells for predictive immunotherapy, Nature Biotechnology (2025). DOI: 10.1038/s41587-025-02583-2
Dmitrijs Lvovs et al, Balancing ethical data sharing and open science for reproducible research in biomedical data science, Cell Reports Medicine (2025). DOI: 10.1016/j.xcrm.2025.102080
Citation:
AI in health care is not a standalone solution, researchers caution (2025, April 15)
retrieved 15 April 2025
from https://medicalxpress.com/news/2025-04-ai-health-standalone-solution-caution.html
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
part may be reproduced without the written permission. The content is provided for information purposes only.