
Gut bacteria are known to be a key factor in many health-related concerns. However, the number and variety of them is vast, as are the ways in which they interact with the body’s chemistry and each other.
For the first time, researchers from the University of Tokyo have used a special kind of artificial intelligence called a Bayesian neural network to probe a dataset of gut bacteria in order to find relationships that current analytical tools could not reliably identify.
The human body includes about 30 trillion to 40 trillion cells, but your intestines contain about 100 trillion gut bacteria. Technically, you’re carrying around more cells that aren’t part of you than are. These gut bacteria are of course responsible for some aspects of digestion, though what’s surprising to some is how they can relate to many other aspects of human health as well.
The bacteria are incredibly varied and also produce and modify a bewildering number of different chemicals called metabolites. These act like molecular messengers, permeating your body, affecting everything from your immune system and metabolism to your brain function and mood. Needless to say, there’s much to gain by understanding gut bacteria.
“The problem is that we’re only beginning to understand which bacteria produce which human metabolites and how these relationships change in different diseases,” said Project Researcher Tung Dang from the Tsunoda lab in the Department of Biological Sciences.
“By accurately mapping these bacteria-chemical relationships, we could potentially develop personalized treatments. Imagine being able to grow a specific bacterium to produce beneficial human metabolites or designing targeted therapies that modify these metabolites to treat diseases.”

This sounds good, so what’s the problem? As mentioned, there are uncountably many and varied bacteria and metabolites, and therefore far more relationships between these things. Gathering data on this alone is a monumental undertaking, but unpicking that data to find interesting patterns that might betray some useful function is even more so. To do this, Dang and his team decided to explore the use of state-of-the-art artificial intelligence (AI) tools.
“Our system, VBayesMM, automatically distinguishes the key players that significantly influence metabolites from the vast background of less relevant microbes, while also acknowledging uncertainty about the predicted relationships, rather than providing overconfident but potentially wrong answers,” said Dang.
“When tested on real data from sleep disorder, obesity and cancer studies, our approach consistently outperformed existing methods and identified specific bacterial families that align with known biological processes, giving confidence that it discovers real biological relationships rather than meaningless statistical patterns.”
As VBayesMM can handle and communicate issues of uncertainty, it gives researchers more confidence than a tool which does not. Even though the system is optimized to cope with heavy analytical workloads, mining such huge datasets still comes with high computational cost. However, as time goes on, this will become less and less of a barrier to those wishing to use them.
Other limitations at present include that the system benefits from having more data about the gut bacteria than the metabolites they produce; when there’s insufficient bacteria data, the accuracy drops. Also, VBayesMM assumes the microbes act independently, but in reality, gut bacteria interact in an incredibly complex number of ways.
“We plan to work with more comprehensive chemical datasets that capture the complete range of bacterial products, though this creates new challenges in determining whether chemicals come from bacteria, the human body or external sources like diet,” said Dang. “We also aim to make VBayesMM more robust when analyzing diverse patient populations, incorporating bacterial ‘family tree’ relationships to make better predictions, and further reducing the computational time needed for analysis.
For clinical applications, the ultimate goal is identifying specific bacterial targets for treatments or dietary interventions that could actually help patients, moving from basic research toward practical medical applications.”
More information:
Dang Tung et al, VBayesMM Variational Bayesian neural network to prioritize important relationships of high dimensional microbiome multiomics data, Briefings In Bioinformatics (2025). DOI: 10.1093/bib/bbaf300
Citation:
How AI is helping researchers to demystify gut bacteria (2025, July 4)
retrieved 4 July 2025
from https://medicalxpress.com/news/2025-07-ai-demystify-gut-bacteria.html
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
part may be reproduced without the written permission. The content is provided for information purposes only.

Gut bacteria are known to be a key factor in many health-related concerns. However, the number and variety of them is vast, as are the ways in which they interact with the body’s chemistry and each other.
For the first time, researchers from the University of Tokyo have used a special kind of artificial intelligence called a Bayesian neural network to probe a dataset of gut bacteria in order to find relationships that current analytical tools could not reliably identify.
The human body includes about 30 trillion to 40 trillion cells, but your intestines contain about 100 trillion gut bacteria. Technically, you’re carrying around more cells that aren’t part of you than are. These gut bacteria are of course responsible for some aspects of digestion, though what’s surprising to some is how they can relate to many other aspects of human health as well.
The bacteria are incredibly varied and also produce and modify a bewildering number of different chemicals called metabolites. These act like molecular messengers, permeating your body, affecting everything from your immune system and metabolism to your brain function and mood. Needless to say, there’s much to gain by understanding gut bacteria.
“The problem is that we’re only beginning to understand which bacteria produce which human metabolites and how these relationships change in different diseases,” said Project Researcher Tung Dang from the Tsunoda lab in the Department of Biological Sciences.
“By accurately mapping these bacteria-chemical relationships, we could potentially develop personalized treatments. Imagine being able to grow a specific bacterium to produce beneficial human metabolites or designing targeted therapies that modify these metabolites to treat diseases.”

This sounds good, so what’s the problem? As mentioned, there are uncountably many and varied bacteria and metabolites, and therefore far more relationships between these things. Gathering data on this alone is a monumental undertaking, but unpicking that data to find interesting patterns that might betray some useful function is even more so. To do this, Dang and his team decided to explore the use of state-of-the-art artificial intelligence (AI) tools.
“Our system, VBayesMM, automatically distinguishes the key players that significantly influence metabolites from the vast background of less relevant microbes, while also acknowledging uncertainty about the predicted relationships, rather than providing overconfident but potentially wrong answers,” said Dang.
“When tested on real data from sleep disorder, obesity and cancer studies, our approach consistently outperformed existing methods and identified specific bacterial families that align with known biological processes, giving confidence that it discovers real biological relationships rather than meaningless statistical patterns.”
As VBayesMM can handle and communicate issues of uncertainty, it gives researchers more confidence than a tool which does not. Even though the system is optimized to cope with heavy analytical workloads, mining such huge datasets still comes with high computational cost. However, as time goes on, this will become less and less of a barrier to those wishing to use them.
Other limitations at present include that the system benefits from having more data about the gut bacteria than the metabolites they produce; when there’s insufficient bacteria data, the accuracy drops. Also, VBayesMM assumes the microbes act independently, but in reality, gut bacteria interact in an incredibly complex number of ways.
“We plan to work with more comprehensive chemical datasets that capture the complete range of bacterial products, though this creates new challenges in determining whether chemicals come from bacteria, the human body or external sources like diet,” said Dang. “We also aim to make VBayesMM more robust when analyzing diverse patient populations, incorporating bacterial ‘family tree’ relationships to make better predictions, and further reducing the computational time needed for analysis.
For clinical applications, the ultimate goal is identifying specific bacterial targets for treatments or dietary interventions that could actually help patients, moving from basic research toward practical medical applications.”
More information:
Dang Tung et al, VBayesMM Variational Bayesian neural network to prioritize important relationships of high dimensional microbiome multiomics data, Briefings In Bioinformatics (2025). DOI: 10.1093/bib/bbaf300
Citation:
How AI is helping researchers to demystify gut bacteria (2025, July 4)
retrieved 4 July 2025
from https://medicalxpress.com/news/2025-07-ai-demystify-gut-bacteria.html
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
part may be reproduced without the written permission. The content is provided for information purposes only.