Microbiome foundation modeling of cancer: MiFM-derived continuous trajectories and risk clocks for pan-cancer immunotherapy response prediction
Presenter: Hongru Shen, PhD Session: Large Language Models in the Clinic Time: 4/20/2026 2:00:00 PM → 4/20/2026 5:00:00 PM
Authors
Hongru Shen , Yajing Bi , Yan Zong , Zhangyan Lyu , Chao Zhang , Kexin Chen , Xiangchun Li Tianjin Medical Univ. Cancer Inst. & Hospital, Tianjin, China
Abstract
Background: The human microbiome shapes cancer risk, progression and response to immunotherapy, but most existing microbiome models are task-specific and fail to generalize across cohorts and sequencing platforms. We developed MiFM (Microbiome Foundation Model), a self-supervised Transformer trained on ~2 million human, animal and environmental microbiome profiles. MiFM encodes communities as multi-level tokens (species, genus and functional pathways) and uses masked reconstruction plus contrastive learning to obtain transferable embeddings that capture taxonomic and functional structure. Here, we evaluated whether MiFM embeddings can support continuous, clinically interpretable readouts of cancer trajectories and immunotherapy response. Methods: From MiFM embeddings we derived two continuous metrics: (1) a disease pseudotime placing each patient along a microbiome-encoded cancer progression trajectory; and (2) a microbiome cancer risk clock estimating microbiome-derived cancer risk and immunotherapy non-response at the individual level. We applied this framework to colorectal cancer (CRC) cohorts spanning normal mucosa, adenoma and carcinoma, and to 12 independent pan-cancer immune checkpoint blockade cohorts (n =1237). We additionally defined a microbiome-based aging acceleration signature and evaluated its association with disease pseudotime, the risk clock and overall survival. Results: In CRC, disease pseudotime recapitulated the adenoma-carcinoma sequence, with median values of 0.15 (normal), 0.38 (adenoma) and 0.72 (carcinoma; p Conclusions: To our knowledge, MiFM is the first large-scale microbiome foundation model applied to oncology, yielding continuous microbiome-derived readouts of cancer progression and immunotherapy response. The disease pseudotime and microbiome cancer risk clock capture cancer trajectories beyond conventional staging, enable ultra-early identification of high-risk lesions in normal-appearing mucosa and support robust prediction of immunotherapy resistance across cohorts. This framework delivers concrete microbiome biomarkers and a scalable platform for microbiome-guided precision oncology.
Disclosure
H. Shen, None.. Y. Bi, None.. Z. Lyu, None.. K. Chen, None.. X. Li, None.
Cited in
Control: 6133 · Presentation Id: 2654 · Meeting 21436