Protocol for using treeLFA to infer multimorbidity patterns in the form of disease topics from diagnosis data in biobanks.

Zhang Y., Jiang X., McVean G., Lunter G.

Research on multimorbidity patterns promotes our understanding of the common pathological mechanisms that underlie co-occurring diseases. Here, we present a protocol to infer multimorbidity clusters in the form of disease topics from large-scale diagnosis data using treeLFA, a topic model based on the Bayesian binary non-negative matrix factorization. We describe steps for installing software, preparing input data, and training the model. We then detail post-processing procedures to obtain summarized results for downstream analyses. For complete details on the use and execution of this protocol, please refer to Zhang et al.1.

DOI

10.1016/j.xpro.2025.104033

Type

Journal article

Publication Date

2025-09-01T00:00:00+00:00

Volume

6

Addresses

Department of Radiation Oncology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100006, China; Chinese Academy of Medical Sciences Oxford Institute, Nuffield Department of Medicine, University of Oxford, Oxford OX3 7BN, UK; Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford OX3 7LF, UK.

Keywords

Humans, Bayes Theorem, Software, Biological Specimen Banks, Multimorbidity

Permalink More information Close