Hello! My name is Nicholas Ho and I am a third year Computational Biology PhD student in Carnegie Mellon University School of Computer Science. I am incredibly grateful to be advised by Professor Jian Ma and Professor Eric Xing. I develop self-supervised models that learn from large biological datasets to uncover new insights across different scales of biology.
I believe that the task always informs the representation. In biology, because data is limited, it is critical to design pretraining objectives aligned with the downstream tasks that matter. By working on the right problems with the right inductive biases, I believe we can build generalizable and scalable models.
Feel free to reach out to me! I'm always looking for interesting problems to talk about!
Most recent publications on Google Scholar.
‡ indicates equal contribution.
HEIMDALL: Disentangling Tokenizer Design for Robust Transfer in Single-Cell Foundation Models
Ellie Haber*, Shahul Alam*, Nicholas Ho*, Renming Liu, Evan Trop, Shaoheng Liang, Muyu Yang, Spencer Krieger, Jian Ma.
bioRxiv, 2025.
AIDO.Tissue: Spatial Cell-Guided Pretraining for Scalable Spatial Transcriptomics Foundation Model
Jing Gong*, Yixuan Wang*, Nicholas Ho, Xingyi Cheng, Le Song, Eric Xing.
bioRxiv, 2025.
Foundation Models Improve Perturbation Response Prediction
Elijah Cole, Geert-Jan Huizing, Sohan Addagudi, Nicholas Ho, Euxhen Hasanaj, Merel Kuijs, Toby Johnstone, Maria Carilli, Alec Davi, Caleb Ellington, Christoph Feinauer, Pan Li, Romain Menegaux, Shahin Mohammadi, Yanjun Shao, Josiah Zhang, Emma Lundberg, Le Song, Ziv Bar-Joseph, Eric P. Xing.
bioRxiv, 2026.
Scaling Dense Representations for Single Cell with Transcriptome-Scale Context
Nicholas Ho, Caleb N. Ellington, Jinyu Hou, Sohan Addagudi, Shentong Mo, Tianhua Tao, Dian Li, Yonghao Zhuang, Hongyi Wang, Xingyi Cheng, Le Song, Eric P. Xing.
In NeurIPS Workshop on AI for New Drug Modalities, 2024.
Accurate and General DNA Representations Emerge from Genome Foundation Models at Scale
Caleb N. Ellington, Ning Sun, Nicholas Ho, Tianhua Tao, Sazan Mahbub, Dian Li, Yonghao Zhuang, Hongyi Wang, Le Song, Eric P. Xing.
In NeurIPS Workshop on AI for New Drug Modalities, 2024.
Learning Free Energy Pathways through Reinforcement Learning of Adaptive Steered Molecular Dynamics
Nicholas Ho, John Kevin Cava, John Vant, Ankita Shukla, Jacob Miratsky, Pavan Turaga, Ross Maciejewski, Abhishek Singharoy.
Machine Learning In Structural Biology (MLSB) Workshop at the 36th Conference on Neural Information Processing Systems
HEIMDALL: Disentangling Tokenizer Design for Robust Transfer in Single-Cell Foundation Models
Ellie Haber*, Shahul Alam*, Nicholas Ho*, Renming Liu, Evan Trop, Shaoheng Liang, Muyu Yang, Spencer Krieger, Jian Ma.
bioRxiv, 2025.
AIDO.Tissue: Spatial Cell-Guided Pretraining for Scalable Spatial Transcriptomics Foundation Model
Jing Gong*, Yixuan Wang*, Nicholas Ho, Xingyi Cheng, Le Song, Eric Xing.
bioRxiv, 2025.
Foundation Models Improve Perturbation Response Prediction
Elijah Cole, Geert-Jan Huizing, Sohan Addagudi, Nicholas Ho, Euxhen Hasanaj, Merel Kuijs, Toby Johnstone, Maria Carilli, Alec Davi, Caleb Ellington, Christoph Feinauer, Pan Li, Romain Menegaux, Shahin Mohammadi, Yanjun Shao, Josiah Zhang, Emma Lundberg, Le Song, Ziv Bar-Joseph, Eric P. Xing.
bioRxiv, 2026.
Scaling Dense Representations for Single Cell with Transcriptome-Scale Context
Nicholas Ho, Caleb N. Ellington, Jinyu Hou, Sohan Addagudi, Shentong Mo, Tianhua Tao, Dian Li, Yonghao Zhuang, Hongyi Wang, Xingyi Cheng, Le Song, Eric P. Xing.
In NeurIPS Workshop on AI for New Drug Modalities, 2024.
Accurate and General DNA Representations Emerge from Genome Foundation Models at Scale
Caleb N. Ellington, Ning Sun, Nicholas Ho, Tianhua Tao, Sazan Mahbub, Dian Li, Yonghao Zhuang, Hongyi Wang, Le Song, Eric P. Xing.
In NeurIPS Workshop on AI for New Drug Modalities, 2024.
Learning Free Energy Pathways through Reinforcement Learning of Adaptive Steered Molecular Dynamics
Nicholas Ho, John Kevin Cava, John Vant, Ankita Shukla, Jacob Miratsky, Pavan Turaga, Ross Maciejewski, Abhishek Singharoy.
Machine Learning In Structural Biology (MLSB) Workshop at the 36th Conference on Neural Information Processing Systems
Towards Conditional Generation of Minimal Action Potential Pathways for Molecular Dynamics
John Kevin Cava, John Vant, Nicholas Ho, Ankita Shukla, Pavan Turaga, Ross Maciejewski, Abhishek Singharoy
ELLIS ML4Molecules Workshop, 2021
CyanoPATH: a knowledgebase of genome-scale functional repertoire for toxic cyanobacterial blooms
Wei Du, Gaoyang Li, Nicholas Ho, Landon Jenkins, Drew Hockaday, Jiankang Tan, Huansheng Cao
Briefings in Bioinformatics