Chengxi Zang

Chengxi Zang (臧承熙), Ph.D.

Dr. Chengxi Zang is currently an Assistant Professor of Population Health Sciences, Division of Health Informatics and Artificial Intelligence, at Weill Medical College of Cornell University, Weill Cornell Medicine (WCM), and a faculty member in the WCM Institute of AI for Digital Health (AIDH). He earned his Ph.D. from Tsinghua University in January 2019, receiving an Excellent Ph.D. Dissertation Award in the Computer Science Department and an Excellent Ph.D. Award from Tsinghua University. His long-term research interest is AI for health (AI4Health), with a current focus on utilizing AI, Machine Learning, and large-scale Real-World health Data (RWD) to generate Robust, Generalizable, and High-throughput Real-World Evidence (RWE), aiming to address top health challenges including Alzheimer's Disease, Long COVID, Youth suicide, and Women's health, as well as to accelerate the drug discovery and development process. He develops advanced deep generative models, causal inference models, predictive models, graph neural networks, and more. His research has been published in top Medical Journals such as Nature Medicine, Nature Communications (1, 2, 3), Translational Psychiatry, eClinicalMedicine, Communications Medicine, Journal of General Internal Medicine, Scientific Reports, Cell Patterns, Archives of Pathology & Laboratory Medicine, etc. His work has also appeared in leading Computer Science venues, including KDD, AAAI, TKDE, ICDM, etc. His papers have won the ICDM'18 Best Paper Candidate and the Best Paper Award at the AAAI'20 Workshop on Deep Learning on Graphs. His research, algorithms, and code have been applied at companies including NVIDIA, Boehringer Ingelheim, etc., and have received extensive media coverage, including Fortune, Nature Medicine, NIH News, NIH Director's Blog, BMJ News, etc.

[PDF] C.V. (Updated in Sept. 2020)
[JPG] Formal Headshot Photo

Research

Research Interests

Health Data Science, AI4Health, AI for Drug Discovery & Development
Real-World Evidence (RWE), Real-World Data (RWD), Electronic Health Records (EHRs), Administrative claims
ML-Driven Target Trial Emulation, RWD-based drug repurposing, RWD-based trial design, Deep Generative Models, Causal Inference
Post-Acute Sequelae of COVID-19 (PASC, Long COVID), Alzheimer's Disease, Mental Health, Youth Suicide, Women's Health
Graph Neural Networks, Graph Mining, Network Science, Social Media Analysis

Research Experiences

May 2025 - present, Assistant Professor in Population Health Sciences, Weill Cornell Medicine, Weill Medical College of Cornell University
May 2022 - April 2025, Instructor in Population Health Sciences, Weill Cornell Medicine, Weill Medical College of Cornell University
February 2019 - April 2022, Weill Cornell Medicine, Cornell University, working with Prof. Fei Wang
March 2020 - January 2021, Boehringer Ingelheim Pharmaceuticals, Inc., Postdoctoral Fellowship, Biostatistics & Data Science Americas
July 2018 - October 2018, WeChat, Tencent Rhino-Bird Elite Training Program
June 2017 - May 2018, Center for Complex Network Research, Visiting scholar, working with Prof. Albert-László Barabási
Summer 2015 & 2016, working with visiting Prof. Christos Faloutsos from Carnegie Mellon University
Summer 2014 & 2015, working with visiting Prof. Chaoming Song from the University of Miami
September 2013 - January 2019, Department of Computer Science, Tsinghua University. Advisors: Prof. Wenwu Zhu and Prof. Peng Cui
March 2011 - May 2012, Department of Computer Science, Sun Yat-sen University. Advisor: Prof. Jiwu Huang

Grants

1OT2HL161847, NIH/NHLBI, co-I (2021-2025)
R01AG080991, NIH/NIA, co-I (2023-2027)

Awards

National Institutes of Health (NIH) Early Career Reviewer (2024)
American Medical Informatics Association (AMIA) Informatics Year-in-Review (2024)
Postdoc Fellowship, Boehringer Ingelheim (2020)
Best Paper Award. The First International Workshop on Deep Learning on Graphs: Methodologies and Applications (DLGMA’20) in AAAI (2020)
Excellent Ph.D. Award. Tsinghua University (2019)
Excellent Ph.D. Dissertation Award, Computer Science Department of Tsinghua University (2019)
Best Paper Candidate, 2018 IEEE International Conference on Data Mining (ICDM) (2018)
ACM SIGKDD Student Travel Award (3 years in a row) (2016-2018)

Reviewers

Top Computer Science venues: ICML, KDD, AAAI, ICDM, CIKM, SDM, WWW, WSDM, TKDE, TBD, TKDD, TWEB, PRLETTERS, KAIS, Frontiers in (Big Data|Computer Science), etc.
Top Medical journals: Nature Communications, Cell Reports Medicine, npj Digital Medicine, The Lancet Digital Health, Nature Machine Intelligence, Communications Medicine, BMC Medicine, Journal of Biomedical Informatics, ACM Transactions on Computing for Healthcare, JAMIA Open, Frontiers in (Public Health|Neurology), etc.
NIH Early Career Reviewer, UK Research and Innovation (UKRI)

Education

February 2019 - April 2022, Postdoctoral Research Associate, Weill Cornell Medicine, Cornell University
August 2013 - January 2019, Doctor of Engineering, Tsinghua University, Department of Computer Science and Technology (Excellent Ph.D. Award in Tsinghua University, and Excellent Ph.D. Dissertation Award in the Computer Science Department. Top 3 in CS, and top 70/2279=3% in THU.)
September 2009 - July 2013, Bachelor of Science, Sun Yat-Sen University, Department of Computer Science and Technology (GPA 4.1/5, Rank 1st/100)

Publications

Working Manuscripts

M2	Federated Causal Inference in Healthcare: Methods, Challenges, and Applications Haoyang Li, Jie Xu, Kyra Gan, Fei Wang, Chengxi Zang (2025)	[PDF]
M1	TrialGenie: Empowering Clinical Trial Design with Agentic Intelligence and Real World Data Haoyang Li, Weishen Pan, Suraj Rajendran, Chengxi Zang, Fei Wang (2025)	[PDF]

Journal Papers

J21	Federated target trial emulation using distributed observational data for treatment effect estimation Haoyang Li, Chengxi Zang, Zhenxing Xu, Weishen Pan, Suraj Rajendran, Yong Chen, Fei Wang (2025) npj Digital Medicine 8, 387 (2025).	[PDF]
J20	Multicenter target trial emulation to evaluate corticosteroids for sepsis stratified by predicted organ dysfunction trajectory. Suraj Rajendran, Zhenxing Xu, Weishen Pan, Chengxi Zang, Ilias Siempos, Lisa Torres, Jie Xu, Jiang Bian, Edward J. Schenck & Fei Wang Nature Communications 16, Article number: 4450 (2025)	[PDF]
J19	Long COVID after SARS-CoV-2 during pregnancy in the United States. Chengxi Zang, Daniel Guth, Ann M. Bruno, Zhenxing Xu, Haoyang Li, Nariman Ammar, Robert Chew, Nick Guthe, Emily Hadley, Rainu Kaushal, Tanzy Love, Brenda M. McGrath, Rena C. Patel, Elizabeth C. Seibert, Yalini Senathirajah, Sharad Kumar Singh, Fei Wang, Mark G. Weiner, Kenneth J. Wilkins, Yiye Zhang, Torri D. Metz, Elaine Hill, Thomas W. Carton, on behalf of the RECOVER PCORnet EHR Consortia, the RECOVER N3C EHR Consortia, and the RECOVER Pregnancy Consortia Nature Communications 16, Article number: 3005 (2025)	[PDF]
J18	Long-COVID incidence proportion in adults and children between 2020 and 2024. Hannah Mandel, Yun J Yoo, Andrea J Allen, Sajjad Abedian, Zoe Verzani, Elizabeth W Karlson, Lawrence C Kleinman, Praveen C Mudumbi, Carlos R Oliveira, Jennifer A Muszynski, Rachel S Gross, Thomas W Carton, C Kim, Emily Taylor, Heekyong Park, Jasmin Divers, J Daniel Kelly, Jonathan Arnold, MSE, Carol Reynolds Geary, Chengxi Zang, Kelan G Tantisira, Kyung E Rhee, Michael Koropsak, Sindhu Mohandas, Andrew Vasey, Abu Saleh Mohammad Mosa, Melissa Haendel, Christopher G Chute, Shawn N Murphy, Lisa O'Brien, Jacqueline Szmuszkovicz, Nicholas Guthe, Jorge L Santana, Aliva De, Amanda L Bogie, Katia C Halabi, Lathika Mohanraj, Patricia A Kinser, Samuel E Packard, Katherine R Tuttle, Kathryn Hirabayashi, Rainu Kaushal, Emily Pfaff, Mark G Weiner, Lorna E Thorpe, Richard A Moffitt on behalf of the RECOVER EHR Cohort Clinical Infectious Diseases (2025)	[PDF]
J17	A network-based systems genetics framework identifies pathobiology and drug repurposing in Parkinson’s disease. Lijun Dou, Zhenxing Xu, Jielin Xu, Chengxi Zang, Chang Su, Andrew A. Pieper, James B. Leverenz, Fei Wang, Xiongwei Zhu, Jeffrey Cummings & Feixiong Cheng npj Parkinson’s Disease, 2025	[PDF]
J16	Single-microglia transcriptomic transition network-based prediction and real-world patient data validation identifies ketorolac as a repurposable drug for Alzheimer's disease. Jielin Xu, Wenqiang Song, Zhenxing Xu, Michael M. Danziger, Ehud Karavani, Chengxi Zang, Xin Chen, Yichen Li, Isabela M Rivera Paz, Dhruv Gohel, Chang Su, Yadi Zhou, Yuan Hou, Yishai Shimoni, Andrew A. Pieper, Jianying Hu, Fei Wang, Michal Rosen-Zvi, James B. Leverenz, Jeffrey Cummings, Feixiong Cheng Alzheimer's & Dementia: The Journal of the Alzheimer's Association. 2024	[PDF]
J15	Accuracy and transportability of machine learning models for adolescent suicide prediction with longitudinal clinical records Chengxi Zang, Yu Hou, Daoming Lyu, Jun Jin, Shane Sacco, Kun Chen, Robert Aseltine & Fei Wang Translational Psychiatry (2024).	[PDF]
J14	Identification of risk factors of Long COVID and predictive modeling in the RECOVER EHR cohorts Chengxi Zang, Yu Hou, Edward J. Schenck, Zhenxing Xu, Yongkang Zhang, Jie Xu, Jiang Bian, Dmitry Morozyuk, Dhruv Khullar, Anna S. Nordvig, Elizabeth A. Shenkman, Russell L. Rothman, Jason P. Block, Kristin Lyman, Yiye Zhang, Jay Varma, Mark G. Weiner, Thomas W. Carton, Fei Wang & Rainu Kaushal Communications Medicine (2024).	[PDF]
J13	Association between acquiring SARS-CoV-2 during pregnancy and post-acute sequelae of SARS-CoV-2 infection: RECOVER electronic health record cohort analysis Ann M. Bruno, Chengxi Zang, Zhenxing Xu, Fei Wang, Mark G. Weiner, Nick Guthe, Megan Fitzgerald, Rainu Kaushal, Thomas W. Carton, Torri D. Metz, RECOVER EHR Cohort, the RECOVER Pregnancy Cohort eClinicalMedicine (2024).	[PDF]
J12	Emerging opportunities of using large language models for translation between drug molecules and indications David Oniani, Jordan Hilsman, Chengxi Zang, Junmei Wang, Lianjin Cai, Jan Zawala, Yanshan Wang Scientific Reports (2024).	[PDF]
J11	High-Throughput Target Trial Emulation for Alzheimer’s Disease Drug Repurposing with Real-World Data Chengxi Zang, Hao Zhang, Jie Xu, Hansi Zhang, Sajjad Fouladvand, Shreyas Havaldar, Feixiong Cheng, Kun Chen, Yong Chen, Benjamin Glicksberg, Jin Chen, and Jiang Bian, Fei Wang Nature Communications (2023).	[PDF]
J10	Comparing the effects of four common drug classes on the progression of mild cognitive impairment to dementia using electronic health records Jie Xu, Fei Wang, Chengxi Zang, Hao Zhang, Kellyann Niotis, Ava L. Liberman, Cynthia M. Stonnington, Makoto Ishii, Prakash Adekkanattu, Yuan Luo, Chengsheng Mao, Luke V. Rasmussen, Zhenxing Xu, Pascal Brandt, Jennifer A. Pacheco, Yifan Peng, Guoqian Jiang, Richard Isaacson & Jyotishman Pathak Scientific Reports (2023).	[PDF]
J9	Data-driven analysis to understand long COVID using electronic health records from the RECOVER initiative Chengxi Zang, Yongkang Zhang, Jie Xu, Jiang Bian, Dmitry Morozyuk, Edward J. Schenck, Dhruv Khullar, Anna S. Nordvig, Elizabeth A. Shenkman, Russell L. Rothman, Jason P. Block, Kristin Lyman, Mark G. Weiner, Thomas W. Carton, Fei Wang, Rainu Kaushal Nature Communications (2023).	[PDF]
J8	Data-driven identification of post-acute SARS-CoV-2 infection subphenotypes Hao Zhang, Chengxi Zang, Zhenxing Xu, Yongkang Zhang, Jie Xu, Jiang Bian, Dmitry Morozyuk, Dhruv Khullar, Yiye Zhang, Anna Nordvig, Edward Schenck, Elizabeth Shenkman, Russel Rothman, Jason Block, Kristin Lyman, Mark Weiner, Thomas Carton, Fei Wang, Rainu Kaushal Nature Medicine (2023).	[PDF]
J7	Racial/Ethnic Disparities in Post-acute Sequelae of SARS-CoV-2 Infection in New York: an EHR-Based Cohort Study from the RECOVER Program Dhruv Khullar, Yongkang Zhang, Chengxi Zang, Zhenxing Xu, Fei Wang, Mark G. Weiner, Thomas W. Carton, Russell L. Rothman, Jason P. Block, Rainu Kaushal Journal of General Internal Medicine (2023).	[PDF]
J6	Identifying environmental risk factors for post-acute sequelae of SARS-CoV-2 infection: An EHR-based cohort study from the recover program Yongkang Zhang, Hui Hu, Vasilios Fokaidis, Colby Lewis V, Jie Xu, Chengxi Zang, Zhenxing Xu, Fei Wang, Michael Koropsak, Jiang Bian, Jaclyn Hall, Russell L. Rothman, Elizabeth A. Shenkman, Wei-Qi Wei, Mark G. Weiner, Thomas W. Carton, Rainu Kaushal Environmental Advances (2023).	[PDF]
J5	Building the Model: Challenges and Considerations of Developing and Implementing Machine Learning Tools for Clinical Laboratory Medicine Practice Yang, He S., Daniel D. Rhoads, Jorge Sepulveda, Chengxi Zang, Amy Chadburn, and Fei Wang. Archives of Pathology & Laboratory Medicine (2022).	[PDF]
J4	Development of a screening algorithm for borderline personality disorder using electronic health records Chengxi Zang, Marianne Goodman, Zheng Zhu, Lulu Yang, Ziwei Yin, Zsuzsanna Tamas, Vikas Mohan Sharma, Fei Wang, and Nan Shao Scientific Reports 12, no. 1 (2022): 1-12.	[PDF]
J3	Contrastive Learning Improves Critical Event Prediction in COVID-19 Patients Tingyi Wanyan, Hossein Honarvar, Suraj K. Jaladanki, Chengxi Zang, Nidhi Naik, Sulaiman Somani, Jessica K. De Freitas, Ishan Paranjpe, Akhil Vaid, Jing Zhang, Riccardo Miotto, Zhangyang Wang, Girish N. Nadkarni, Marinka Zitnik, Ariful Azad, Fei Wang, Ying Ding, Benjamin S. Glicksberg Cell Patterns, 2021.	[PDF]
J2	Exploring the collective human behavior in cascading systems: a comprehensive framework Yunfei Lu, Linyun Yu, Tianyang Zhang, Chengxi Zang, Peng Cui, Chaoming Song & Wenwu Zhu. Knowledge and Information Systems (KAIS), 2020.	[PDF]
J1	On Power Law Growth of Social Networks. Chengxi Zang, Peng Cui, Christos Faloutsos, Wenwu Zhu. IEEE Transactions on Knowledge and Data Engineering (TKDE), 2018	[PDF] [Code]

Conference Papers

C13	SCEHR: Supervised Contrastive Learning for Clinical Risk Prediction using Electronic Health Records Chengxi Zang and Fei Wang. 2021 IEEE International Conference on Data Mining (ICDM'21). (Acceptance Rate 98/990 =9.9%, Regular Paper)	[PDF] [Code]
C12	MoFlow: An Invertible Flow Model for Generating Molecular Graphs Chengxi Zang and Fei Wang. KDD '20: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (Accept Rate 216/1279 = 16.89%, Full Paper)	[PDF] [Slides] [YouTube] [Code] [Demo!]
C11	Neural Dynamics on Complex Networks Chengxi Zang and Fei Wang. KDD '20: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (Accept Rate 216/1279 = 16.89%, Full Paper)	[PDF] [Slides] [YouTube] [Code]
C10	Recent Advances on Graph Analytics and Its Applications in Healthcare Fei Wang, Peng Cui, Jian Pei, Yangqiu Song, Chengxi Zang KDD '20: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining	[PDF] [Webpage]
C9	Dynamical Origins of Distribution Functions Chengxi Zang, Peng Cui, Wenwu Zhu, and Fei Wang. KDD '19: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (Accept Rate (110 oral + 60 poster)/1200 = 14.2%, Oral)	[PDF]
C8	Uncovering Pattern Formation of Information Flow. Chengxi Zang, Peng Cui, Chaoming Song, Wenwu Zhu, and Fei Wang. KDD '19: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (Accept Rate (110 oral + 60 poster)/1200 = 14.2%, Poster)	[PDF]
C7	Fates of Microscopic Social Ecosystems: Keep Alive or Dead? Haoyang Li, Peng Cui, Chengxi Zang, Tianyang Zhang, Wenwu Zhu, and Yishi Lin. KDD '19: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (Accept Rate (110 oral + 60 poster)/1200 = 14.2%, Oral)	[PDF]
C6	Uncovering the Co-driven Mechanism of Social and Content Links in User Churn Phenomena Yunfei Lu, Lingyun Yu, Peng Cui, Chengxi Zang, Renzhe Xu, Yihao Liu, Lei Li, and Wenwu Zhu. KDD '19: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (Accept Rate (45 oral + 100 poster)/700 = 20.7%, Poster, Applied Data Science Track)	[PDF]
C5	Learning and Interpreting Complex Distributions in Empirical Data Chengxi Zang, Peng Cui, Wenwu Zhu. KDD '18: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (Accept Rate 181/983 = 18.4%, Oral)	[PDF] [PPT]
C4	Collective Human Behavior in Cascading System: Discovery, Modeling and Applications. Yunfei Lu, Linyun Yu, Tianyang Zhang, Chengxi Zang, Peng Cui, Chaoming Song, and Wenwu Zhu. 2018 IEEE International Conference on Data Mining (ICDM'18). (Acceptance Rate 84/948 =8.86%, Full Paper, Best Paper Candidate)	[PDF]
C3	Long Short Memory Process: Modeling Growth Dynamics of Microscopic Social Connectivity. Chengxi Zang, Peng Cui, Christos Faloutsos, Wenwu Zhu. KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (Accept Rate 131/748 = 17.5%, Oral)	[PDF] [PPT] [Poster] [Code] [Video]
C2	Quantifying Structural Patterns of Information Cascades. Chengxi Zang, Peng Cui, Chaoming Song, Christos Faloutsos, Wenwu Zhu. International World Wide Web Conference (WWW), Poster, 2017. (Accept Rate 67/166 = 40.4%)	[PDF] [PPT] [Poster]
C1	Beyond Sigmoids: the NetTide Model for Social Network Growth, and its Applications. Chengxi Zang, Peng Cui, Christos Faloutsos. KDD '16: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (Accept Rate 142/784 = 18.1%)	[PDF] [PPT] [Poster] [Code]

Workshop Papers & Others

W6	Visualizing Deep Graph Generative Models for Drug Discovery Karan Yang, Chengxi Zang, Fei Wang 2020 KDD Workshop on Applied Data Science for Healthcare	[PDF] [Webpage]
W5	MoFlow: An Invertible Flow Model for Molecular Graph Generation Chengxi Zang, Fei Wang 2020 ICML Workshop on Invertible Neural Networks, Normalizing Flows, and Explicit Likelihood Models (INNF+2020)	[PDF] [Webpage]
W4	Neural Dynamics on Complex Networks Chengxi Zang and Fei Wang. 2020 AAAI Deep Learning on Graphs: Methodologies and Applications (DLGMA’20) workshop (Best Paper Award)	[PDF] [Webpage]
W3	Causation-Driven Visualizations for Insurance Recommendation Zhixiu Liu, Chengxi Zang, Kun Kuang, Hao Zou, Hu Zheng, and Peng Cui IEEE International Conference on Multimedia & Expo (ICME), 2019, the Cross-media Analysis for Semantic Knowledge Reasoning and Transfer Workshop.	[PDF]
W2	Modeling the Dynamics of WeChat Social System. Chengxi Zang, Peng Cui, Linyun Yu, Tianyang Zhang, Wenwu Zhu, Hao Ye. Draft	[PDF]
W1	Structural Patterns of Information Cascades and their Implications for Dynamics and Semantics. Chengxi Zang, Peng Cui, Chaoming Song, Christos Faloutsos, Wenwu Zhu. arXiv Draft	[PDF]

Ph.D. Dissertation

D1	Data-Driven Dynamical Modeling of Complex Social Systems (复杂社交系统的数据驱动动力学建模研究) Chengxi Zang Defense Committee Members: Prof. Xindong Wu, Prof. Jianyong Wang, Prof. Jie Tang, Prof. Huawei Shen, Prof. Dan Li, and Prof. Wenwu Zhu	[PDF] [Webpage]

Tutorials & Talks

Tutorials

Generating Real-World Evidence with Real-World Data and Machine Learning, Chengxi Zang and Fei Wang, IEEE ICHI 2024, Orlando, Florida, USA, Tuesday, June 4, 8:30 AM - 12:00 PM @234 Second floor, [Webpage]
Mining Electronic Health Records for Real-World Evidence, Chengxi Zang, Weishen Pan, Fei Wang, KDD 2023, Tuesday, August 8th, 10:00 AM – 1:00 PM PDT, Room 202A, Long Beach Convention & Entertainment Center, [Webpage], [PDF]
Recent Advances on Graph Analytics and Its Applications in Healthcare, Fei Wang (Cornell University); Peng Cui (Tsinghua University); Jian Pei (Simon Fraser University); Yangqiu Song (Hong Kong University of Science and Technology); Chengxi Zang (Cornell University). KDD 2020 Tutorial on August 23rd, [Webpage], [PDF]
Differential Deep Learning on Graphs and its Applications, Chengxi Zang (Cornell University) and Fei Wang (Cornell University). AAAI 2020 Tutorial on February 7, 2020, New York City, NY. [Webpage] [PDF]

Invited Talks

Long COVID after SARS-CoV-2 during pregnancy, NIH RECOVER Research Review (R3) Seminar Series, (6/10/2025)
Prioritization of Open Innovation Tasks-Model Architecture, AI for Drug Discovery Open Innovation Forum (Boston University, IBM), (10/30/2024)
Generating Real-World Evidence with Real-World Data and Machine Learning, Columbia University Mailman School of Public Health (8/22/2024)
SARS-CoV-2 infection during pregnancy and development of Long COVID, NIH RECOVER Research Review (R3) Seminar Series, (5/14/2024)
High-Throughput Target Trial Emulation with Real-World Data for Alzheimer’s Disease Drug Repurposing and Beyond, Pharmaceutical Data Science (PharmaDS) Conference 2024 (3/18/2024)
High-Throughput Clinical Trial Emulation with Real-World Data and Machine Learning: A Case Study of Drug Repurposing for Alzheimer's Disease, Department of Computer Science, Emory University, (12/21/2021)
Drug repurposing driven by emulating trials on real-world data and causal AI: a case of Alzheimer's disease, Department of Statistics, University of Connecticut, (10/27/2021)
MoFlow: An Invertible Flow Model for Generating Molecular Graphs, ACM Conference on Knowledge Discovery and Data Mining (KDD). Speaker/Virtual. (8/27/2020)
Neural Dynamics on Complex Networks, ACM Conference on Knowledge Discovery and Data Mining (KDD). Speaker/Virtual. (8/24/2020)
Modeling the Dynamics of WeChat Social System, Notre Dame, Computer Science Department, (4/12/2018)
Modeling the Dynamics of WeChat Social System, Harvard Medical School, Channing Network Science Seminar, (3/15/2018)

Media, News & Tweets

Selected Media Coverage

April 2025. Our Nature Communications paper Long COVID after SARS-CoV-2 during pregnancy in the United States was highlighted by Weill Cornell Medicine News (Pregnancy May Reduce Long COVID Risk?), Cornell Chronicle, News Medical, EurekAlert! (AAAS), Contemporary OB/GYN, Neuroscience News, Drug Discovery & Development, Drug Topics, Technology Networks, HealthLeaders, HealthDay, The Healthy, etc.
March 05, 2025. Our research article "Emerging opportunities of using large language models for translation between drug molecules and indications" received 7,823 article downloads in 2024, placing it as one of the Top 25 downloaded Chemistry papers published in Scientific Reports last year. The journal published over 2,325 Chemistry papers in 2024 — Our science is of real value to the research community!
August 12, 2024. Our Communications Medicine paper Identification of risk factors of Long COVID and predictive modeling in the RECOVER EHR cohorts was highlighted by the Weill Cornell Medicine News titled Identification of risk factors of Long COVID and predictive modeling in the RECOVER EHR cohorts.

March 21, 2024. Our Nature Communications paper High-throughput target trial emulation for Alzheimer’s disease drug repurposing with real-world data was highlighted by the AMIA (American Medical Informatics Association) at AMIA 2024 Informatics Summit as the AMIA informatics year-in-review!
May 2023. Our Nature Communications paper Data-driven analysis to understand long COVID using electronic health records from the RECOVER initiative was highlighted in Cornell Chronicle: Long COVID risk and symptoms vary across populations and in Weill Cornell Medicine Newsroom: Study Discovers Long COVID Risk and Symptoms Vary in Different Populations.
May 2023. Our KDD molecular generative AI paper MoFlow: An Invertible Flow Model for Generating Molecular Graphs was highlighted in Weill Cornell Medicine Population Health Sciences News.
March 21st, 2023. Our molecule generative AI model, MoFlow, was highlighted by NVIDIA CEO Jensen Huang at the NVIDIA GTC 2023 Keynote and is being integrated into the NVIDIA BioNeMo Service for AI-driven drug discovery! See the exciting moment and inspiring introduction at 48:00 on YouTube: GTC 2023 Keynote with NVIDIA CEO Jensen Huang. Also refer to the NVIDIA Developer Technical Blog: Build Generative AI Pipelines for Drug Discovery with NVIDIA BioNeMo Service for more details.
March 2023. Our Risk Factors and Predictive Modeling for Long Covid paper was highlighted in News Medical: What are the risk factors associated with post-acute SARS-CoV-2 infection?
March 2023. Our Racial/Ethnic Disparities in Long Covid paper was highlighted in BMJ News: Covid-19: US studies show racial and ethnic disparities in long covid.
March 2023. Our Environmental risk factors for Long Covid paper was highlighted in NIH Director's Blog: RECOVER: What Clinical Research Comes Next for Helping People with Long COVID.
February 2023. Our Racial/Ethnic Disparities in Long Covid paper was highlighted in NIH News Releases. NIH RECOVER research identifies potential long COVID disparities.
February 2023. Our Racial/Ethnic Disparities in Long Covid paper was highlighted in Cornell Chronicle and Weill Cornell Medicine Newsroom: Long COVID Symptoms Vary Among Racial and Ethnic Groups; Cancer Health: RECOVER Research Identifies Potential Long COVID Disparities; and BET: Black, Hispanic Patients More Likely To Develop Lasting Symptoms After COVID.
February 2023. Our Nature Medicine paper Long Covid subphenotyping was highlighted in NIH - News and Stories: Researchers Identify Four Long COVID Categories.
January 2023. Our Nature Medicine Long Covid subphenotyping paper was highlighted in Cornell Chronicle: Study identifies four major subtypes of long COVID; CN-HEALTHCARE 健康界: Nat Med: Study of data from nearly 35,000 COVID-19 patients identifies four major symptom patterns of long COVID; Medical Xpress: Study identifies four major subtypes of long COVID; Verywellhealth: Long COVID May Manifest Itself in 4 Major Ways, Research Shows; Prevention: Study Finds There Are 4 Subtypes of Long COVID; New Atlas: Four distinct subtypes of long COVID defined in machine learning study; Miami Herald: There are 4 ‘major’ types of long COVID symptoms, study finds. How likely is each?; and BOSTON.com: New study categorizes long COVID symptoms, allowing for earlier detection, “They don’t have to suffer in silence.”
January 2023. Our Long Covid subphenotyping paper was highlighted in Nature Medicine - Research Briefing: Machine learning identifies long COVID patterns from electronic health records.
December 2022. Our Long Covid subphenotyping paper was highlighted in Weill Cornell Medicine Newsroom. Study Identifies Four Major Subtypes of Long COVID, and MedPage Today. Are Subphenotypes for Long COVID Beneficial? — A new study can help physicians evaluate potential treatment approaches.
June 2022. Our Long Covid subphenotyping paper was highlighted in News Medical. Machine learning analysis suggests that there are four sub-phenotypes of long COVID.
June 2022. Our Long Covid subphenotyping paper was highlighted in Fortune. Long COVID symptoms: What we know—and don’t know—about the mysterious illness that could affect up to 80% of COVID survivors
May 2022. Our Long Covid analysis paper was highlighted in News Medical. Largest study to date on long COVID identifies a broad list of diagnoses.

My Tweets

Tweets by calvin_zcx

News

Upcoming KDD 2020 events of mine:

Our KDD 2023 tutorial "Mining Electronic Health Records for Real-World Evidence" is happening on Tuesday, August 8th, 10:00 AM – 1:00 PM PDT, Room 202A, Long Beach Convention & Entertainment Center. Click and find the materials and location!
Our tutorial "Recent Advances on Graph Analytics and Its Applications in Healthcare" will start on August 23rd, 8:00 AM PDT (11:00 AM EDT). Click and find the materials and virtual location!
A spotlight presentation of our system on Visualizing Deep Graph Generative Models for Drug Discovery in KDD 2020 Workshop on Applied Data Science for Healthcare on August 24, 2020!
An oral presentation of our full research paper MoFlow: An Invertible Flow Model for Generating Molecular Graphs will start on August 25th, 4:00 PM PDT (7:00 PM EDT)!
Another oral presentation of our full research paper Neural Dynamics on Complex Networks will start on August 27th, 10:00 AM PDT (1:00 PM EDT)!

Two workshop papers on drug discovery have been accepted by the ICML 2020 Workshop on Invertible Neural Networks, Normalizing Flows, and Explicit Likelihood Models and the KDD 2020 Workshop on Applied Data Science for Healthcare, respectively. See our presentations there.

Two papers have been accepted by KDD 2020 as full papers! One is on continuous-time Graph Neural Networks (GNN), and the other is on de novo drug design with a flow-based generative GNN model. 2020-05-15

Our tutorial "Recent Advances on Graph Analytics and Its Applications in Healthcare" has been accepted by KDD 2020, curated and presented by: Fei Wang (Cornell University); Peng Cui (Tsinghua University); Jian Pei (Simon Fraser University); Yangqiu Song (Hong Kong University of Science and Technology); Chengxi Zang (Cornell University). 2020-06-01

Our paper Neural Dynamics on Complex Networks won the Best Paper Award at the AAAI 2020 Deep Learning on Graphs: Methodologies and Applications (DLGMA’20) workshop (February 8, 2020, New York City, NY).

My tutorial on Differential Deep Learning on Graphs and its Applications was held at AAAI-20 (February 7, 2020, New York City, NY).