My name is Praveen Kumar, and I currently work as a Research Assistant Professor at the School of Medicine, University of New Mexico (UNM).
I hold a Bachelor’s degree in Computer Engineering from the National Institute of Technology (NIT) Surat, and both a Master’s and Ph.D. in Computer Science from the University of New Mexico (UNM), Albuquerque, USA. Following my undergraduate studies, I spent 12 years in the IT industry, working across the banking, insurance, and travel sectors in roles such as Software Engineer, Systems Analyst, and Associate Project Manager. However, my passion for artificial intelligence (AI) and machine learning (ML) led me to transition into academia. I returned to graduate school to pursue advanced studies, culminating in a Ph.D.
My research expertise spans health informatics and cheminformatics, with a focus on developing AI/ML algorithms to extract insights from complex, noisy, and high-dimensional datasets. This includes working with patient health records, chemical compound data, and biomedical knowledge graphs. In health informatics, I develop methods to identify mental health conditions and co-occurring disorders from EHR and claims data. In cheminformatics, my work centers on discovering potential therapeutic compounds by analyzing chemical datasets. I also work on biomedical knowledge graphs to infer associations between biological entities such as genes and diseases. Recently, I have been developing ML-based risk models for the early detection of cardiometabolic diseases.
Beyond my academic research, I have experience in teaching and mentoring, primarily gained during my time in the IT industry, where I was responsible for training new team members in both technical skills and domain-specific knowledge. I also hold certifications in Equity Derivatives and Mutual Funds from the National Stock Exchange (NSE) of India.
Research Publications
Journal Articles
- Kumar, P., Metzger, V. T., & Malec, S. A. (2025). Unsupervised Latent Pattern Analysis for Estimating Type 2 Diabetes Risk in Undiagnosed Populations. ArXiv. https://arxiv.org/abs/2505.21824
- Kumar P, Metzger VT, Purushotham ST, Kedia P, Bologa CG, Lambert CG, Yang JJ. KG2ML: Integrating Knowledge Graphs and Positive Unlabeled Learning for Identifying Disease-Associated Genes. medRxiv. 2025:2025-03.
- P. Kumar and C. G. Lambert, “Positive Unlabeled Learning Selected Not At Random (PULSNAR): Class proportion estimation without the selected completely at random assumption”, PeerJ Computer Science, 2024.
- P. Kumar, F. Moomtaheen, S. A. Malec, J. J. Yang, C. G. Bologa, K. A. Schneider, Y. Zhu, M. Tohen,G. Villarreal, D. J. Perkins, E. M. Fielstein, S. E. Davis, M. E. Matheny, and C. G. Lambert, “Detecting opioid use disorder in health claims data with positive unlabeled learning”, IEEE JBHI, 2024.
- M. Ranjbar, J. J. Yang, P. Kumar, D. R. Byrd, E. L. Bearer, and T. I. Oprea, “Autophagy dark genes: Can we find them with machine learning?”, Natural Sciences, e20220067, 2023.
- J. E. Evangelista, D. J. Clarke, Z. Xie, G. B. Marino, V. Utti, S. L. Jenkins, T. M. Ahooyi,C. G. Bologa, J. J. Yang, J. L. Binder, P. Kumar, C. G. Lambert, J. S. Grethe, E. Wenger, D. Taylor,T. I. Oprea, B. d. Bono, and A. Ma’ayan, “Toxicology knowledge graph for structural birth defects”,Communications Medicine, vol. 3, no. 1, p. 98, 2023.
- L. Jarratt, J. Situ, R. D. King, E. Montanez Ramos, H. Groves, R. Ormesher, M. Cossé, A. Raboff,A. Mahajan, J. Thompson, et al., “A comprehensive covid-19 daily news and medical literature briefing to inform health care and policy in new mexico: Implementation study”, JMIR Medical Education, vol. 8, no. 1, e23845, 2022.
- J. Binder, O. Ursu, C. Bologa, S. Jiang, N. Maphis, S. Dadras, D. Chisholm, J. Weick, O. Myers, P. Kumar, et al., “Machine learning prediction and tau-based screening identifies potential alzheimer’s disease genes relevant to immunity”, Communications Biology, vol. 5, no. 1, pp. 1–15, 2022.
- A. Nestsiarovich, P. Kumar, N. R. Lauve, N. G. Hurwitz, A. J. Mazurie, D. C. Cannon, Y. Zhu, S. J. Nelson, A. S. Crisanti, B. Kerner, et al., “Using machine learning imputed outcomes to assess drug-dependent risk of self-harm in patients with bipolar disorder: A comparative effectiveness study”, JMIR mental health, vol. 8, no. 4, e24522, 2021.
- P. Kumar, A. Nestsiarovich, S. J. Nelson, B. Kerner, D. J. Perkins, and C. G. Lambert, “Imputation and characterization of uncoded self-harm in major mental illness using machine learning”, Journal of the American Medical Informatics Association, vol. 27, no. 1, pp. 136–146, 2020.
- G. Zahoranszky-Kohalmi, V. B. Siramshetty, P. Kumar, M. Gurumurthy, B. Grillo, B. Mathew, D. Metaxatos, M. Backus, T. Mierzwa, R. Simon, et al., “A workflow of integrated resources to catalyze network pharmacology driven covid-19 research”, Journal of Chemical Information and Modeling, vol. 62, no. 3, pp. 718–729, 2022.
- J. F. Cavanagh, P. Kumar, A. A. Mueller, S. P. Richardson, and A. Mueen, “Diminished EEG habituation to novel events effectively classifies parkinson’s patients”, Clinical Neurophysiology, vol. 129, no. 2, pp. 409–418, 2018.
Conference Posters
- P. Kumar, V. Metzger, S. Purushotham, P. Kedia, C. Bologa, C. G. Lambert, and J. Yang (2025), “KG2ML: Integrating Knowledge Graphs and Positive Unlabeled Learning for Identifying Disease-Associated Genes with Case Studies for 12 Diseases“, in Common Fund Data Ecosystem (CFDE) All-Hands Meeting, 2025.
- P. Kumar and V. T. Metzger (2024), “Predicting Type 2 Diabetes Risk: A Non-negative Matrix Factorization Approach for Feature Selection“, in IEEE BHI, 2024.
- P. Kumar, F. Moomtaheen, S. A. Malec, J. J. Yang, C. G. Bologa, K. A. Schneider, Y. Zhu, M. Tohen, G. Villarreal, D. J. Perkins, E. M. Fielstein, S. E. Davis, M. E. Matheny, and C. G. Lambert, “Quantifying the opioid use disorder crisis: Pulsnar finds nearly 3/4 undiagnosed”, in OHDSI Symposium, 2024.
- P. Kumar, V. Metzger, S. Purushotham, P. Kedia, C. G. Lambert, and J. Yang, “Illuminating the druggable genome (idg) scientific use cases powered by the cfde data distillery biomedical knowledge graph, integrating multiple common fund datasets”, in Common Fund Data Ecosystem (CFDE) All-Hands Meeting, 2024.
- P. Kumar and C. G. Lambert, “Improving the detection of behavioral health conditions through positive and unlabeled learning: Opioid use disorder”, in OHDSI Symposium, 2023.
- P. Kumar, J. Tsosie, and C. G. Lambert, “Improving the detection of behavioral health conditions through positive and unlabeled learning: Self-harm and opioid use disorder”, in UNM Brain and Behavioral Health, 2023.
- P. Kumar, S. E. Davis, M. E. Matheny, G. Villarreal, Y. Zhu, M. Tohen, D. J. Perkins, and C. G. Lambert, “Pulsnar: Positive unlabeled learning selected not at random–towards imputing undocumented conditions in ehrs and estimating their incidence”, in OHDSI Symposium, 2022.
- S. E. Davis, P. Kumar, N. R. Lauve, S. K. Parr, D. Park, M. E. Matheny, G. Villarreal, Y. Zhu, M. Tohen, G. Uhl, D. J. Perkins, and C. G. Lambert, “Disparities in coded and imputed post-traumatic stress disorder and self-harm among us veterans”, in AMIA, 2021.
- P. Kumar, N. R. Lauve, S. E. Davis, S. K. Parr, D. Park, M. E. Matheny, G. Villarreal, G. Uhl, Y. Zhu, and M. Tohen, “Detecting ptsd and self-harm among us veterans using positive unlabeled learning”, in OHDSI Symposium, 2021.
- P. Kumar, J. J. Yang, D. Byrd, O. Ursu, C. G. Bologa, S. L. Mathias, J. Berendzen, and T. I. Oprea, “Proteingraphml – predicting disease-to-protein associations from a biomedical knowledge graph”, in FASEB, 2021.
- A. Nestsiarovich, P. Kumar, N. R. Lauve, A. J. Mazurie, N. G. Hurwitz, D. C. Cannon, Y. Zhu, S. J. Nelson, A. S. Crisanti, B. Kerner, M. Tohen, D. J. Perkins, and C. G. Lambert, “Comparing drug-dependent risk of self-harm in bipolar disorder using machine learning imputed outcomes”, in OHDSI Symposium, 2020.
- P. Kumar, A. Nestsiarovich, S. J. Nelson, B. Kerner, D. J. Perkins, and C. G. Lambert, “Visit level machine learning imputation of uncoded self-harm in major mental illness and characterization of incidence of self-harm”, in OHDSI Symposium, 2019.
- P. Kumar, A. Nestsiarovich, A. J. Mazurie, N. G. Hurwitz, S. J. Nelson, and C. G. Lambert, “Visit level suicidality/self-harm phenotyping in bipolar disorder”, in OHDSI Symposium, 2017.
- P. Kumar, Amritansh, and C. G. Lambert, “Transforming the 2.33m-patient medicare synthetic public use files to the omop cdmv5: Etl-cms software and processed data available and feature-complete”, in OHDSI Symposium, 2016.
Conference Talks
- P. Kumar, V. Metzger, S. Purushotham, P. Kedia, C. Bologa, C. G. Lambert, and J. Yang (2025), “KG2ML: Integrating Knowledge Graphs and Positive Unlabeled Learning for Identifying Disease-Associated Genes with Case Studies for 12 Diseases”, in Common Fund Data Ecosystem (CFDE) All-Hands Meeting, 2025.
- P. Kumar, F. Moomtaheen, S. A. Malec, J. J. Yang, C. G. Bologa, K. A. Schneider, Y. Zhu, M. Tohen,G. Villarreal, D. J. Perkins, E. M. Fielstein, S. E. Davis, M. E. Matheny, and C. G. Lambert, “Detecting opioid use disorder in health claims data with positive unlabeled learning”. IEEE BHI, 2024.
- P. Kumar, V. Metzger, S. Purushotham, P. Kedia, C. G. Lambert, and J. Yang, “Illuminating the druggable genome (idg) scientific use cases powered by the cfde data distillery biomedical knowledge graph, integrating multiple common fund datasets”. Common Fund Data Ecosystem (CFDE) All-Hands Meeting, 2024.
- P. Kumar, A. Nestsiarovich, A. J. Mazurie, N. G. Hurwitz, S. J. Nelson, and C. G. Lambert, “Visit level suicidality/self-harm phenotyping in major mental illness”. AMIA, 2018.
Ph.D. Dissertation
- “Machine learning methods for computational phenotyping using patient healthcare data with noisy labels”, The University of New Mexico, Albuquerque, NM, Dec. 2022.
Colloquium Talk
- “Imputation and characterization of uncoded self-harm in major mental illness using machine learning” – Department of Computer Science, University of New Mexico. 16 September 2020
Peer Review Activities
- Reviewed one paper for the ACDSA 2025 conference (International Conference on Artificial Intelligence, Computer, Data Sciences and Applications) [May 2025]
- Reviewed one paper for the Elsevier Neurocomputing Journal [May 2025]
- Reviewed two papers for the American Medical Informatics Association (AMIA) 2025 Annual Symposium: Main [Apr 2025]
- Reviewed four podium abstracts for the American Medical Informatics Association (AMIA) 2025 Annual Symposium: Main [Apr 2025]
- Reviewed two posters for the American Medical Informatics Association (AMIA) 2025 Annual Symposium: Main [Apr 2025]
- Reviewed a manuscript for Dove Medical Press journal [Mar 2025]
- Reviewed two posters for the American Medical Informatics Association (AMIA) 2025 Clinical Informatics Conference: Main [Jan 2025]
- Reviewed an oral presentation for the American Medical Informatics Association (AMIA) 2025 Clinical Informatics Conference: Main [Jan 2025]
- Reviewed a manuscript for Dove Medical Press journal [Nov 2024]
- Reviewed a manuscript for the PLOS One journal [Dec 2024]
- Reviewed a manuscript for the International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME 2024)
- Reviewed a manuscript for the International Conference on Electrical and Computer Engineering Research (ICECER 2024)
- Reviewed a manuscript for the International Conference on Electrical, Computer and Energy Technologies (ICECET 2024)
Academic and Professional Services
- Served as an external examiner for a PhD proposal at Amrita Vishwa Vidyapeetham, providing evaluation and feedback on the proposed research methodology and objectives (Dec 20, 2024).
Research Experience
University | Role | Duration | Location |
University of New Mexico (Department of Internal Medicine) | Research Assistant Professor | Feb 2024 – Current | Albuquerque, USA |
University of New Mexico (Department of Internal Medicine) | Data Scientist II | Dec 2022 – Jan 2024 | Albuquerque, USA |
University of New Mexico (Department of Internal Medicine) | Graduate Research Assistant | Mar 2016 – Nov 2022 | Albuquerque, USA |
Industry Experience
Company | Role | Duration | Location |
Interglobe Inc. | System Analyst/Associate Project Manager | Jul 2011 – Aug 2015 | Denver, USA |
Interglobe Ltd. | Lead Software Engineer | May 2007 – Jul 2011 | Gurugram, India |
Fiserv Ltd. | System Analyst | Apr 2006 – May 2007 | Noida, India |
Computer Sciences Corporation Ltd. | Software Engineer | Dec 2004 – Mar 2006 | Noida, India |
Satyam Computer Services Ltd. | Software Engineer | Jul 2003 – Dec 2004 | Hyderabad/Pune, India |
Education
Degree | University | Year |
Ph.D. (Computer Science) | University of New Mexico, Albuquerque, NM, USA | 2022 |
M.S. (Computer Science) | University of New Mexico, Albuquerque, NM, USA | 2017 |
B.E. (Computer Engineering) | Sardar Vallabhbhai National Institute of Technology, Surat, Gujarat, India | 2003 |
Technical Skills
- Programming Languages: Python, R, MATLAB, PHP, C, SQL, CQL, HTML, JavaScript, CSS
- Databases: Neo4j, MySQL, and PostgreSQL
- Operating Systems: Windows, and Linux
- Web Servers: Apache, and Nginx
Scholarships, Certifications and Awards
- OHDSI Top Community Contributor Award at the 2021 OHDSI Symposium (2021)
- Best poster award at the 2016 OHDSI Symposium collaborator showcase (2016)
- Received Amigo Scholarship from the University of New Mexico (2015–2017)
- Completed the online course “Python Programming for Everybody” with Distinction from the University of Michigan School of Information
- Completed the online course “Introduction to Databases” (covering Relational Databases, including XML, JSON, OLAP, XPath, XQuery, and XSLT) with Distinction from Stanford University
- Completed the “Travelport Apollo for Travel Professionals” certification from Travelport
- Received the “I Went the Extra Mile” recognition from InterGlobe for achieving business goals and customer satisfaction
- Third prize in the national level open software contest held at the Indian Institute of Technology (IIT) Kharagpur for open-source C-based software “Hindi Notepad” (2002)
- Third prize in the national level open software contest held at the National Institute of Technology(NIT) Surat for open-source C-based software “Hindi Notepad” (2002)
- Presented a paper titled “Networks and Protocols” at the Technical Symposium held at the National Institute of Technology (NIT) Surat in 2001
Hobbies and Interests
Apart from my research pursuits, I have a wide range of interests. I am passionate about exploring the ancient history of world civilizations and uncovering the mysteries and accomplishments of past cultures. I also have a strong interest in finance and stock market investing, particularly in analyzing economic trends and market behavior. In my free time, I enjoy teaching mathematics to middle and high school students, sharing my enthusiasm for problem-solving and logical thinking.