Cleveland Clinic Lerner Research Institute Logo
Cleveland Clinic Lerner Research Institute Logo
  • About
  • Science
    • Laboratories
    • Office of Research Development
    • Clinical & Translational Research
      Participating in Research
    • Departments
      Biomedical Engineering Cancer Biology Cardiovascular & Metabolic Sciences Florida Research & Innovation Center Genomic Medicine Immunotherapy & Precision Immuno-Oncology
      Infection Biology Inflammation & Immunity Neurosciences Ophthalmic Research Quantitative Health Sciences Translational Hematology & Oncology Research
    • Centers & Programs
      Advanced Musculoskeletal Imaging Angiogenesis Center Cardiovascular Diagnostics & Prevention Computational Life Sciences Consortium for Pain Genitourinary Malignancies Research Genome Center
      Microbiome & Human Health Musculoskeletal Research Northern Ohio Alcohol Center Pathogen & Human Health Research Populations Health Research Quantitative Metabolic Research Therapeutics Discovery
  • Core Services
    • Ohio
      3D Printing Bioimage AnalysisBioRobotics & Mechanical Testing Cell Culture Cleveland Clinic BioRepository Computational Oncology Platform Computing Services Discovery Lab Electron Microscopy Electronics Engineering
      Flow CytometryGenomic Medicine Institute Biorepository Genomics Glassware Histology Hybridoma Immunohistochemistry Immunomonitoring Lab Instrument Refurbishing & Repair Laboratory Diagnostic
      Lerner Research Institute BioRepository Light MicroscopyMechanical Prototyping Microbial Culturing & Engineering Microbial Sequencing & Analytics Resources Media Preparation Molecular Biotechnology Nitinol Polymer Proteomics & Metabolomics Therapeutics Discovery
    • Florida
      Bioinformatics
      Flow Cytometry
      Imaging
  • Education & Training
    • Graduate Programs Molecular Medicine PhD Program Postdoctoral Program
      Research Intensive Summer Experience (RISE) Undergraduate & High School Programs
  • News
  • Careers
    • Faculty Positions Research Associate & Project Staff Postdoctoral Positions Technical & Administrative Engagement & Belonging
  • Donate
  • Contact
  • About
  • Science
    • Scientific Programs
    • Laboratories
    • Office of Research Development
    • Clinical & Translational Research
      Participating in Research
    • Departments
      Biomedical Engineering Cancer Biology Cardiovascular & Metabolic Sciences Florida Research & Innovation Center Genomic Medicine Immunotherapy & Precision Immuno-Oncology
      Infection Biology Inflammation & Immunity Neurosciences Ophthalmic Research Quantitative Health Sciences Translational Hematology & Oncology Research
    • Centers & Programs
      Advanced Musculoskeletal Imaging Angiogenesis Center Cardiovascular Diagnostics & Prevention Computational Life Sciences Consortium for Pain Genitourinary Malignancies Research Genome Center
      Microbiome & Human Health Musculoskeletal Research Northern Ohio Alcohol Center Pathogen & Human Health Research Populations Health Research Quantitative Metabolic Research Therapeutics Discovery
  • Core Services
    • All Cores
    • Ohio
      3D Printing Bioimage Analysis BioRobotics & Mechanical Testing Cell Culture Cleveland Clinic BioRepository Computational Oncology Platform Computing Services Discovery Lab Electron Microscopy Electronics Engineering >
      Flow CytometryGenomic Medicine Institute BiorepositoryGenomics Glassware Histology Hybridoma Immunohistochemistry Immunomonitoring Lab Instrument Refurbishing & Repair Laboratory Diagnostic
      Lerner Research Institute BioRepository Light MicroscopyMechanical Prototyping Microbial Culturing & Engineering Microbial Sequencing & Analytics Resources Media Preparation Molecular Biotechnology Nitinol Polymer Proteomics & Metabolomics Therapeutics Discovery
    • Florida
      Bioinformatics
      Flow Cytometry
      Imaging
  • Education & Training
    • Research Education & Training Center
    • Graduate Programs Molecular Medicine PhD Program Postdoctoral Program
      Research Intensive Summer Experience (RISE) Undergraduate & High School Programs
  • News
  • Careers
    • Faculty Positions Research Associate & Project Staff Postdoctoral Positions Technical & AdministrativeEngagement & Belonging
  • Donate
  • Contact
  • Search

Research News

❮News Cleveland Clinic successfully applies unsupervised machine learning for patient subtyping

01/13/2025

Cleveland Clinic successfully applies unsupervised machine learning for patient subtyping

Researchers investigate machine learning models’ ability to analyze electronic health records for patterns in disease development.

An abstract pattern of colorful dots against a black background, that come together to form a right angle

Researchers from Cleveland Clinic’s Center for Quantitative Metabolic Research are exploring how unsupervised machine learning (ML) algorithms can analyze electronic health records (EHR) gathered across a lifetime. Findings published in PLOS Digital Health show how unsupervised ML clustering methods using this data – otherwise known as longitudinal EHR – can support effective clinical research. 

Unsupervised longitudinal ML can identify groups within massive amounts of data based on clinical similarities, which makes it useful for identifying patterns that otherwise may have gone unnoticed using other ML approaches. These methods are particularly useful for analyzing patient data over time, since certain factors like BMI or blood pressure may change, this information helps project future risks. For clinical trials, unsupervised ML also has the potential to identify why a certain drug may not have worked for a group of patients – and who might respond.  

“Identifying patient subtypes is important because it can help us understand why disease affects different patients in different ways,” says Daniel Rotroff, PhD, Director of the Center for Quantitative Metabolic Research. “By identifying clinical similarities among patients, researchers can better predict which patients are most likely to respond to a certain drug or treatment.” 

Why are longitudinal EHR important?  

Longitudinal EHR contains patient health information across their lifetime. These records include demographics, lab results, vital signs, medications and other information.  

By analyzing patients’ EHR, researchers have an opportunity to identify patterns and insights of disease progression from real-world cases and use that information for future patients. This can help researchers identify subtypes of patients that have the same disease and are also similar due to genetics, preexisting conditions and other clinical factors.  

Improving patient selection for clinical trials  

There are many factors that can influence if a drug will be effective including genetics, sex and BMI.  

“One of the primary determinants for whether a medication will be used to treat patients is success in clinical trials,” says Arshiya Mariam, PhD, a postdoctoral fellow and lead author of the study. “However, without a clear understanding of which patients are likely to respond, the trial may be unsuccessful, even if the drug works well in a subset of patients. Tools for identifying clinically relevant patient subtypes may help inform whether a drug should go to market and which patients are best suited for it.” 

Before a clinical trial, it can be difficult to determine what factors are going to have the greatest impact on a drug’s effectiveness. This can lead researchers to select a patient group with factors that make a drug appear ineffective – even though it may be able to help a different group of patients.  

Using simulated data to replicate real-world scenarios 

The team first tested their unsupervised ML algorithms on a large, simulated dataset before evaluating its use on data from more than 43,000 pediatric patients who all had different BMIs and metabolic health statuses (e.g. healthy, diabetes). Patients’ medical records can have gaps depending on provider history and what information is recorded, the use of simulated data helps to understand which methods handle these types of real-world challenges the best.  

One of the models that performed the best on the simulated information was used to classify patients at higher risk of pediatric metabolic syndrome, which is when a patient has a group of conditions that elevates risk for diseases like type 2 diabetes. The algorithm identified five different groups based on how a patient’s body mass index (BMI) increases or decreases over time, and used this to identify which patients are at risk of future disease. 

“Our findings demonstrate the algorithms’ ability to successfully group patients based on clinical factors that can predict disease development,” says Dr. Rotroff. “This information sets the stage for future tools to help clinicians determine exactly which patients are best suited for clinical trials, drugs and other treatments.”  

 

Featured Experts
Daniel  Rotroff Headshot
Daniel
Rotroff, PhD
News Category
Emerging technology
Related News
New Analysis of Previous Trial Results Offers Insights into Personalized Care for Type 2 DiabetesResearchers Embark on New Study to Identify Predictors of Chemotherapy-Associated Pain ConditionThis lab advances biomarker testing by collaborating with clinicians on tricky conditions

Research areas

Quantitative Health SciencesCenter for Quantitative Metabolic Research

Want To Support Ground-Breaking Research at Cleveland Clinic?

Discover how you can help Cleveland Clinic save lives and continue to lead the transformation of healthcare.

Give to Cleveland Clinic

Subscribe to get the latest research news in your inbox.

About Lerner

About Us Careers Contact Us Donate People Directory

Science

Clinical & Translational Research Core Services Departments, Centers & Programs Laboratories Research News

Education & Training

Graduate Programs Molecular Medicine PhD Program Postdoctoral Program RISE Program Undergraduate & High School Programs

Site Information & Policies

Search Site Site Map Privacy Policy Social Media Policy

9500 Euclid Avenue, Cleveland, Ohio 44195 | © 2025 Lerner Research Institute