How do I run a machine learning analysis on my data?

To run machine learning analyses in CMMI-DCC:

  1. Navigate to ML Pipelines: Click "Analysis" then "ML Pipelines" in the top navigation

  2. Create New Pipeline: Click "New ML Pipeline"

  3. Select Data:

    • Choose your data types (Metabolomics, Proteomics, CBC, etc.)
    • Select specific studies or cohorts
    • Choose which features (columns) to include

  4. Configure Analysis:

    • Select task type (Classification or Regression)
    • Choose target variable (what you want to predict)
    • Select algorithm (Random Forest, XGBoost, etc.)
    • Configure preprocessing options (scaling, feature selection, outlier detection)

  5. Submit Job: Review your configuration and click "Submit". The analysis runs in the background.

  6. View Results: Once the job completes, you can view model performance metrics, feature importance, and visualizations, and download results or export predictions.
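
Before submitting, it can help to write down the choices from steps 3 and 4 and sanity-check them. The sketch below is only an illustration of that checklist in Python. It is not a CMMI-DCC API: the class name PipelineConfig, its fields, the checks it runs, and the cohort and column names in the example are all hypothetical placeholders.

```python
from dataclasses import dataclass, field
from typing import Dict, List

# Hypothetical model of the choices made in the ML Pipelines form.
# None of these names come from CMMI-DCC itself.
TASK_TYPES = {"classification", "regression"}


@dataclass
class PipelineConfig:
    data_types: List[str]      # e.g. Metabolomics, Proteomics, CBC
    cohorts: List[str]         # studies or cohorts to draw samples from
    features: List[str]        # input columns
    task_type: str             # "classification" or "regression"
    target: str                # the variable you want to predict
    algorithm: str             # e.g. Random Forest, XGBoost
    preprocessing: Dict[str, bool] = field(default_factory=dict)

    def problems(self) -> List[str]:
        """Return human-readable configuration problems (empty if none)."""
        issues = []
        if not self.data_types:
            issues.append("select at least one data type")
        if not self.features:
            issues.append("select at least one feature")
        if self.task_type not in TASK_TYPES:
            issues.append(f"unknown task type: {self.task_type!r}")
        if not self.target:
            issues.append("choose a target variable")
        if self.target in self.features:
            # General ML hygiene: predicting a column from itself leaks the answer.
            issues.append("target variable must not also be an input feature")
        return issues


# Placeholder values for illustration only.
config = PipelineConfig(
    data_types=["Metabolomics", "CBC"],
    cohorts=["example_cohort"],
    features=["glucose", "hemoglobin"],
    task_type="classification",
    target="disease_status",
    algorithm="Random Forest",
    preprocessing={
        "scaling": True,
        "feature_selection": True,
        "outlier_detection": False,
    },
)
print(config.problems())  # prints [] because this configuration passes every check
```

The check that the target is not also listed as a feature reflects a common pitfall rather than a documented CMMI-DCC rule: if the column being predicted is included among the inputs, the reported performance metrics will look better than they should.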


Category: Machine Learning