How do I run a machine learning analysis on my data?
To run a machine learning analysis in CMMI-DCC:
1. Navigate to ML Pipelines: Click "Analysis", then "ML Pipelines" in the top navigation.
2. Create New Pipeline: Click "New ML Pipeline".
3. Select Data:
   - Choose your data types (Metabolomics, Proteomics, CBC, etc.)
   - Select specific studies or cohorts
   - Choose which features (columns) to include
4. Configure Analysis:
   - Select the task type (Classification or Regression)
   - Choose the target variable (what you want to predict)
   - Select an algorithm (Random Forest, XGBoost, etc.)
   - Configure preprocessing options (scaling, feature selection, outlier detection)
5. Submit Job: Review your configuration and click "Submit". The analysis runs in the background.
6. View Results: Once the job completes, you can view performance metrics and feature importance, and download the results.
Results include model performance metrics, visualizations, and an option to export predictions.
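
To get a feel for what the configuration options above mean (task type, algorithm, scaling, feature selection, and the metrics and feature importance reported afterwards), the following is a minimal offline sketch using scikit-learn. It is not CMMI-DCC's implementation, which this FAQ does not describe. The synthetic data, the 25% test split, the ANOVA F-test feature selection with k=10, and the 200-tree Random Forest are all assumptions chosen for illustration.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.metrics import accuracy_score, roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for a feature table (rows = samples, columns = features).
# Real data would come from your selected studies or cohorts instead.
X, y = make_classification(
    n_samples=200, n_features=20, n_informative=5, random_state=0
)

# Hold out a test set so the reported metrics reflect unseen samples.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Task type: classification. Preprocessing: scaling, then feature selection.
# Algorithm: Random Forest. All parameter values here are illustrative.
pipeline = Pipeline([
    ("scale", StandardScaler()),
    ("select", SelectKBest(score_func=f_classif, k=10)),
    ("model", RandomForestClassifier(n_estimators=200, random_state=0)),
])
pipeline.fit(X_train, y_train)

# Performance metrics on the held-out samples.
predictions = pipeline.predict(X_test)
probabilities = pipeline.predict_proba(X_test)[:, 1]
metrics = {
    "accuracy": accuracy_score(y_test, predictions),
    "roc_auc": roc_auc_score(y_test, probabilities),
}

# Feature importance, mapped back to the original column indices
# that survived feature selection, sorted from most to least important.
selected = pipeline.named_steps["select"].get_support(indices=True)
importances = pipeline.named_steps["model"].feature_importances_
feature_importance = sorted(
    zip(selected.tolist(), importances.tolist()),
    key=lambda pair: pair[1],
    reverse=True,
)

print(metrics)
print(feature_importance[:5])
```

For a regression task, the analogous sketch would swap in `RandomForestRegressor` and a regression metric such as `r2_score`, and use `f_regression` for feature selection.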
Category: Machine Learning