OmixHub is a platform that interfaces with GDC using python to help users to apply ML based analysis on different sequencing data. Currently we support only for RNA-Seq based datasets from genomic data commons (GDC)
-
Cohort Creation of Bulk RNA Seq Tumor and Normal Samples from GDC.
-
Bioinformatics analysis:
- Application of PyDESeq2 and GSEA in a single pipeline.
-
Classical ML analysis:
- Applying clustering, supervised ML and outlier sum statistics.
-
Custom API Connections:
- Search and retrieval of Cancer Data cohorts from GDC using complex json filters (Methods in src.Connectors for GDC API search and retrieval using custom queries)
- Interacting with MongoDB database in a pythonic manner (DOCS coming soon).
- Interacting with Google cloud BigQuery in a pythonic manner (DOCS coming soon).
https://omixhub.readthedocs.io/en/latest/getting_started.html
- Clone the repository
git clone https://github.com/adhal007/OmixHub.git - Create the correct conda enviroment for OmixHub:
conda env create -f environment.yaml
Please follow the instructions on