Skip to content
@riverxdata

RiverXData

RiverXData - "Simple-Effective-Efficient"

Welcome to RiverXData 👋

RiverXData is an open-source initiative focused on driving innovation in bioinformatics, and multiomics research. We build scalable tools, workflows, and libraries designed to support complex experimental designs across bioinformatics applications. Our mission is to empower researchers with accessible, high-performance, and modern solutions for data analysis and integrative multiomics. Contact for work: [email protected]

🌟 Key Projects

🔬 Core platform

  • river-slurm: The standard setup for SLURM cluster (High performance computing) to power the data analysis.
  • river-utils: The CLI tool for setting up the remote environments and river-platform management.
  • river: A secure bioinformatics data platform for centralized storage, analysis, and management — all with unified authorization and access control which also supports the nf-core ecosystem.

🧪 Data softwares and workflows

Studio

Bioinformatics

GWAS

NF-core ecosystem:

  • sarek: Variant calling pipeline for germline and somatic variants from whole genome, exome, or targeted sequencing data. Supports tumor/normal analyses.
  • rnaseq: RNA-seq processing pipeline that includes quality control, alignment or pseudo-alignment, quantification, and generation of gene expression matrices.
  • ampliseq: Amplicon sequencing pipeline for microbial community profiling, such as 16S rRNA gene sequencing.
  • quantms: Quantitative proteomics pipeline for label-free and isobaric labeling analyses using both DDA and DIA data.
  • taxprofiler: Taxonomic profiling pipeline for shotgun metagenomics, supporting multiple tools and producing standardized outputs.
  • methylseq: DNA methylation analysis pipeline using bisulfite-treated sequencing data. Supports multiple aligners and provides comprehensive QC.
  • circrna: Pipeline for detecting and quantifying circular RNAs (circRNAs) from RNA-seq data, including miRNA target prediction.
  • mag: Metagenomic pipeline for assembling, binning, and annotating metagenome-assembled genomes (MAGs) from short or long reads.
  • atacseq: ATAC-seq pipeline to identify open chromatin regions, perform peak calling, and assess data quality with various QC metrics.
  • rnafusion: Fusion detection pipeline using RNA-seq data, combining results from multiple fusion detection tools into reports and visualizations.

AI and ML

  • smartsensor: The sensor utilizes a smartphone camera with machine learning to analyze and characterize chemical compounds.
  • enhancer-predictor: The pipeline to predict the enhancer

🤝 Contributions

We welcome contributions from the community! Whether it's bug fixes, feature suggestions, or documentation improvements, every contribution helps drive bioinformatics research forward.

🔗 Stay Connected

Join us on this journey to revolutionize data analysis in proteomics and beyond!

Popular repositories Loading

  1. river-slurm river-slurm Public

    This repo for setting up the local infrastructure for bioinformatics analysis

    Jinja 2 1

  2. doc-epigenetics-and-genetics-references doc-epigenetics-and-genetics-references Public

    Forked from nttg8100/Epigenetics-and-genetics-references

    With basic reference annotation for genetics and epigenetics

    Shell

  3. pro-machine-learning-prostate-cancer-miRNA-early-detection pro-machine-learning-prostate-cancer-miRNA-early-detection Public

    Python

  4. data-sw-visual-studio-code-server data-sw-visual-studio-code-server Public

    Host the code-server(based on open source of vscode) on slurm job using sbatch.

    Shell

  5. data-sw-rstudio data-sw-rstudio Public

    RStudio that allows to run on HPC

    Shell

  6. bioinfor-wf-quality-control-ngs bioinfor-wf-quality-control-ngs Public

    Nextflow

Repositories

Showing 10 of 24 repositories

Top languages

Loading…

Most used topics

Loading…