RiverXData

Welcome to RiverXData 👋

RiverXData is an open-source initiative focused on driving innovation in bioinformatics, and multiomics research. We build scalable tools, workflows, and libraries designed to support complex experimental designs across bioinformatics applications. Our mission is to empower researchers with accessible, high-performance, and modern solutions for data analysis and integrative multiomics. Contact for work: nttg8100@gmail.com

🌟 Key Projects

🔬 Core platform

river-slurm: The standard setup for SLURM cluster (High performance computing) to power the data analysis.
river-utils: The CLI tool for setting up the remote environments and river-platform management.
river: A secure bioinformatics data platform for centralized storage, analysis, and management — all with unified authorization and access control which also supports the nf-core ecosystem.

🧪 Data softwares and workflows

Studio

visual-studio-code-server: Visual studio code server for working with HPC remotely
rstudio-server: Rstudio server for working with HPC remotely
jupyterlab: Jupyterlab server for working with HPC remotely

Bioinformatics

GWAS

bioinfor-wf-nipt-human-genetics: A workflow for performing GWAS analysis using NIPT sequencing data
UKB-RAP-Notebooks-Genomics: The notebooks for working with UKB-RAP- a largest human WES/WGS biobank data platform.

NF-core ecosystem:

sarek: Variant calling pipeline for germline and somatic variants from whole genome, exome, or targeted sequencing data. Supports tumor/normal analyses.
rnaseq: RNA-seq processing pipeline that includes quality control, alignment or pseudo-alignment, quantification, and generation of gene expression matrices.
ampliseq: Amplicon sequencing pipeline for microbial community profiling, such as 16S rRNA gene sequencing.
quantms: Quantitative proteomics pipeline for label-free and isobaric labeling analyses using both DDA and DIA data.
taxprofiler: Taxonomic profiling pipeline for shotgun metagenomics, supporting multiple tools and producing standardized outputs.
methylseq: DNA methylation analysis pipeline using bisulfite-treated sequencing data. Supports multiple aligners and provides comprehensive QC.
circrna: Pipeline for detecting and quantifying circular RNAs (circRNAs) from RNA-seq data, including miRNA target prediction.
mag: Metagenomic pipeline for assembling, binning, and annotating metagenome-assembled genomes (MAGs) from short or long reads.
atacseq: ATAC-seq pipeline to identify open chromatin regions, perform peak calling, and assess data quality with various QC metrics.
rnafusion: Fusion detection pipeline using RNA-seq data, combining results from multiple fusion detection tools into reports and visualizations.

AI and ML

smartsensor: The sensor utilizes a smartphone camera with machine learning to analyze and characterize chemical compounds.
enhancer-predictor: The pipeline to predict the enhancer

🤝 Contributions

We welcome contributions from the community! Whether it's bug fixes, feature suggestions, or documentation improvements, every contribution helps drive bioinformatics research forward.

🔗 Stay Connected

Website: riverxdata.com
GitHub: RiverXData GitHub

Join us on this journey to revolutionize data analysis in proteomics and beyond!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RiverXData

Welcome to RiverXData 👋

🌟 Key Projects

🔬 Core platform

🧪 Data softwares and workflows

Studio

Bioinformatics

NF-core ecosystem:

AI and ML

🤝 Contributions

🔗 Stay Connected

Pinned Loading

Repositories

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

People

Top languages

Uh oh!

Most used topics

Uh oh!