Skip to content
View ridox's full-sized avatar

Block or report ridox

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The Data Engineering Cookbook

Python 14,929 2,688 Updated Jan 17, 2026

open source training courses about distributed database and distributed systems

Rust 10,819 1,377 Updated Sep 18, 2023

Automatically identify anti-patterns in SQL queries

C++ 2,520 121 Updated Feb 21, 2024

A fast type checker and language server for Python

Rust 5,278 263 Updated Feb 3, 2026

A reactive notebook for Python โ€” run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.

Python 18,840 896 Updated Feb 3, 2026

The Internals of Apache Spark

1,538 460 Updated Jul 5, 2025

Apache DataFusion Ray

Python 229 25 Updated Oct 5, 2025

Apache DataFusion Python Bindings

Python 556 144 Updated Feb 2, 2026

Unofficial rust implementation of Apache Iceberg with integration for Datafusion

Rust 232 38 Updated Feb 2, 2026

Apache Spark Connect Client for Rust

Rust 117 22 Updated Jun 10, 2025

A collection of RBIR projects and posts for anyone interested in joining this journey.

Rust 306 11 Updated Feb 3, 2026

Data Agent Ready Warehouse : One for Analytics, Search, AI, Python Sandbox. โ€” rebuilt from scratch. Unified architecture on your S3.

Rust 9,127 852 Updated Feb 3, 2026

Apache DataFusion SQL Query Engine

Rust 8,352 1,932 Updated Feb 3, 2026

:octocat: A curated awesome list of lists of interview questions. Feel free to contribute! ๐ŸŽ“

22 7 Updated Feb 11, 2018

A pure-python rules engine. Packed with components to build rules and a rule parser. โ–ถ๏ธ

Python 33 6 Updated Feb 29, 2024

The SQL IDE for Your Terminal.

Python 5,707 133 Updated Feb 2, 2026

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook 39,596 7,598 Updated Dec 15, 2025

the portable Python dataframe library

Python 6,377 695 Updated Feb 3, 2026

A cross platform way to express data transformation, relational algebra, standardized record expression and plans.

Python 1,464 190 Updated Jan 30, 2026

data load tool (dlt) is an open source Python library that makes data loading easy ๐Ÿ› ๏ธ

Python 4,866 443 Updated Feb 3, 2026

Apache Spark - A unified analytics engine for large-scale data processing

Scala 42,744 29,051 Updated Feb 3, 2026

The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query processing

Rust 1,698 208 Updated Feb 3, 2026

Algorithm and data structure articles for https://cp-algorithms.com (based on http://e-maxx.ru)

C++ 10,107 1,981 Updated Feb 3, 2026

Free & OSS PostgreSQL RDS / DBaaS, Self-Host PG like a Pro

Shell 4,584 323 Updated Feb 2, 2026

Simple SQL in Python

Python 1,388 63 Updated Jan 6, 2026

Treat your database as Code

112 4 Updated Jan 15, 2025

This repository contains the source code for the paper First Order Motion Model for Image Animation

Jupyter Notebook 14,998 3,283 Updated Nov 14, 2024

๐Ÿ“š Parameterize, execute, and analyze notebooks

Python 6,366 444 Updated Jan 5, 2026
Next