Skip to content
View simonaubertbd's full-sized avatar

Block or report simonaubertbd

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🔬 Data science environment for k8s

TypeScript 823 108 Updated May 7, 2026

Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends

Python 2,137 236 Updated May 7, 2026

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++ 16,720 4,100 Updated May 7, 2026

Python tools for geographic data

Python 5,124 1,015 Updated May 3, 2026

chDB is an in-process OLAP SQL Engine 🚀 powered by ClickHouse

Python 2,686 112 Updated Apr 30, 2026

Flowfile is a visual ETL tool and Python library combining drag-and-drop workflows with Polars dataframes. Build data pipelines visually, define flows programmatically with a Polars-like API, and e…

Python 263 18 Updated May 8, 2026

This repository will allow users to create API requests from postman collections at scale.

Python 13 1 Updated Feb 18, 2026

Extremely fast Query Engine for DataFrames, written in Rust

Rust 38,408 2,804 Updated May 8, 2026

Run Jupyter notebooks as jobs

Python 220 36 Updated Mar 4, 2026

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

Python 48,688 19,918 Updated May 8, 2026

mRemoteNG is the next generation of mRemote, open source, tabbed, multi-protocol, remote connections manager.

C# 10,783 1,587 Updated May 7, 2026

📄 Awesome OCR multiple programing languages toolkits based on ONNX Runtime, OpenVINO, MNN, PaddlePaddle, TensorRT and PyTorch.

Python 6,506 631 Updated May 6, 2026

Specification for storing geospatial vector data (point, line, polygon) in Parquet

Python 1,041 66 Updated Mar 3, 2026

Datagenerator for Data Services

Java 16 6 Updated Sep 29, 2025

PyGWalker: Turn your dataframe into an interactive UI for visual analysis

Python 15,767 862 Updated Apr 4, 2026

PyPika is a python SQL query builder that exposes the full richness of the SQL language using a syntax that reflects the resulting query. PyPika excels at all sorts of SQL queries but is especially…

Python 2,906 330 Updated Feb 4, 2026

Dashboards and notebooks in a single place. Create powerful and flexible dashboards using code, or build beautiful Notion-like notebooks and share them with your team.

TypeScript 4,303 290 Updated Aug 7, 2025

The Database Toolkit for Python

Python 11,833 1,680 Updated May 7, 2026

Scalable and efficient data transformation framework - backwards compatible with dbt.

Python 3,064 381 Updated Apr 29, 2026

the portable Python dataframe library

Python 6,522 717 Updated May 8, 2026

Automated testing to find logic and performance bugs in database systems

Java 1,728 398 Updated May 2, 2026

visual data prep powered by python

TypeScript 1,357 106 Updated May 2, 2026

ReplicaDB is open source tool for database replication, designed for efficiently transferring bulk data between relational and non-relational databases

Java 484 114 Updated May 6, 2026

Main repository containing all releases.

17 Updated Aug 8, 2024

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…

TypeScript 13,837 2,086 Updated May 8, 2026

Apache ECharts is a powerful, interactive charting and data visualization library for browser

TypeScript 66,302 19,810 Updated May 8, 2026

Grafana plugin for MonetDB

TypeScript 4 2 Updated Feb 15, 2024

Apache NiFi

Java 6,081 2,946 Updated May 7, 2026

Apache Superset is a Data Visualization and Data Exploration Platform

TypeScript 72,739 17,221 Updated May 8, 2026

The Context Platform for your Data and AI Stack

Python 11,882 3,476 Updated May 8, 2026
Next