- Bordeaux, France
- in/simon-aubert-76ab898a
Stars
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
chDB is an in-process OLAP SQL Engine 🚀 powered by ClickHouse
Flowfile is a visual ETL tool and Python library combining drag-and-drop workflows with Polars dataframes. Build data pipelines visually, define flows programmatically with a Polars-like API, and e…
This repository will allow users to create API requests from postman collections at scale.
Extremely fast Query Engine for DataFrames, written in Rust
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
mRemoteNG is the next generation of mRemote, open source, tabbed, multi-protocol, remote connections manager.
📄 Awesome OCR multiple programing languages toolkits based on ONNX Runtime, OpenVINO, MNN, PaddlePaddle, TensorRT and PyTorch.
Specification for storing geospatial vector data (point, line, polygon) in Parquet
PyGWalker: Turn your dataframe into an interactive UI for visual analysis
PyPika is a python SQL query builder that exposes the full richness of the SQL language using a syntax that reflects the resulting query. PyPika excels at all sorts of SQL queries but is especially…
Dashboards and notebooks in a single place. Create powerful and flexible dashboards using code, or build beautiful Notion-like notebooks and share them with your team.
Scalable and efficient data transformation framework - backwards compatible with dbt.
Automated testing to find logic and performance bugs in database systems
ReplicaDB is open source tool for database replication, designed for efficiently transferring bulk data between relational and non-relational databases
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…
Apache ECharts is a powerful, interactive charting and data visualization library for browser
Grafana plugin for MonetDB
Apache Superset is a Data Visualization and Data Exploration Platform
The Context Platform for your Data and AI Stack

