Stars
Utility to compare data between homogeneous or heterogeneous environments to ensure source and target tables match
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Official Microsoft repository for SQL Server in Docker resources
【AI低代码平台】“低代码+零代码”双模驱动AI智能平台 AI low-code platform empowers enterprises to quickly develop low-code solutions and build AI applications. 助力企业快速实现低代码开发和构建AI应用! AI应用平台涵盖:AI应用、AI模型、AI聊天助手、知识库、AI流程编排、MC…
Docker Official Image packaging for Postgres
Prometheus exporter for PgBouncer
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
Fluentd: Unified Logging Layer (project under CNCF)
Pentaho Data Integration ( ETL ) a.k.a Kettle
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …
Greenplum Database - Massively Parallel PostgreSQL for Analytics. An open-source massively parallel data platform for analytics, machine learning and AI.
Greenplum Database (GPDB) 4.3.7.1 "Single Node" Dockerized for testing purposes only.
pivotaldata / gpdb-docker
Forked from dbbaskette/gpdb-dockerPivotal Greenplum Database Base Docker Image (4.3.7.1)
Datafaker is a large-scale test data and flow test data generation tool. Datafaker fakes data and inserts to varied data sources. 测试数据生成工具
Stream Processing with Apache Flink - Scala Examples
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
SeaTunnel is a multimodal, high-performance, distributed, massive data integration tool.
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
"rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Azure Blob, Azure Files, Yandex Files
TPC-DS benchmark kit with some modifications/fixes
TPC-H benchmark kit with some modifications/additions
A library that provides an embeddable, persistent key-value store for fast storage.

