    Repositories list

    • WailBrew

      Public
      Minimalistic Homebrew GUI made with Go, Wails and React.
      TypeScript
      49 0 0 0 Updated Oct 16, 2025
    • AI exp : LLM, tools, MLOps, ... | #SE
      Jupyter Notebook
      1 0 0 0 Updated Jul 4, 2025
    • Pocket data flows orchestrated using Prefect
      Python
      16 0 0 0 Updated Mar 14, 2025
    • Postgres with GPUs for ML/AI apps.
      Rust
      352 0 0 0 Updated Jan 16, 2025
    • Java Spring (Boot/Cloud..) backend playground | #SE
      JavaScript
      4 0 0 0 Updated Jun 20, 2024
    • Apache Airflow Website
      403 0 0 0 Updated Mar 20, 2024
    • prefect

      Public
      Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines
      Python
      2k 0 0 0 Updated Mar 14, 2024
    • Apache Flink Training Exercises
      Java
      701 0 0 0 Updated Jan 18, 2024
    • This demo shows how to capture data changes from relational databases and stream them to Confluent Cloud.
      HCL
      4 0 0 0 Updated Dec 9, 2023
    • drone-fly

      Public
      A service which allows Hive Metastore Listeners to be deployed outside of the Hive Metastore Service
      Java
      3 0 0 0 Updated Jul 3, 2023
    • My CS learning : algorithm, data structure, and system design | #SE
      Python
      48 0 0 0 Updated Jun 11, 2023
    • metabase

      Public
      The simplest, fastest way to get business intelligence and analytics to everyone in your company 😋
      Clojure
      6.1k 0 0 0 Updated Mar 5, 2023
    • Amazon Redshift Utils contains utilities, scripts and views which are useful in a Redshift environment
      Python
      1.3k 0 0 0 Updated Feb 10, 2023
    • Spark Structured Streaming / Kafka / Cassandra / Elastic
      Scala
      75 0 0 0 Updated Feb 7, 2023
    • datahub

      Public
      The Metadata Platform for the Modern Data Stack
      Java
      3.3k 0 0 0 Updated Feb 6, 2023
    • Free Data Engineering course!
      Jupyter Notebook
      7.2k 0 0 0 Updated Feb 5, 2023
    • Open-source data observability for analytics engineers.
      HTML
      201 0 0 0 Updated Jan 12, 2023
    • AWS libraries/modules for working with Kinesis aggregated record data
      Java
      145 0 0 0 Updated Jan 3, 2023
    • A library for scraping listings data from daft.ie
      Python
      7 0 0 0 Updated Dec 24, 2022
    • Fish-like autosuggestions for zsh
      Shell
      1.9k 0 0 0 Updated Dec 23, 2022
    • The software used to extract structured data from Wikipedia
      Scala
      290 0 0 0 Updated Nov 23, 2022
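    • Python web-scraping tutorial series: learn Python scraping from 0 to 1, including browser packet capture and mobile app packet capture (e.g. fiddler, mitmproxy); use of the modules involved in scraping, such as requests, beautifulSoup, selenium, appium and scrapy; plus IP proxies, CAPTCHA recognition, using MySQL and MongoDB from Python, multithreaded and multiprocess scrapers, reverse engineering CSS anti-scraping obfuscation, JS reverse engineering for scrapers, distributed scrapers, hands-on scraper project examples, and more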
      Python
      3.9k 0 0 0 Updated Nov 23, 2022
    • Extract metadata from a video to an sqlite database
      Python
      3 0 0 0 Updated Oct 5, 2022
    • Process Common Crawl data with Python and Spark
      Python
      90 0 0 0 Updated Sep 21, 2022
    • Scala
      213 0 0 0 Updated Sep 4, 2022
    • Python
      71 0 0 0 Updated Aug 14, 2022
    • The official repository for the Rock the JVM ZIO course
      Scala
      53 0 0 0 Updated Jul 9, 2022
    • maxwell

      Public
      Maxwell's daemon, a mysql-to-json kafka producer
      Java
      1k 0 0 0 Updated Jul 7, 2022
    • Amazon Kinesis Data Analytics Flink Starter Kit helps you with the development of Flink Application with Kinesis Stream as a source and Amazon S3 as a sink. This demonstrates the use of Session Window with AggregateFunction.
      Java
      15 0 0 0 Updated Jun 30, 2022
    • REST job server for Apache Spark
      Scala
      986 0 0 0 Updated Jun 24, 2022