gitcommitshow
🎯
Focus

Organizations

@Git-Commit-Show

Stars

ML

40 repositories

12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all

Jupyter Notebook 85,699 20,753 Updated May 8, 2026

Dolt – Git for Data

Go 22,622 760 Updated May 9, 2026

Decoupling Reasoning from Observations for Efficient Augmented Language Models

Python 943 84 Updated Jul 28, 2023

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Jupyter Notebook 14,683 1,713 Updated Feb 22, 2026

A comprehensive guide to building RAG-based LLM applications for production.

Jupyter Notebook 1,855 253 Updated Aug 2, 2024

Focus on prompting and generating

Python 48,445 7,889 Updated Dec 1, 2025

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 14,842 2,044 Updated Nov 19, 2024

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Jupyter Notebook 10,135 1,324 Updated Nov 9, 2023

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Python 7,246 1,062 Updated Aug 5, 2024

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell 15,385 5,358 Updated Sep 22, 2025

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

C++ 26,753 4,095 Updated Jun 19, 2025

A fast local neural text to speech engine for Mycroft

Python 1,262 130 Updated Mar 25, 2025

Mycroft Core, the Mycroft Artificial Intelligence platform.

Python 6,623 1,299 Updated Sep 8, 2024

Offline private voice assistant for many human languages

Shell 2,739 207 Updated Apr 22, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python 99,208 12,171 Updated Apr 15, 2026

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python 4,081 355 Updated Jan 8, 2025

We write your reusable computer vision tools. 💜

Python 38,437 3,426 Updated May 6, 2026

The code for some apps built with Sieve.

Python 86 19 Updated Nov 22, 2024

Mora: More like Sora for Generalist Video Generation

Python 1,587 110 Updated Oct 10, 2024

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.

Python 19,588 1,903 Updated May 9, 2026

End-to-End Speech Processing Toolkit

Python 9,829 2,400 Updated May 8, 2026

Paper Piano uses Python and OpenCV to detect key presses on a hand-drawn piano, translating them into digital notes and sound.

Python 43 5 Updated Aug 16, 2024

Routing on Random Forest (RoRF)

Python 238 20 Updated Sep 24, 2024

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 20,455 3,898 Updated May 9, 2026

Official inference framework for 1-bit LLMs

Python 38,914 3,538 Updated Mar 10, 2026

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

Python 80,113 9,131 Updated May 10, 2026

Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …

Python 39,139 3,750 Updated Jul 9, 2025

A Lightweight Recommendation System

Python 9,304 719 Updated Oct 13, 2025

A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone

Python 24,528 1,908 Updated May 7, 2026

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Python 9,762 837 Updated May 8, 2026