Skip to content
View stepfunction83's full-sized avatar

Highlights

  • Pro

Block or report stepfunction83

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenario…

Python 3,695 322 Updated Jun 22, 2026

OpenMOSS pure C++ pipeline based on GGML

C++ 52 7 Updated Jun 30, 2026

Generates dynamic, interactive-textbook lessons on the fly based on user interests and immediate doubts. Maps the learning journey as a tree structure, allowing users to backtrack to the previous s…

TypeScript 43 6 Updated May 26, 2026

A repository for agentic world building to roleplay in. A world seed template is used for the pipeline and the output is a Silly Tavern ready character cards, world info and system settings.

Python 54 4 Updated Jul 2, 2026

Linux software for the Stream Deck with support for original Elgato Stream Deck plugins

Rust 1,852 126 Updated Jun 29, 2026

High-Quality Voice Cloning TTS for 600+ Languages

Python 7,925 1,250 Updated Jun 24, 2026

Open Source Speech Language Model

Jupyter Notebook 1,004 109 Updated May 11, 2026
Python 1,905 289 Updated Jun 23, 2026

Fully automatic censorship removal for language models

Python 25,740 2,789 Updated Jul 1, 2026

Adventure Kid Wave Forms are a collection of sampled one cycle waveforms for use in synthesizers and samplers or similar sound generators.

C 602 89 Updated May 22, 2026

Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation (NeurIPS 2025)

Python 580 83 Updated Sep 26, 2025

VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)

Python 1,135 429 Updated Jun 12, 2026

Noise supression using deep filtering

Python 4,399 484 Updated Oct 17, 2024

A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.

Python 1,508 243 Updated Feb 18, 2026

Scheduler for ComfyUI and an attempt at optimized scheduler for the Chroma architecture.

Python 27 3 Updated May 13, 2026
Jupyter Notebook 347 27 Updated Nov 1, 2025

RIFE, Real-Time Intermediate Flow Estimation for Video Frame Interpolation implemented with ncnn library

C++ 64 7 Updated Feb 5, 2026

[AAAI 2025] Event-Enhanced Blurry Video Super-Resolution

Python 459 46 Updated Nov 11, 2025

[NeurIPS'25] One-Step Diffusion for Detail-Rich and Temporally Consistent Video Super-Resolution

Python 352 22 Updated Mar 9, 2026

~95% on SimpleQA (e.g. Qwen3.6-27B on a 3090). Supports all local and cloud LLMs (llama.cpp, Ollama, Google, ...). 10+ search engines - arXiv, PubMed, your private documents. Everything Local & En…

Python 8,640 760 Updated Jul 2, 2026

Lets make video diffusion practical!

Python 17,089 1,719 Updated Oct 16, 2025

The official implementation of Self-Play Preference Optimization (SPPO)

Python 588 48 Updated Jan 23, 2025

OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT

Python 437 82 Updated Sep 26, 2025

A Conversational Speech Generation Model

Python 14,682 1,487 Updated May 27, 2025

Roland S1 "Tweak" synthesiser quick reference sheet

27 1 Updated Jun 11, 2023

A cli app for experimenting with kokoro voice creating and mixing using the available voices to interpolate new ones

Python 38 5 Updated Feb 5, 2025
Next