-
Samsung Research
- Suwon, Republic of Korea
-
16:51
(UTC +09:00) - http://sephiroce.com
-
NeMo Public
Forked from NVIDIA-NeMo/NeMoA scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Python Apache License 2.0 UpdatedMar 4, 2026 -
personaplex Public
Forked from NVIDIA/personaplexPersonaPlex code.
Python MIT License UpdatedFeb 26, 2026 -
MOSS-Audio-Tokenizer Public
Forked from OpenMOSS/MOSS-Audio-TokenizerMOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, it supports streaming and variable bitrates, delivering SOTA …
Python Apache License 2.0 UpdatedFeb 13, 2026 -
silero-vad Public
Forked from snakers4/silero-vadSilero VAD: pre-trained enterprise-grade Voice Activity Detector
Python MIT License UpdatedFeb 12, 2026 -
-
agents Public
Forked from livekit/agentsA powerful framework for building realtime voice AI agents 🤖🎙️📹
Python Apache License 2.0 UpdatedFeb 2, 2026 -
langgraph Public
Forked from langchain-ai/langgraphBuild resilient language agents as graphs.
Python MIT License UpdatedFeb 2, 2026 -
livekit Public
Forked from livekit/livekitEnd-to-end realtime stack for connecting humans and AI
Go Apache License 2.0 UpdatedFeb 2, 2026 -
python-sdks Public
Forked from livekit/python-sdksLiveKit real-time and server SDKs for Python
Python Apache License 2.0 UpdatedJan 30, 2026 -
supertonic Public
Forked from supertone-inc/supertonicLightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.
C++ MIT License UpdatedJan 22, 2026 -
sherpa-onnx Public
Forked from k2-fsa/sherpa-onnxSpeech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…
C++ Apache License 2.0 UpdatedJan 16, 2026 -
moshi-finetune Public
Forked from nu-dialogue/moshi-finetuneFine-tuning Moshi/J-Moshi on your own spoken dialogue data
Python Apache License 2.0 UpdatedJan 5, 2026 -
ai-agents-for-beginners Public
Forked from microsoft/ai-agents-for-beginners10 Lessons to Get Started Building AI Agents
Jupyter Notebook MIT License UpdatedJun 9, 2025 -
-
whisper.tflite Public
Forked from nyadla-sys/whisper.tfliteOptimized OpenAI's Whisper TFLite Port for Efficient Offline Inference on Edge Devices
C++ MIT License UpdatedOct 28, 2023 -
openai-whisper Public
Forked from moonshine-ai/openai-whisperRobust Speech Recognition via Large-Scale Weak Supervision
C MIT License UpdatedAug 28, 2023 -
fast_rnnt Public
Forked from k2-fsa/fast_rnntA torch implementation of a recursion which turns out to be useful for RNN-T.
Python Other UpdatedJul 19, 2023 -
-
warp-transducer Public
Forked from HawkAaron/warp-transducerA fast parallel implementation of RNN Transducer.
C++ Apache License 2.0 UpdatedMay 5, 2021 -
faster-rnnlm Public
Forked from yandex/faster-rnnlmFaster Recurrent Neural Network Language Modeling Toolkit with Noise Contrastive Estimation and Hierarchical Softmax
C++ Other UpdatedFeb 2, 2021 -
ft_hinton86 Public
c++ implementation of http://www.cs.toronto.edu/~fritz/absps/ieee-lre.pdf
-
-

