- San Francisco, California
- https://kaicha.ng
Starred repositories
Repository for the research article Across the Atlantic: Distinguishing Between European and Brazilian Portuguese Dialects
Determine the East Asian Width of a Unicode character
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
🎥 Python and OpenCV-based scene cut/transition detection program & library.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Word alignments based on character-level attention maps in Whisper with unsupervised head selection.
modest natural-language processing
Clone of openai Whisperer text normalization done and tested on Typescript!
Robust Speech Recognition via Large-Scale Weak Supervision
🦀Fastest ever Trad/Simp and regional Chinese variants converter | 中文简繁及地區詞轉換
Simple conversion and localization between simplified and traditional Chinese using tables from MediaWiki.
Transcription, forced alignment, and audio indexing with OpenAI's Whisper
A CLI tool for analyzing Claude Code/Codex CLI usage from local JSONL files.
Package for aligning audio files through audio fingerprinting
Relay-style pagination for NestJS GraphQL server.
An AR music collection game where users collect music around their live location and can trade, sell and collect songs. Complete albums, discographies, and try to find the rarest music for profile …
Simple wrapper of tabula-java: extract table from PDF into pandas DataFrame
Fetch worldathletics data in a standardized way with schema validation, value conversion and error handling
Crossword component for Svelte
A flexible, responsive, and easy-to-use crossword component for React apps.
Open Source Proxy Demographic module written in Python
🎮 GraphQL IDE for better development workflows (GraphQL Subscriptions, interactive docs & collaboration)




