Clipception automatically identifies and extracts the most viral and engaging moments from your streams. Perfect for content creators looking to repurpose streaming content with minimal effort.
⏰ Processing Time: For a 1080p VOD that is 4+ hours long, expect approximately 30 minutes processing time.
- Web App: clipception.xyz
- Twitch: twitch.tv/krystal_mess323
- YouTube: youtube.com/@krystal_mess323
- 🎯 AI-Powered Clip Detection: Automatically identifies engaging moments
- 🔊 Advanced Audio Analysis: Detects excitement, laughter, and key moments
- 💪 GPU Acceleration: Optimized for faster processing with CUDA support
- 📊 Engagement Metrics: Ranks clips based on potential virality
- 🎬 Reliable Extraction: Uses subprocess for improved stability
- 🔄 Celery Integration: Background task processing for web application
- Python 3.10 or later
- CUDA-compatible GPU (recommended for faster processing)
- FFmpeg installed on your system
-
Create a virtual environment:
conda create -n clipception python=3.10 conda activate clipception
-
Set API Key:
export OPEN_ROUTER_KEY='your_key_here'
-
Install Dependencies:
Option A - Using UV (recommended for speed):
wget https://astral.sh/uv/install.sh | sh uv pip install -r requirements.txtOption B - Using pip:
pip install -r requirements.txt
Additional dependencies that may need separate installation:
pip install git+https://github.com/openai/whisper.git pip install pydub
python process_video_v4.py /path/to/your/video.mp4Files are organized in the FeatureTranscribe/ directory:
[video_name].enhanced_transcription.json- Detailed transcription with audio featurestop_clips_one.json- Ranked clips with engagement metricsclips/- Directory containing extracted video segments ready for upload
You can customize the clip detection parameters by modifying the settings in config.json:
min_clip_duration: Minimum clip length in seconds (default: 20)max_clip_duration: Maximum clip length in seconds (default: 120)excitement_threshold: Level of excitement required (0-1)laughter_threshold: Laughter detection sensitivity (0-1)
Contributions are welcome! Please feel free to submit a Pull Request.
This project is licensed under the MIT License - see the LICENSE file for details.
- OpenAI Whisper for transcription
- OpenRouter for AI processing
- All the streamers and homies on twitch who've helped test and improve Clipception

