Built this over the past week using Cursor. Here’s the result
A fully automated system that watches your video library, queues and processes files, transcribes audio, and translates subtitles into any desired language
Models Used:
- Claude 4-sonnet MAX
- Claude 4-opus MAX
Cursor Usage:
- Rules:
RIPER-5
- Token Cost: ~
$80
Tech Stack Overview
Core System:
Dockerized with full GPU support
NVIDIA PyTorch container (CUDA 12.6)
WhisperX + RTX acceleration
GPT-4o translation engine with parallel processing
SQLite database for queue + state tracking
Auto file monitoring and video detection
Complete error handling + logging
Flask WebUI
Batch mode + file scheduling
API support
Job status dashboard
Manual reprocessing queue
Per-file language selection
External
.srt
file detection + translationSilent, Regular, and Focus modes
Enhancement Pipeline
Telegram notification alerts
Metadata fetcher for context (TMDB, etc.)
Subtitle styling (font, color, positioning, format)
I know tools like Subgen exist, but they don’t offer multilingual subtitle generation. English isn’t my first language—and after years of subtitle hunting, this is the solution my wife and I needed.
GitHub release with setup instructions coming soon