- Support srt format;
- Support using srt as reference;
- Support using video as reference (via ffmpeg);
- Support writing to stdout or file (read from stdin not yet supported; can only read from file);
- Misc bugfixes;
- Proper logging;
- Proper version handling;
- Add Chinese to the list of encodings that can be inferred;
- Make srt parsing more robust;
- Get rid of auditok (GPLv3, was hurting alignment algorithm);
- Change to alignment algo: don't penalize matching video non-speech with subtitle speech;
- Developer note: change progress-only to vlc-mode and remove from help docs;
- Allow reading input srt from stdin;
- Allow specifying encodings for reference, input, and output srt;
- Use the same encoding for both input srt and output srt by default;
- Developer note: using sklearn-style data pipelines now;
- Minor change to subtitle speech extraction;
- Add six to requirements.txt;
- Set default encoding to utf8 to ensure non ascii filenames handled properly;
- Clamp subtitles to maximum duration (default 10);
- Fix argument parsing regression;
- Add utf-16 to list of encodings to try for inference purposes;