Yay Emacs 5: Tweaking my video workflow with WhisperX and subed-record
| speechtotext, emacs, subed, yay-emacsI'm tweaking my video workflow. I use Orgzly Revived on my Android phone to write the text, and I use Easy Voice Recorder to record it. Syncthing automatically copies both to my laptop. I use WhisperX to transcribe my recording, and I use a little bit of Emacs Lisp to figure out timestamps for each word. I edit this to fix errors. I can even rearrange things and get rid of umms or ahs or anything I don't want.Then I use subed-convert to turn it into a VTT file. I can tweak the start and end times by looking at the waveforms. Then I add comments with the visuals I want. I can add images, animated GIFs, or videos, and they're automatically squeezed or stretched to fit. I can also have them play at original speed. Then I set up open captions and use subed-record-compile-video. Tada!
Links:
- Orgzly Revived
- Easy Voice Recorder
- WhisperX
- Using Emacs Lisp to process WhisperX timestamps
- Subed
- My other blog posts about subed
- Subed-record
- Animated GIF By DemonDeLuxe (Dominique Toussaint) - Image: Newtons cradle animation book.gif, CC BY-SA 3.0, https://commons.wikimedia.org/w/index.php?curid=3717500
You can watch this on YouTube, download the video, or download the audio.