Totally hacked together machine learning speech-to-text using IBM's Watson and Python with speaker identification. The script is good to speed up occasional transcription jobs but the output still requires editing.
Full instructions from Caret Juice Marketing.
- Install json-to-text.py on your computer,
- Enter your Watson speech-to-text API key in console.txt,
- Edit (if necessary) the URL in console.txt to point to the region where Watson is running,
- Navigate your terminal instance to the folder that you installed json-to-text.py,
- Save your input.flac audio file to that same folder,
- Copy the console script to your terminal to transcribe the audio,
- When the transciption is done, run the Python script to transform Watson's JSON output to text.