Totally hacked together machine learning speech-to-text using IBM's Watson and Python good for occasional transcription.

dgitis/speech-to-text

speech-to-text

Totally hacked-together machine learning speech-to-text with speaker identification, using IBM's Watson and Python. The script is good for speeding up occasional transcription jobs, but the output still requires editing.

Full instructions from Caret Juice Marketing.

  1. Install json-to-text.py on your computer.
  2. Enter your Watson speech-to-text API key in console.txt.
  3. Edit (if necessary) the URL in console.txt to point to the region where your Watson instance is running.
  4. Navigate your terminal to the folder where you installed json-to-text.py.
  5. Save your input.flac audio file to that same folder.
  6. Paste the console script from console.txt into your terminal to transcribe the audio.
  7. When the transcription is done, run the Python script to transform Watson's JSON output to text.
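
The steps above can be sketched roughly in Python. This is not the code in json-to-text.py or the script in console.txt (neither is shown here); it is a minimal illustration that assumes the request and response shapes IBM documents for the `/v1/recognize` endpoint when `speaker_labels=true` and `timestamps=true` are set. The function names `build_recognize_request` and `watson_json_to_text` are hypothetical, as are the service URL and API key you would pass in.

```python
import base64
import urllib.parse
import urllib.request


def build_recognize_request(service_url, api_key, audio_bytes):
    """Build (but do not send) a POST to Watson's /v1/recognize endpoint.

    Assumption: service_url is the region-specific URL from your IBM Cloud
    credentials, and api_key is the matching speech-to-text API key.
    """
    query = urllib.parse.urlencode({"speaker_labels": "true", "timestamps": "true"})
    # IBM Cloud API keys are sent as HTTP basic auth with the user name "apikey".
    token = base64.b64encode(f"apikey:{api_key}".encode()).decode()
    return urllib.request.Request(
        f"{service_url.rstrip('/')}/v1/recognize?{query}",
        data=audio_bytes,
        headers={"Content-Type": "audio/flac", "Authorization": f"Basic {token}"},
        method="POST",
    )


def _format_turn(speaker, words):
    # Omit the prefix when the response carried no speaker labels.
    prefix = f"Speaker {speaker}: " if speaker is not None else ""
    return prefix + " ".join(words)


def watson_json_to_text(response):
    """Group timestamped words into one line per speaker turn."""
    # Map each word's start time to the speaker number Watson assigned it.
    speaker_at = {
        label["from"]: label["speaker"]
        for label in response.get("speaker_labels", [])
    }

    lines = []
    current_speaker, words = None, []
    for result in response.get("results", []):
        best = result["alternatives"][0]  # highest-ranked transcript
        for word, start, _end in best.get("timestamps", []):
            # Words without a label stay with the current speaker.
            speaker = speaker_at.get(start, current_speaker)
            if speaker != current_speaker and words:
                lines.append(_format_turn(current_speaker, words))
                words = []
            current_speaker = speaker
            words.append(word)
    if words:
        lines.append(_format_turn(current_speaker, words))
    return "\n".join(lines)
```

To use it, send the built request with `urllib.request.urlopen`, parse the response body with `json.loads`, and pass the resulting dictionary to `watson_json_to_text`. As noted above, the text that comes out still needs manual editing.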
