A video introduction in Chinese
This application is based on OpenAI APIs and Microsoft Azure Cognitive Services.
API keys from both platforms are needed.
Additional needed python packages are: OpenAI Python Library, azure-cognitiveservices-speech, PyQt5
pip install openai azure-cognitiveservices-speech PyQt5
- Log in to OpenAI.
- In the overview page, click the upper right icon, and then click "View API keys".
- In the API keys page, click "+ Create new secret key", which will generate the secret key. Be sure to save it because it cannot be seen a second time.
- Log in to Microsoft Azure.
- In the search banner, search "Speech Service".
- Create an instance for Speech Service.
- On the page of the instance you created, click "Keys and Endpoint" under "Resource Management" on the left.
- Copy one of the keys and the "Location/Region".
Under the root directory, run src/main.py by running:
python src/main.py
The first time you run the program, you need to enter OpenAI API keys, default language, Azure API keys, and Azure Region. The settings will be saved under the root directory so that you do not need to enter the settings the next time you run the program.
Click the "Speak" button below. Then SPEAK!
On the toolbar, click "Suggestion", which will show the suggested response generated by ChatGPT. However, this will lower the speed because you need one more access every time.
Add hide text function (04/02/2023)
Web page version developed (07/02/2023): link
Change from single-shot recognition to continuous recognition (13/02/2023)
Under construction