The Audio Transcript Generator is a Python application that takes an MP3 or WAV audio file as input and outputs the transcribed text of its spoken content. It uses speech recognition to convert spoken words into written text.
- Transcribe audio files (MP3/WAV) to text.
- Easy-to-use command-line interface.
- Supports multiple languages for transcription.
- Customizable output options.
- Python 3.6 or higher
- FFmpeg (required for processing audio files)
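Before installing, it can help to confirm that both prerequisites are present. The following is a minimal sketch, not part of the project itself: the helper names `python_ok`, `ffmpeg_path`, and `check_prerequisites` are hypothetical, and it assumes FFmpeg is installed under the executable name `ffmpeg` and is reachable through `PATH`.

```python
import shutil
import sys

MIN_PYTHON = (3, 6)  # minimum version stated in the requirements above


def python_ok(version_info=sys.version_info, minimum=MIN_PYTHON):
    """Return True if the (major, minor) version meets the minimum."""
    return tuple(version_info[:2]) >= minimum


def ffmpeg_path():
    """Return the path of the ffmpeg executable, or None if it is not on PATH."""
    return shutil.which("ffmpeg")


def check_prerequisites():
    """Collect readable problems; an empty list means both checks passed."""
    problems = []
    if not python_ok():
        problems.append("Python %d.%d or higher is required" % MIN_PYTHON)
    if ffmpeg_path() is None:
        problems.append("FFmpeg was not found on PATH")
    return problems


if __name__ == "__main__":
    for problem in check_prerequisites():
        print("Missing prerequisite:", problem)
```

Running the script prints one line per missing prerequisite and nothing when both are found.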
- Clone this repository to your local machine:
git clone https://github.com/gpapadopp/voice-recognition-python.git
- Navigate to the project directory:
cd voice-recognition-python
- Install the required Python packages:
pip install -r requirements.txt
- Place your MP3 or WAV audio file in the project directory.
- Open the project's main.py file, go to the last line, and enter the file name of your MP3 or WAV audio file there.
- Run main.py with Python. The transcription will be displayed in the terminal.
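To illustrate what a transcription step of this kind can look like, here is a minimal sketch. It is an assumption, not the project's actual code: it uses the third-party SpeechRecognition and pydub packages, which may differ from what requirements.txt installs, and the names `audio_format` and `transcribe` are hypothetical. The `language` parameter shows one way language selection could be passed through.

```python
import os

SUPPORTED_EXTENSIONS = {".mp3", ".wav"}  # formats named in this README


def audio_format(path):
    """Return 'mp3' or 'wav' for a supported file name, else raise ValueError."""
    ext = os.path.splitext(path)[1].lower()
    if ext not in SUPPORTED_EXTENSIONS:
        raise ValueError("Unsupported audio format: %r" % ext)
    return ext[1:]


def transcribe(path, language="en-US"):
    """Hypothetical helper: transcribe one audio file and return the text.

    Assumes the SpeechRecognition and pydub packages are installed;
    the project's main.py may use different libraries.
    """
    # Imported lazily so the file-name check above works without them.
    import speech_recognition as sr
    from pydub import AudioSegment

    wav_path = path
    if audio_format(path) == "mp3":
        # pydub calls FFmpeg to decode the MP3; the WAV copy is written
        # next to the original and overwrites a file of the same name.
        wav_path = os.path.splitext(path)[0] + ".wav"
        AudioSegment.from_mp3(path).export(wav_path, format="wav")

    recognizer = sr.Recognizer()
    with sr.AudioFile(wav_path) as source:
        audio = recognizer.record(source)  # read the whole file into memory
    # Sends the audio to Google's web speech API, so it needs internet access.
    return recognizer.recognize_google(audio, language=language)
```

With those packages installed, calling `print(transcribe("example.wav"))` would print the recognized text, where `example.wav` is a placeholder file name. An unsupported extension raises `ValueError` before any audio is processed.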
Contributions are welcome! If you'd like to contribute to this project, please follow these steps:
- Fork the repository.
- Create a new branch for your feature/bugfix:
git checkout -b feature-name
- Make your changes and commit them:
git commit -m "Description of changes"
- Push your changes to your fork:
git push origin feature-name
- Create a pull request detailing your changes.
This project is licensed under the MIT License.
For questions or inquiries, please contact [email protected].