Interview Transcriber is a Python application designed to convert MP4 videos of a 2-person interview into a text transcription. It distinguishes between speakers and provides a clear and structured output.
- Converts video interviews into text transcription.
- Automatically distinguishes between speakers.
- Outputs structured and readable text files.
- Python 3.10 or higher.
- OpenAI API key.
ffmpeg
installed on your system.
- Download and Install Visual Studio Code from here.
- Open Visual Studio Code and install the "Python" extension by Microsoft.
-
Open your terminal or command line.
-
Run:
git clone https://github.com/lukerbs/InterviewTranscriber.git
-
Navigate into the repository folder:
cd InterviewTranscriber
-
Ensure Python is installed on your device by running:
python3 --version
Make sure the version is 3.10 or higher.
-
Open the project in a terminal.
-
Create a Python virtual environment:
python3 -m venv venv source venv/bin/activate
-
Install the required dependencies:
pip3 install -r requirements.txt
-
Install
ffmpeg
using Homebrew:- If you don't have Homebrew, install it:
/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"
- Then install ffmpeg:
brew install ffmpeg
-
Obtain your OpenAI API Key from a team lead or create your own at OpenAI Platform.
-
Create a new file named
.env
:cp .env-example .env
-
Copy the contents from
.env-example
to the.env
file. -
Place your OpenAI API key in the
.env
file, replacingxxxxxx
. -
Save your changes.
-
Launch the application by running:
python app.py
-
A GUI will appear. Use the "Select Video File" button to choose your MP4 video file.
-
Click "TRANSCRIBE INTERVIEW" to start the transcription process.
-
Once completed, the transcription is displayed. You can save the transcript using the "Save to File" button.
app.py
: Main application script.transcript_processor.py
: Handles the transcription processing.audio_files/
: Directory for storing audio files and chunks..env-example
: Template for environment variables including API keys.
Feel free to open issues or submit pull requests with improvements.
This project is licensed under the terms of the MIT license.