podcast-chapterize

This project aims to automatically provide longform audio podcast episodes with chapter markers. This is achieved with statistical natrual language processing algorithms that try to subdivide transcribed podcast episodes into topically cohesive parts.

[work in progress]

Requirements

CLI:

Python 3.6+
ffmpeg
MP4Box
matplotlib (intall via package manager)
Java
Python module requirements (installable via pip3 install -r requirements.txt)

Web interface:

node.js
npm

Usage

This program can be used in the command line or as an HTTP API with a web interface.

CLI

Usage: python3 main.py [subcommand] [options] ...

Help: python3 main.py --help
Subcommand help: python3 main.py [subcommand] --help
Possible subcommands:
- python3 main.py run: Start chapterization process from podcast RSS feed URL
- python3 main.py transcribe: Transcribe podcast episode from RSS feed URL
- python3 main.py chapterize: Chapterize transcript

Web interface

API:

Create python3 venv: python3 -m venv venv
Activate venv: source venv/bin/activate
(optional) Set environment variables for IP address and Port in the ´.flaskenv´ file
start API server with flask run

Frontend:

Serve frontend files 'web/client/dist' on web server

If the server is not running on the same machine:

specify API host in web/client/.env
cd into 'web/client'
install dependencies with npm install
build files with npm run build
serve built files from (web/client/dist) on web server

Name		Name	Last commit message	Last commit date
Latest commit History 90 Commits
api		api
chapterize		chapterize
doc_files		doc_files
test_files		test_files
transcribe		transcribe
web		web
.dockerignore		.dockerignore
.flaskenv		.flaskenv
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
api_server.py		api_server.py
main.py		main.py
requirements.txt		requirements.txt
test_chapterize.py		test_chapterize.py
write_chapters.py		write_chapters.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

podcast-chapterize

Requirements

CLI:

Web interface:

Usage

CLI

Web interface

About

Releases

Packages

Languages

stereolith/podcast-chapterize

Folders and files

Latest commit

History

Repository files navigation

podcast-chapterize

Requirements

CLI:

Web interface:

Usage

CLI

Web interface

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages