Faster Speech To Speech:

[]

A very fast way to get S2S models

How? just abuse APIs (while they are still free)

Important: Currently it is still in development, overtime I would increase its feature capabilities and modifications :) <If my school hours permits ?>

A super fast and quick way to implement a speech to speech model using:

1. Groq

2. Google TTS Service

If you wish to just call Google TTS API just run

python s2s.py --gtts

3. Coqui TTS (Locally Run & Modifiable!) [Default]

Simple Pipeline:

Person Talking ===> Groq Whisper ===> Groq Llama-70b-8192 ===> Google TTS Services

Modifications:

How To Use:

Ensure you install the requirements via

pip install requirements.txt
Remember to sign up for Groq Playground and create an API_KEY
Run the script

python s2s.py

It goes by a simple CLI interface

Just run python s2s.py followed by the following args commands you can try:

--output file_to_path : To set path for audio

--gtts: Enables Google TTS Service, else fallsback to Coqui TTS

--temperature (insert int): controls creativity of the model

--audio (audio_name): Sets name of your microphone

--model (model_name): Set model name (Default is llama3-70b-8192)

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
audio		audio
.gitignore		.gitignore
README.md		README.md
demo.MOV		demo.MOV
requirements.txt		requirements.txt
s2s.py		s2s.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Faster Speech To Speech:

A very fast way to get S2S models

How? just abuse APIs (while they are still free)

Important: Currently it is still in development, overtime I would increase its feature capabilities and modifications :) <If my school hours permits ?>

1. Groq

2. Google TTS Service

3. Coqui TTS (Locally Run & Modifiable!) [Default]

Simple Pipeline:

Person Talking ===> Groq Whisper ===> Groq Llama-70b-8192 ===> Google TTS Services

Modifications:

How To Use:

It goes by a simple CLI interface

Requirements:

1. Cuda 12.4

2. Groq Account

3. Accepting Coqui TTS UAT

Acknowledgements:

1. Groq (For providing superfast inference)

2. Coqui (For providing the open source weights)

About

Releases

Packages

Languages

harvestingmoon/S2S

Folders and files

Latest commit

History

Repository files navigation

Faster Speech To Speech:

A very fast way to get S2S models

How? just abuse APIs (while they are still free)

Important: Currently it is still in development, overtime I would increase its feature capabilities and modifications :) <If my school hours permits ?>

1. Groq

2. Google TTS Service

3. Coqui TTS (Locally Run & Modifiable!) [Default]

Simple Pipeline:

Person Talking ===> Groq Whisper ===> Groq Llama-70b-8192 ===> Google TTS Services

Modifications:

How To Use:

It goes by a simple CLI interface

Requirements:

1. Cuda 12.4

2. Groq Account

3. Accepting Coqui TTS UAT

Acknowledgements:

1. Groq (For providing superfast inference)

2. Coqui (For providing the open source weights)

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages