GitHub - nishu88/InGenius-2018-hackathon: Speech and Speaker Recognition/Diarization using Hidden Markov Models(hmm) and Convolutional Neural Network(cnn)

http://www.ingeniushack.com/

inGenius, the flagship event of PESITSouth IEEE Computer Society, is an annual 24-hour hackathon.

Event title: 7th Edition ,INGENIUS 2k18 24hr hackathon

Date: 3rd and 4th November, 2018.

Prize won: 1st place, 40,000 rupees

PESIT South campus has this hackathon every year, called INGENIUS. This year about 250 teams had submitted their ideas and about 100 teams were qualified for the main hack.The hackathon was held in PESIT south campus for a duration of 24 hours, on 3rd and 4th November. The 1st place cash prize was 40,000 rupees,and 2nd place was 20,000 rupees.

Our team comprised of me(Nishanth D A) and Rajat keshri. We submitted our idea of smart attendance system using speech and speaker recognition system, which prevents proxy. Jury was very impressed with our approach and implementation of the idea to the problem and and the future extensions of the project. We secured first place in this hackathon, winning 40,000 rupees. Attendance system using speaker recogniton and speech recognition to avoid proxy .

We used two approaches here - HIDDEN MARKOV MODEL APPROACH AND CNN MODEL APPROACH

HIDDEN MARKOV MODEL APPROACH Inside the Hidden markov model folder there are multiple files

The NEW_TRAIN.py file is used to train a new user. It takes in 5 voice samples from the user and trains on it and then stores the voice model in the models folder. The training data is stored in a folder inside the dataset, with the speaker name as folder name.

The INDIVIDUAL_TRAIN.py is a script which trains speaker's voice, provided we already have the training data stored in a folder in the datasets folder. This script trains the model and stores it in the models folder.

The ONLY_PREDICT.py file has the script which takes in a new sample value from the speaker , and predicts who the speaker is. The speaker is asked to speak his/her roll number into the microphone and googles speech to text api converts it into text. If the roll number matches the roll number of the given person in the database, then a random word will be generated which the speaker has to speak again , for higher authentication, to prevent proxy givers if they plan to use a recorder audio of some other speaker. After this process, attendance will be given and the database will be updated.

RESET_DB.py is a file which is used to reset the attendance coloumn in .csv attendance file to all zeroes for the next day's attendance

update_to_db.py is a file which updates the attendance to the database.

s.txt holds the spakers names. Registry holds the attendance of all the days and keeps getting updated.

Inside the folder front end and backend, there is a script, which has all the functions which is used to predict and give attendance. It also include a front end in that same script. Just run this script after training and properly specifying the path for everything, and it will ask for your voice input and predict who the speaker is among the lost of speakers.

REMEMBER TO CHANGE THE PATH FOR MODELS AND DATASETS BECAUSE MY PATH USED WILL DEFINATELY BE DIFFERENT FROM THE PATH YOU USE.

CNN APPROACH Run the CNN_TRAIN.py file to run training on your datasets. Put in your training samples in the folder called DATASETS, inside which create a folder with each speakers name and put his/her voices in that particular folder

Run CNN_PREDICT.py to accept a new spoken voice which is recorded when you run the code and it then predicts who the speaker is.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
HIDDEN MARKOV MODELS		HIDDEN MARKOV MODELS
datasets		datasets
frontend and backend		frontend and backend
models		models
samples		samples
CNN_PREDICT.py		CNN_PREDICT.py
CNN_TRAIN.py		CNN_TRAIN.py
README.md		README.md
RESET_DB.py		RESET_DB.py
attendance_database.csv		attendance_database.csv
registry.csv		registry.csv
registry.txt		registry.txt
s.txt		s.txt
speaker_reco.model		speaker_reco.model

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

inGenius, the flagship event of PESITSouth IEEE Computer Society, is an annual 24-hour hackathon.

Event title: 7th Edition ,INGENIUS 2k18 24hr hackathon

Date: 3rd and 4th November, 2018.

Prize won: 1st place, 40,000 rupees

About

Releases

Packages

Languages

nishu88/InGenius-2018-hackathon

Folders and files

Latest commit

History

Repository files navigation

inGenius, the flagship event of PESITSouth IEEE Computer Society, is an annual 24-hour hackathon.

Event title: 7th Edition ,INGENIUS 2k18 24hr hackathon

Date: 3rd and 4th November, 2018.

Prize won: 1st place, 40,000 rupees

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages