End-to-End-Audio-Recognition

The web has been deployed on 121.40.161.184, one can directly access on http://121.40.161.184:8484/music_voice.html(for audio to text translate) and http://121.40.161.184:8484/search.html(for search in database through keyword).

Dependencies

Prepare audio files for training model (train-model.py)
Use pre-trained model to classify targeted audio segments (audio-classifier.py)
Filter to get optimized the audio segments (audio_filter.py)
Segment an audio file into pieces according to segment points (audio-segmenter.py)
For each audio segment, do audio to text translation (audio_recognition.py)
Save the result data in HBase (pythrift.py)

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
models		models
screenshots		screenshots
web_demo		web_demo
.gitignore		.gitignore
README.md		README.md
audio-classifier.py		audio-classifier.py
audio2text_api.py		audio2text_api.py
audio_filter.py		audio_filter.py
audio_recog.py		audio_recog.py
audio_recog_emr.py		audio_recog_emr.py
audio_recognition.py		audio_recognition.py
audio_segmenter.py		audio_segmenter.py
knn.png		knn.png
main.py		main.py
pythrift.py		pythrift.py
train-model.py		train-model.py