Skip to content

zweiein/End_to_end_Speech_Papers

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 

Repository files navigation

End-to-end Speech Recognition

Importance Paper Published Notes
Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks 2006 Alex Grave
★★★★★★ Supervised sequence labelling with recurrent neural networks 2012 Alex Graves's PhD thesis
Speech Recognition with Deep Recurrent Neural Networks March 2013 Bi-LSTM + skip connections w/ CTC
★★★★ Towards end-to-end speech recognition with recurrent neural networks 2014
End-to-end Continuous Speech Recognition using Attention-based Recurrent NN: First Results 2014
Deep Speech: Scaling up end-to-end speech recognition December 2014 CNN + Bi-RNN + CTC (speech to letters), 25.9% WER if trained only on SWB
★★★ Deep Speech 2: End-to-End Speech Recognition in English and Mandarin December 2015 Baidu
★★★ Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition 2016 Google, Word-level end-to-end
End-to-End Attention-based Large Vocabulary Speech Recognition 2016
Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning 2016
Listen, attend and spell: A neural network for large vocabulary conversational speech recognition 2016 Google
Very Deep Convolutional Networks for End-to-End Speech Recognition 2016
Advances in Joint CTC-Attention based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LM 2017
Attention-Based End-to-End Speech Recognition in Mandarin 2017
Residual Convolutional CTC Networks for Automatic Speech Recognition 2017

Connectionist Temporal Classification (CTC)

Article Notes
谁给讲讲语音识别中的CTC方法的基本原理?
论文——Connectionist Temporal Classification: Labelling Unsegmented Sequence Data with Recurrent Neural Networks

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published