Index of /~bryan/papers2/ai/speech-recognition
Name
Last modified
Size
Description
Parent Directory
-
Speech recognition system robustness to microphone variations - 1995.pdf
1999-05-25 13:10
713K
Landmark detection for distinctive featureābased speech recognition - 1996.pdf
2005-09-20 17:38
280K
Why are spectrograms hard to read?.pdf
2006-05-17 12:17
720K
Recent progress in the MIT spoken lecture processing project - 2007.pdf
2007-09-21 07:34
189K
An overview of speech recognition and synthesis.pdf
2010-08-12 04:26
2.7M
Intrinsic spectral analysis for zero and high resource speech recognition.pdf
2012-07-03 06:04
95K
Improving searchability of automatically transcribed lectures through dynamic language modelling - dissertation - wikipedia - 2012.pdf
2012-12-11 00:36
2.7M
Leveraging large amounts of loosely transcribed corporate videos for acoustic model training - 2011.pdf
2013-01-18 15:43
92K
Intrinsic spectral analysis - 2013.pdf
2013-03-20 12:36
1.9M
The speech recognition and machine translation system of IOIT for IWSLT 2013 - ted talks.pdf
2013-12-19 05:16
240K
Kaldi+PDNN: Building DNN-based ASR systems with kaldi and PDNN - 2014.pdf
2014-01-27 17:18
157K
A bottom-up modular search approach to large vocabulary continuous speech recognition - 2013.pdf
2014-02-09 19:57
1.3M
Speech recognition using Kaldi - 2014.pdf
2014-06-12 08:38
1.6M
A lecture transcription system combining neural network acoustic and language models - slides - 2013.pdf
2014-10-20 05:44
2.2M
End-to-end continuous speech recognition using attention-based recurrent NN: first results - 2014.pdf
2014-12-04 17:30
425K
Semi-supervised training for lecture transcription in resource-scarce environments - 2014.pdf
2014-12-19 00:28
95K
Deep speech: Scaling up end-to-end speech recognition - Baidu - 2014.pdf
2014-12-22 17:33
514K
Librispeech: an ASR corpus based on public domain audiobooks - 2015.pdf
2015-01-27 23:21
95K
Automatic speech recognition and machine translation system for MIT english lectures using MIT and TED corpus - 2013.pdf
2015-02-23 05:37
651K
Using keyword spotting to help humans correct captioning faster - 2015.pdf
2015-03-28 08:25
432K
Introducing CURRENNT: The Munich open-source CUDA recurrent neural network toolkit - 2015.pdf
2015-04-12 08:58
286K
The IBM 2015 English conversational telephone speech recognition system - 2015.pdf
2015-05-24 17:34
180K
Fast and accurate recurrent neural network acoustic models for speech recognition - 2015.pdf
2015-07-26 17:23
322K
Knowledge-based approach to consonant recognition.pdf
2015-07-28 19:16
44K
Scalable distributed DNN training using commodity GPU cloud computing - 2015.pdf
2015-09-20 15:32
1.1M
Effective approaches to attention-based neural machine translation - 2015.pdf
2015-09-21 17:33
244K
Character-based neural machine translation - 2015.pdf
2015-11-16 18:09
370K
Deep speech 2: End-to-end speech recognition in English and Mandarin - 2015.pdf
2015-12-08 17:28
857K
pyAudioAnalysis: An open-source python library for audio signal analysis - 2015.pdf
2015-12-11 10:32
3.2M
The IOIT english ASR system for IWSLT 2015 - ted talks.pdf
2015-12-19 04:46
82K
Environmental noise embeddings for robust speech recognition - 2016.pdf
2016-01-11 17:41
1.1M
Character-level incremental speech recognition with recurrent neural networks - 2016.pdf
2016-01-28 18:15
125K
An empirical exploration of CTC acoustic models - 2016.pdf
2016-01-29 13:38
148K
The effects of automatic speech recognition quality on human transcription latency - 2015.pdf
2016-03-04 07:34
1.3M
Optimizing performance of recurrent neural networks on GPUs - 2016.pdf
2016-04-07 17:37
93K
Integrated adaptation with multi-factor joint-learning for far-field speech recognition - 2016.pdf
2016-04-11 12:25
304K
Robust coherence-based spectral enhancement for speech recognition in adverse real-world environments - 2016.pdf
2016-04-13 17:55
327K
Convolutional, long short-term memory, fully connected deep neural networks - Google - 2015.pdf
2016-04-20 20:17
172K
Deep neural networks for acoustic modeling in speech recognition - 2012.pdf
2016-04-20 20:17
267K
Listen, attend and spell - Google - RNNs without CTC not CLDNN-HMM - 2015.pdf
2016-04-20 20:17
2.2M
Towards lecture transcription in resource-scarce environments - 2012.pdf
2016-04-28 14:24
72K
Can neural machine translation do simultaneous translation? - 2016.pdf
2016-06-07 17:18
642K
Neural machine translation of rare words with subword units - 2015.pdf
2016-06-12 22:31
189K
Calibration of phone likelihoods in automatic speech recognition - 2016.pdf
2016-06-14 17:38
887K
Multi-task recurrent model for speech and speaker recognition - 2016.pdf
2016-06-14 17:50
333K
Segmental recurrent neural networks for end-to-end speech recognition - 2016.pdf
2016-06-20 18:18
413K
A character-level decoder without explicit segmentation for neural machine translation - 2016.pdf
2016-06-21 17:05
671K
A segmental framework for fully-unsupervised large-vocabulary speech recognition - 2016.pdf
2016-06-22 17:27
1.0M
A comprehensive study of deep bidirectional LSTM RNNs for acoustic modeling in speech recognition - 2016.pdf
2016-06-24 15:09
218K
Directly comparing the listening strategies of humans and machines - 2016.pdf
2016-06-27 13:41
6.4M
A 6 mW, 5,000-word real-time speech recognizer using WFST models - 2015.pdf
2016-07-26 17:12
2.8M
Lightly supervised acoustic model training for imprecisely and asynchronously transcribed speech - 2013.pdf
2016-07-26 17:19
433K
End-to-end deep neural network for automatic speech recognition - 2015.pdf
2016-08-06 19:03
7.7K
Factored recurrent neural network language model in TED lecture transcription - 2012.pdf
2016-08-07 19:15
658K
Language model adaptation for academic lectures using character recognition result of presentation slides - 2015.pdf
2016-08-07 20:06
9.0K
Recurrent support vector machines for speech recognition - 2016.pdf
2016-08-08 10:50
110K
The Microsoft 2016 conversational speech recognition system - 2016.pdf
2016-09-12 17:58
226K
Advances in all-neural speech recognition - 2016.pdf
2016-09-20 17:37
96K
ba_dls_speech2016.pdf
2016-09-25 16:23
18M
Sonic shapes: visualizing vocal expression - 2013.pdf
2016-10-01 16:55
780K
Formalizing knowledge used in spectrogram reading: Acoustic and perceptual evidence from stops - 1988.pdf
2016-10-16 07:06
14M
Achieving human parity in conversational speech recognition - Microsoft - 2016.pdf
2016-10-17 17:38
292K
The perception of speech under adverse conditions - 2004.pdf
2016-10-19 14:22
809K
Exploiting deep neural networks for detection-based speech recognition - 2013.pdf
2016-10-19 14:35
1.0M
get.txt
2016-10-22 10:46
1.3K
Deep LSTM for large vocabulary continuous speech recognition - 2017.pdf
2017-03-21 17:25
164K
A wavenet for speech denoising - 2017.pdf
2017-06-22 17:41
685K
Generating adversarial examples for speech recognition.pdf
2017-06-26 10:59
506K
Stimulated deep neural network for speech recognition - 2016.pdf
2017-08-08 19:57
1.0M
The Microsoft 2017 conversational speech recognition system - 2017.pdf
2017-08-20 19:54
160K
url.txt
2017-08-21 05:40
1.4K
Apache/2.4.25 (Debian) Server at diyhpl.us Port 80