Jan "Yenda" Trmal
Associate Research Scientist at Johns Hopkins University
Verified email at jhu.edu
Title
Cited by
Year
Improving deep neural network acoustic models using generalized maxout networks
X Zhang, J Trmal, D Povey, S Khudanpur
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
Cited by: 387, Year: 2014
A pitch extraction algorithm tuned for automatic speech recognition
P Ghahremani, B BabaAli, D Povey, K Riedhammer, J Trmal, ...
Cited by: 349, Year: 2014
The fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines
J Barker, S Watanabe, E Vincent, J Trmal
arXiv preprint arXiv:1803.10609, 2018
Cited by: 312, Year: 2018
Multi-task self-supervised learning for Robust Speech Recognition
M Ravanelli, J Zhong, S Pascual, P Swietojanski, J Monteiro, J Trmal, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 208, Year: 2020
CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings
S Watanabe, M Mandel, J Barker, E Vincent, A Arora, X Chang, ...
arXiv preprint arXiv:2004.09249, 2020
Cited by: 190, Year: 2020
Using proxies for OOV keywords in the keyword search task
G Chen, O Yilmaz, J Trmal, D Povey, S Khudanpur
2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 416-421, 2013
Cited by: 113, Year: 2013
Quantifying the value of pronunciation lexicons for keyword search in low resource languages
G Chen, S Khudanpur, D Povey, J Trmal, D Yarowsky, O Yilmaz
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International …, 2013
Cited by: 60, Year: 2013
GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio
G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ...
arXiv preprint arXiv:2106.06909, 2021
Cited by: 55, Year: 2021
A keyword search system using open source software
J Trmal, G Chen, D Povey, S Khudanpur, P Ghahremani, X Zhang, ...
Proceedings of SLT 2014; (accepted), 2014
Cited by: 52, Year: 2014
The Kaldi OpenKWS System: Improving Low Resource Keyword Search.
J Trmal, M Wiesner, V Peddinti, X Zhang, P Ghahremani, Y Wang, ...
INTERSPEECH, 3597-3601, 2017
Cited by: 44, Year: 2017
Adaptation of a feedforward artificial neural network using a linear transform
J Trmal, J Zelinka, L Müller
Text, Speech and Dialogue, 423-430, 2010
Cited by: 42, Year: 2010
Optimized acoustic likelihoods computation for NVIDIA and ATI/AMD graphics processors
J Vaněk, J Trmal, JV Psutka, J Psutka
Audio, Speech, and Language Processing, IEEE Transactions on 20 (6), 1818-1828, 2012
Cited by: 24, Year: 2012
Using ASR methods for OCR
A Arora, CC Chang, B Rekabdar, B BabaAli, D Povey, D Etter, D Raj, ...
2019 International Conference on Document Analysis and Recognition (ICDAR …, 2019
Cited by: 20, Year: 2019
Novel Approach to Live Captioning Through Re-speaking: Tailoring Speech Recognition to Re-speaker's Needs.
A Prazák, Z Loose, J Trmal, JV Psutka, J Psutka
INTERSPEECH, 2012
Cited by: 20, Year: 2012
Voice-supported electronic health record for temporomandibular joint disorders.
R Hippmann, T Dostálová, J Zvárová, M Nagy, M Seydlová, P Hanzlicek, ...
Methods of information in medicine 49 (2), 168-172, 2009
Cited by: 20, Year: 2009
Voice-controlled data entry in dental electronic health record.
M Nagy, P Hanzlicek, J Zvarova, T Dostalova, M Seydlova, R Hippman, ...
Studies in health technology and informatics 136, 529, 2008
Cited by: 20, Year: 2008
Adversarial Attacks and Defenses for Speech Recognition Systems
P Żelasko, S Joshi, Y Shao, J Villalba, J Trmal, N Dehak, S Khudanpur
arXiv preprint arXiv:2103.17122, 2021
Cited by: 18, Year: 2021
Combination of FST and CN search in spoken term detection
J Chiu, Y Wang, J Trmal, D Povey, G Chen, A Rudnicky
Proc. Interspeech, 2014
Cited by: 18, Year: 2014
Topic Identification for Speech without ASR
C Liu, J Trmal, M Wiesner, C Harman, S Khudanpur
arXiv preprint arXiv:1703.07476, 2017
Cited by: 17, Year: 2017
Low-Resource Open Vocabulary Keyword Search Using Point Process Models
C Liu, A Jansen, G Chen, K Kintzley, J Trmal, S Khudanpur
Fifteenth Annual Conference of the International Speech Communication …, 2014
Cited by: 16, Year: 2014