Suivre
Jan "Yenda" Trmal
Jan "Yenda" Trmal
Associate Research Scientist at Johns Hopkins University
Adresse e-mail validée de jhu.edu
Titre
Citée par
Citée par
Année
Improving deep neural network acoustic models using generalized maxout networks
X Zhang, J Trmal, D Povey, S Khudanpur
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
3992014
The fifth'CHiME'Speech Separation and Recognition Challenge: Dataset, task and baselines
J Barker, S Watanabe, E Vincent, J Trmal
arXiv preprint arXiv:1803.10609, 2018
3972018
A PITCH EXTRACTION ALGORITHM TUNED FOR AUTOMATIC SPEECH RECOGNITION
P Ghahremani, B BabaAli, D Povey, K Riedhammer, J Trmal, ...
3832014
Multi-task self-supervised learning for Robust Speech Recognition
M Ravanelli, J Zhong, S Pascual, P Swietojanski, J Monteiro, J Trmal, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
2932020
CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings
S Watanabe, M Mandel, J Barker, E Vincent, A Arora, X Chang, ...
arXiv preprint arXiv:2004.09249, 2020
2872020
GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio
G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ...
arXiv preprint arXiv:2106.06909, 2021
1592021
Using proxies for OOV keywords in the keyword search task
G Chen, O Yilmaz, J Trmal, D Povey, S Khudanpur
2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 416-421, 2013
1162013
Quantifying the value of pronunciation lexicons for keyword search in low resource languages
G Chen, S Khudanpur, D Povey, J Trmal, D Yarowsky, O Yilmaz
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International …, 2013
632013
A KEYWORD SEARCH SYSTEM USING OPEN SOURCE SOFTWARE
J Trmal, G Chen, D Povey, S Khudanpur, P Ghahremani, X Zhang, ...
Proceedings of SLT 2014; (accepted), 2014
502014
The Kaldi OpenKWS System: Improving Low Resource Keyword Search.
J Trmal, M Wiesner, V Peddinti, X Zhang, P Ghahremani, Y Wang, ...
INTERSPEECH, 3597-3601, 2017
472017
Adaptation of a feedforward artificial neural network using a linear transform
J Trmal, J Zelinka, L Müller
Text, Speech and Dialogue, 423-430, 2010
442010
DiPCo--Dinner Party Corpus
M Van Segbroeck, A Zaid, K Kutsenko, C Huerta, T Nguyen, X Luo, ...
arXiv preprint arXiv:1909.13447, 2019
302019
Optimized acoustic likelihoods computation for NVIDIA and ATI/AMD graphics processors
J Vaněk, J Trmal, JV Psutka, J Psutka
Audio, Speech, and Language Processing, IEEE Transactions on 20 (6), 1818-1828, 2012
262012
Adversarial Attacks and Defenses for Speech Recognition Systems
P Żelasko, S Joshi, Y Shao, J Villalba, J Trmal, N Dehak, S Khudanpur
arXiv preprint arXiv:2103.17122, 2021
242021
Using ASR methods for OCR
A Arora, CC Chang, B Rekabdar, B BabaAli, D Povey, D Etter, D Raj, ...
2019 International Conference on Document Analysis and Recognition (ICDAR …, 2019
232019
Topic Identification for Speech without ASR
C Liu, J Trmal, M Wiesner, C Harman, S Khudanpur
arXiv preprint arXiv:1703.07476, 2017
212017
Novel Approach to Live Captioning Through Re-speaking: Tailoring Speech Recognition to Re-speaker's Needs.
A Prazák, Z Loose, J Trmal, JV Psutka, J Psutka
INTERSPEECH, 2012
212012
Voice-supported electronic health record for temporomandibular joint disorders.
R Hippmann, T Dostálová, J Zvárová, M Nagy, M Seydlová, P Hanzlicek, ...
Methods of information in medicine 49 (2), 168-172, 2009
212009
Voice-controlled data entry in dental electronic health record.
M Nagy, P Hanzlicek, J Zvarova, T Dostalova, M Seydlova, R Hippman, ...
Studies in health technology and informatics 136, 529, 2008
202008
Combination of FST and CN search in spoken term detection
J Chiu, Y Wang, J Trmal, D Povey, G Chen, A Rudnicky
Proc. Interspeech, 2014
172014
Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.
Articles 1–20