Text-based editing of talking-head video O Fried, A Tewari, M Zollhöfer, A Finkelstein, E Shechtman, DB Goldman, ... ACM Transactions on Graphics (TOG) 38 (4), 1-14, 2019 | 284 | 2019 |
HiFi-GAN: High-fidelity denoising and dereverberation based on speech deep features in adversarial networks J Su, Z Jin, A Finkelstein arXiv preprint arXiv:2006.05694, 2020 | 168 | 2020 |
FFTNet: A real-time speaker-dependent neural vocoder Z Jin, A Finkelstein, GJ Mysore, J Lu 2018 IEEE international conference on acoustics, speech and signal …, 2018 | 137 | 2018 |
F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder K Qian, Z Jin, M Hasegawa-Johnson, GJ Mysore ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 119 | 2020 |
HEAR: Holistic evaluation of audio representations J Turian, J Shier, HR Khan, B Raj, BW Schuller, CJ Steinmetz, C Malloy, ... NeurIPS 2021 Competitions and Demonstrations Track, 125-145, 2022 | 104 | 2022 |
VoCo: Text-based insertion and replacement in audio narration Z Jin, GJ Mysore, S DiVerdi, J Lu, A Finkelstein ACM Transactions on Graphics (TOG) 36 (4), 1-13, 2017 | 84 | 2017 |
A differentiable perceptual audio metric learned from just noticeable differences P Manocha, A Finkelstein, R Zhang, NJ Bryan, GJ Mysore, Z Jin arXiv preprint arXiv:2001.04460, 2020 | 82 | 2020 |
CDPAM: Contrastive learning for perceptual audio similarity P Manocha, Z Jin, R Zhang, A Finkelstein ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 75 | 2021 |
Controllable deep melody generation via hierarchical music structure representation S Dai, Z Jin, C Gomes, RB Dannenberg arXiv preprint arXiv:2109.00663, 2021 | 61 | 2021 |
Music creation by example E Frid, C Gomes, Z Jin Proceedings of the 2020 CHI conference on human factors in computing systems …, 2020 | 60 | 2020 |
HiFi-GAN-2: Studio-quality speech enhancement via generative adversarial networks conditioned on acoustic features J Su, Z Jin, A Finkelstein 2021 IEEE Workshop on Applications of Signal Processing to Audio and …, 2021 | 57 | 2021 |
CUTE: A concatenative method for voice conversion using exemplar-based unit selection Z Jin, A Finkelstein, S DiVerdi, J Lu, GJ Mysore 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 51 | 2016 |
Disentangled multidimensional metric learning for music similarity J Lee, NJ Bryan, J Salamon, Z Jin, J Nam ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 44 | 2020 |
Bandwidth extension is all you need J Su, Y Wang, A Finkelstein, Z Jin ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 43 | 2021 |
Metric learning vs classification for disentangled music representation learning J Lee, NJ Bryan, J Salamon, Z Jin, J Nam arXiv preprint arXiv:2008.03729, 2020 | 41 | 2020 |
Pose2Pose: Pose selection and transfer for 2D character animation NS Willett, HV Shin, Z Jin, W Li, A Finkelstein Proceedings of the 25th International Conference on Intelligent User …, 2020 | 35 | 2020 |
Acoustic matching by embedding impulse responses J Su, Z Jin, A Finkelstein ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 34 | 2020 |
Context-aware prosody correction for text-based speech editing M Morrison, L Rencker, Z Jin, NJ Bryan, JP Caceres, B Pardo ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 33 | 2021 |
Controllable neural prosody synthesis M Morrison, Z Jin, J Salamon, NJ Bryan, GJ Mysore arXiv preprint arXiv:2008.03388, 2020 | 23 | 2020 |
Perceptually-motivated environment-specific speech enhancement J Su, A Finkelstein, Z Jin ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 23 | 2019 |