Quality at a glance: An audit of web-crawled multilingual datasets J Kreutzer, I Caswell, L Wang, A Wahab, D van Esch, N Ulzii-Orshikh, ... Transactions of the Association for Computational Linguistics 10, 50-72, 2022 | 182* | 2022 |
Open-source multi-speaker speech corpora for building Gujarati, Kannada, Malayalam, Marathi, Tamil and Telugu speech synthesis systems F He, SHC Chu, O Kjartansson, C Rivera, A Katanova, A Gutkin, ... Proceedings of the Twelfth Language Resources and Evaluation Conference …, 2020 | 69 | 2020 |
Crowd-Sourced Speech Corpora for Javanese, Sundanese, Sinhala, Nepali, and Bangladeshi Bengali. O Kjartansson, S Sarin, K Pipatsrisawat, M Jansche, L Ha SLTU, 52-55, 2018 | 67 | 2018 |
A Step-by-Step Process for Building TTS Voices Using Open Source Data and Frameworks for Bangla, Javanese, Khmer, Nepali, Sinhala, and Sundanese. K Sodimana, P De Silva, S Sarin, O Kjartansson, M Jansche, ... SLTU, 66-70, 2018 | 44* | 2018 |
Quality at a glance: An audit of web-crawled multilingual datasets I Caswell, J Kreutzer, L Wang, A Wahab, D van Esch, N Ulzii-Orshikh, ... arXiv e-prints, arXiv: 2103.12028, 2021 | 32 | 2021 |
Crowdsourcing Latin American Spanish for low-resource text-to-speech A Guevara-Rukoz, I Demirsahin, F He, SHC Chu, S Sarin, K Pipatsrisawat, ... Proceedings of the Twelfth Language Resources and Evaluation Conference …, 2020 | 32 | 2020 |
Building open Javanese and Sundanese corpora for multilingual text-to-speech JAE Wibawa, S Sarin, C Li, K Pipatsrisawat, K Sodimana, O Kjartansson, ... Proceedings of the Eleventh International Conference on Language Resources …, 2018 | 18* | 2018 |
Burmese speech corpus, finite-state text normalization and pronunciation grammars with an application to text-to-speech YM Oo, T Wattanavekin, C Li, P De Silva, S Sarin, K Pipatsrisawat, ... Proceedings of the Twelfth Language Resources and Evaluation Conference …, 2020 | 15 | 2020 |
Joint Equal Contribution of Global and Local Features for Image Annotation. S Sarin, W Kameyama CLEF (Working Notes), 2009 | 13 | 2009 |
On automatic contextual metadata generation for personal digital photographs S Sarin, T Nagahashi, T Miyosawa, W Kameyama The 9th International Conference on Advanced Communication Technology 1, 66-71, 2007 | 10 | 2007 |
Google crowdsourced speech corpora and related open-source resources for low-resource languages and dialects: an overview A Butryna, SHC Chu, I Demirsahin, A Gutkin, L Ha, F He, M Jansche, ... arXiv preprint arXiv:2010.06778, 2020 | 8 | 2020 |
Building ASR Corpora Using Eyra. J Guðnason, M Pétursson, R Kjaran, S Klüpfel, AB Nikulásdóttir INTERSPEECH, 2173-2177, 2017 | 8 | 2017 |
Holistic feature extraction for automatic image annotation S Sarin, M Fahrmair, M Wagner, W Kameyama 2011 Fifth FTRA International Conference on Multimedia and Ubiquitous …, 2011 | 8 | 2011 |
Towards location recognition using range images A Al-Nuaimi, R Huitl, S Taifour, S Sarin, X Song, YX Gu, E Steinbach, ... 2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW), 1-6, 2013 | 6 | 2013 |
Exploiting users' personal and public information for personal photo annotation S Sarin, T Nagahashi, T Miyosawa, W Kameyama 2007 IEEE International Conference on Multimedia and Expo, 564-567, 2007 | 5 | 2007 |
Crowdsource by Google: A Platform for Collecting Inclusive and Representative Machine Learning Data S Sarin, K Pipatsrisawat, K Pham, A Batra, L Valente HCOMP 2019, 2019 | 4 | 2019 |
Targeting diversity in photographic retrieval task with commonsense knowledge S Sarin, W Kameyama CEUR Workshop Proceedings 1174, 2008 | 4 | 2008 |
Affective and Holistic Approach at TRECVID 2010 Task-Semantic Indexing (SIN). KM Ong, S Sarin, W Kameyama TRECVID, 2010 | 3 | 2010 |
On the design and exploitation of user's personal and public information for semantic personal digital photograph annotation S Sarin, T Nagahashi, T Miyosawa, W Kameyama Advances in Multimedia 2008, 2008 | 3 | 2008 |
Text normalization for Bangla, Khmer, Nepali … K Sodimana, P De Silva, R Sproat, A Theeraphol, C Li, A Gutkin, S Sarin, K Pipatsrisawat 6th International Workshop on Spoken Language Technologies for Under …, 2018 | 2 | 2018 |