Pavel Denisov
Verified email at iais.fraunhofer.de
Title
Cited by
Year
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis
D Raj, P Denisov, Z Chen, H Erdogan, Z Huang, M He, S Watanabe, J Du, ...
2021 IEEE Spoken Language Technology Workshop (SLT), 897-904, 2021
Cited by: 95, Year: 2021
ESPnet-SLU: Advancing Spoken Language Understanding Through ESPnet
S Arora, S Dalmia, P Denisov, X Chang, Y Ueda, Y Peng, Y Zhang, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 77, Year: 2022
Investigations on speech recognition systems for low-resource dialectal Arabic–English code-switching speech
I Hamed, P Denisov, CY Li, M Elmahdy, S Abdennadher, NT Vu
Computer Speech & Language 72, 101278, 2022
Cited by: 44, Year: 2022
Pretrained Semantic Speech Embeddings for End-to-End Spoken Language Understanding via Cross-Modal Teacher-Student Learning
P Denisov, NT Vu
Interspeech 2020, 881-885, 2020
Cited by: 34, Year: 2020
Exploring speech recognition, translation, and understanding with discrete speech units: A comparative study
X Chang, B Yan, K Choi, JW Jung, Y Lu, S Maiti, R Sharma, J Shi, J Tian, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 28, Year: 2024
Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy
S Meyer, P Tilli, P Denisov, F Lux, J Koch, NT Vu
2022 IEEE Spoken Language Technology Workshop (SLT), 912-919, 2023
Cited by: 28, Year: 2023
End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning
P Denisov, NT Vu
Interspeech 2019, 4425-4429, 2019
Cited by: 28, Year: 2019
Speaker Anonymization with Phonetic Intermediate Representations
S Meyer, F Lux, P Denisov, J Koch, P Tilli, NT Vu
Interspeech 2022, 4925-4929, 2022
Cited by: 26, Year: 2022
Unsupervised domain adaptation by adversarial learning for robust speech recognition
P Denisov, NT Vu, MF Font
Speech Communication; 13th ITG-Symposium, 1-5, 2018
Cited by: 24, Year: 2018
The IMS Toucan System for the Blizzard Challenge 2023
F Lux, J Koch, S Meyer, T Bott, N Schauffler, P Denisov, A Schweitzer, ...
18th Blizzard Challenge Workshop, 2023
Cited by: 19, Year: 2023
Prosody Is Not Identity: A Speaker Anonymization Approach Using Prosody Cloning
S Meyer, F Lux, J Koch, P Denisov, P Tilli, NT Vu
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 18, Year: 2023
IMS-speech: A speech to text tool
P Denisov, NT Vu
Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung …, 2019
Cited by: 16, Year: 2019
ADVISER: A Toolkit for Developing Multi-modal, Multi-domain and Socially-engaged Conversational Agents
CY Li, D Ortega, D Väth, F Lux, L Vanderlyn, M Schmidt, M Neumann, ...
arXiv preprint arXiv:2005.01777, 2020
Cited by: 13, Year: 2020
Context-aware Neural-based Dialog Act Classification on Automatically Generated Transcriptions
D Ortega, CY Li, G Vallejo, P Denisov, NT Vu
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 13, Year: 2019
Findings of the Second AmericasNLP Competition on Speech-to-Text Translation
A Ebrahimi, M Mager, A Wiemerslage, P Denisov, A Oncevay, D Liu, ...
NeurIPS 2022 Competition Track 220, 217-232, 2022
Cited by: 5, Year: 2022
Cascade of Phonetic Speech Recognition, Speaker Embeddings GAN and Multispeaker Speech Synthesis for the VoicePrivacy 2022 Challenge
S Meyer, P Tilli, F Lux, P Denisov, J Koch, NT Vu
2nd Symposium on Security and Privacy in Speech Communication, 2022
Cited by: 5, Year: 2022
IMS' Systems for the IWSLT 2021 Low-Resource Speech Translation Task
P Denisov, M Mager, NT Vu
2021 International Conference on Spoken Language Translation (IWSLT), 175-181, 2021
Cited by: 5, Year: 2021
Findings of the AmericasNLP 2024 shared task on the creation of educational materials for indigenous languages
L Chiruzzo, P Denisov, A Molina-Villegas, SF Sabido, R Coto-Solano, ...
Proceedings of the 4th Workshop on Natural Language Processing for …, 2024
Cited by: 2, Year: 2024
Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training
P Denisov, T Vu
Findings of the Association for Computational Linguistics: NAACL 2024, 814–834, 2024
Cited by: 1, Year: 2024
Advancing Topic Segmentation of Broadcasted Speech with Multilingual Semantic Embeddings
SD Shukla, P Denisov, T Turan
arXiv preprint arXiv:2409.06222, 2024
Year: 2024