Follow
Sungwon Kim
Sungwon Kim
NVIDIA, Seoul National University
Verified email at nvidia.com
Title
Cited by
Cited by
Year
ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models
J Choi, S Kim, Y Jeong, Y Gwon, S Yoon
ICCV 2021 (arXiv preprint arXiv:2108.02938), 2021
442*2021
Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search
J Kim, S Kim, J Kong, S Yoon
Advances in Neural Information Processing Systems 33 (NeurIPS 2020), 2020
4062020
FloWaveNet: A generative flow for raw audio
S Kim, S Lee, J Song, J Kim, S Yoon
Proceedings of the International Conference on Machine Learning (ICML), 2018
1952018
Perception Prioritized Training of Diffusion Models
J Choi, J Lee, C Shin, S Kim, H Kim, S Yoon
CVPR 2022 (arXiv preprint arXiv:2204.00227), 2022
1232022
Guided-TTS: A Diffusion Model for Text-to-Speech via Classifier Guidance
H Kim, S Kim, S Yoon
Proceedings of the International Conference on Machine Learning (ICML), 2021
602021
Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untranscribed Data
S Kim, H Kim, S Yoon
arXiv preprint arXiv:2205.15370, 2022
342022
AligNART: Non-autoregressive Neural Machine Translation by Jointly Learning to Estimate Alignment and Translate
J Song, S Kim, S Yoon
EMNLP 2021 (arXiv preprint arXiv:2109.06481), 2021
312021
FICGAN: Facial Identity Controllable GAN for De-identification
Y Jeong, J Choi, S Kim, Y Ro, TH Oh, D Kim, H Ha, S Yoon
arXiv preprint arXiv:2110.00740, 2021
122021
NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity
S Lee, S Kim, S Yoon
Advances in Neural Information Processing Systems 33 (NeurIPS 2020), 2020
122020
UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data
H Kim, S Kim, J Yeom, S Yoon
InterSpeech 2023, 2023
52023
P-Flow: A Fast and Data-Efficient Zero-Shot TTS through Speech Prompting
S Kim, K Shih, JF Santos, E Bakhturina, M Desta, R Valle, S Yoon, ...
Advances in Neural Information Processing Systems 36, 2024
12024
Scaling NVIDIA's multi-speaker multi-lingual TTS systems with voice cloning to Indic Languages
A Arora, R Badlani, S Kim, R Valle, B Catanzaro
arXiv preprint arXiv:2401.13851, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–12