Follow
Zhuo Chen
Zhuo Chen
Bytedance (formerly Microsoft, Columbia University)
Verified email at columbia.edu
Title
Cited by
Cited by
Year
Wavlm: Large-scale self-supervised pre-training for full stack speech processing
S Chen, C Wang, Z Chen, Y Wu, S Liu, Z Chen, J Li, N Kanda, T Yoshioka, ...
IEEE Journal of Selected Topics in Signal Processing 16 (6), 1505-1518, 2022
16582022
Deep clustering: Discriminative embeddings for segmentation and separation
JR Hershey, Z Chen, J Le Roux, S Watanabe
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
15952016
Dual-path rnn: efficient long sequence modeling for time-domain single-channel speech separation
Y Luo, Z Chen, T Yoshioka
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
8362020
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
C Wang, S Chen, Y Wu, Z Zhang, L Zhou, S Liu, Z Chen, Y Liu, H Wang, ...
arXiv preprint arXiv:2301.02111, 2023
5522023
Deep attractor network for single-microphone speaker separation
Z Chen, Y Luo, N Mesgarani
arXiv preprint arXiv:1611.08930, 2016
5132016
Single-channel multi-speaker separation using deep clustering
Y Isik, JL Roux, Z Chen, S Watanabe, JR Hershey
arXiv preprint arXiv:1607.02173, 2016
5052016
Speaker-independent speech separation with deep attractor network
Y Luo, Z Chen, N Mesgarani
IEEE/ACM Transactions on Audio, Speech, and Language Processing 26 (4), 787-796, 2018
2982018
BEATs: Audio Pre-Training with Acoustic Tokenizers
S Chen, Y Wu, C Wang, S Liu, D Tompkins, Z Chen, F Wei
arXiv preprint arXiv:2212.09058, 2022
2462022
Continuous speech separation: Dataset and analysis
Z Chen, T Yoshioka, L Lu, T Zhou, Z Meng, Y Luo, J Wu, X Xiao, J Li
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
2252020
Deep clustering and conventional networks for music separation: Stronger together
Y Luo, Z Chen, JR Hershey, J Le Roux, N Mesgarani
2017 IEEE international conference on acoustics, speech and signal …, 2017
2142017
End-to-end attention based text-dependent speaker verification
SX Zhang, Z Chen, Y Zhao, J Li, Y Gong
2016 IEEE Spoken Language Technology Workshop (SLT), 171-178, 2016
2082016
End-to-end microphone permutation and number invariant multi-channel speech separation
Y Luo, Z Chen, N Mesgarani, T Yoshioka
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1902020
Integration of speech enhancement and recognition using long-short term memory recurrent neural network
Z Chen, S Watanabe, H Erdogan, J Hershey
Proc. Interspeech, 1-7, 2015
1862015
Multi-Channel Overlapped Speech Recognition with Location Guided Speech Extraction Network
Z Chen, X Xiao, T Yoshioka, H Erdogan, J Li, Y Gong
2018 IEEE Spoken Language Technology Workshop (SLT), 558-565, 2018
1552018
Speaker-invariant training via adversarial learning
Z Meng, J Li, Z Chen, Y Zhao, V Mazalov, Y Gong, BH Juang
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
1492018
Continuous speech separation with conformer
S Chen, Y Wu, Z Chen, J Wu, J Li, T Yoshioka, C Wang, S Liu, M Zhou
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
1442021
Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling
Z Zhang, L Zhou, C Wang, S Chen, Y Wu, S Liu, Z Chen, Y Liu, H Wang, ...
arXiv preprint arXiv:2303.03926, 2023
1422023
Multi-Microphone Neural Speech Separation for Far-Field Multi-Talker Speech Recognition
T Yoshioka, H Erdogan, Z Chen, F Alleva
ICASSP 2018, 2018
1342018
Neural decoding of attentional selection in multi-speaker environments without access to clean sources
J O’Sullivan, Z Chen, J Herrero, GM McKhann, SA Sheth, AD Mehta, ...
Journal of Neural Engineering 14 (5), 056001, 2017
1342017
Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks
T Yoshioka, H Erdogan, Z Chen, X Xiao, F Alleva
Interspeech 2018, 2018
1002018
The system can't perform the operation now. Try again later.
Articles 1–20