Ziyang Ma

Zitiert von

	Alle	Seit 2019
Zitate	85	85
h-index	4	4
i10-index	3	3

2022202320248 27 50

Öffentlicher Zugriff

Alle anzeigen

1 Artikel

0 Artikel

verfügbar

nicht verfügbar

Basierend auf Fördermandaten

Koautoren

Xie ChenShanghai Jiao Tong UniversityBestätigte E-Mail-Adresse bei sjtu.edu.cn
ShiLiang ZhangSpeechLab，AlibabaBestätigte E-Mail-Adresse bei mail.ustc.edu.cn
Qian Chen (陈谦)Alibaba GroupBestätigte E-Mail-Adresse bei alibaba-inc.com
Changli TangTsinghua UniversityBestätigte E-Mail-Adresse bei mails.tsinghua.edu.cn
Kai Yu（俞凯）Shanghai Jiao Tong UniversityBestätigte E-Mail-Adresse bei sjtu.edu.cn
gao zhifuSpeech Lab, Alibaba GroupBestätigte E-Mail-Adresse bei alibaba-inc.com
Siqi ZhengDAMO Academy, Alibaba GroupBestätigte E-Mail-Adresse bei mail.harvard.edu
Yiwei GuoShanghai Jiao Tong UniversityBestätigte E-Mail-Adresse bei sjtu.edu.cn
Yifan YangMachine Learning Engineer, Xiaomi Corp.Bestätigte E-Mail-Adresse bei xiaomi.com
Xuemeng SongShandong UniversityBestätigte E-Mail-Adresse bei sdu.edu.cn
Liqiang Nie (聂礼强), IAPR FellowHarbin Institute of Technology (Shenzhen)Bestätigte E-Mail-Adresse bei hit.edu.cn
Wen WuUniversity of CambridgeBestätigte E-Mail-Adresse bei cam.ac.uk
Xuenan XuShanghai Jiao Tong UniversityBestätigte E-Mail-Adresse bei sjtu.edu.cn
Qi ChenShanghai Jiao Tong UniversityBestätigte E-Mail-Adresse bei sjtu.edu.cn
Ruibin YuanHKUSTBestätigte E-Mail-Adresse bei andrew.cmu.edu
Ge ZhangUniversity of WaterlooBestätigte E-Mail-Adresse bei stardust.ai
Jiaxin YePh.D. Student, Fudan UniversityBestätigte E-Mail-Adresse bei m.fudan.edu.cn

Folgen

Ziyang Ma

Shanghai Jiao Tong University

Bestätigte E-Mail-Adresse bei sjtu.edu.cn - Startseite

Speech and Language Processing Textless NLP Self-supervised Learning Multimedia


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
MT4SSL: Boosting self-supervised speech representation learning by integrating multiple targets Z Ma, Z Zheng, C Tang, Y Wang, X Chen Proc. Interspeech 2023, 2022	18	2022
Lauragpt: Listen, attend, understand, and regenerate audio with gpt Q Chen, Y Chu, Z Gao, Z Li, K Hu, X Zhou, J Xu, Z Ma, W Wang, S Zheng, ... arXiv preprint arXiv:2310.04673, 2023	11	2023
Hierarchical deep residual reasoning for temporal moment localization Z Ma, X Han, X Song, Y Cui, L Nie Proceedings of the 3rd ACM International Conference on Multimedia in Asia, 1-7, 2021	10	2021
Leveraging speech ptm, text llm, and emotional tts for speech emotion recognition Z Ma, W Wu, Z Zheng, Y Guo, Q Chen, S Zhang, X Chen ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	4	2024
ELLA-V: Stable Neural Codec Language Modeling with Alignment-guided Sequence Reordering Y Song, Z Chen, X Wang, Z Ma, X Chen arXiv preprint arXiv:2401.07333, 2024	4	2024
Pushing the Limits of Unsupervised Unit Discovery for SSL Speech Representation Z Ma, Z Zheng, G Yang, Y Wang, C Zhang, X Chen Proc. Interspeech 2023, 2023	4	2023
Tessp: text-enhanced self-supervised speech pre-training Z Yao, S Ren, S Chen, Z Ma, P Guo, L Xie arXiv preprint arXiv:2211.13443, 2022	4	2022
VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching Y Guo, C Du, Z Ma, X Chen, K Yu ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	3*	2024
ChatMusician: Understanding and Generating Music Intrinsically with LLM R Yuan, H Lin, Y Wang, Z Tian, S Wu, T Shen, G Zhang, Y Wu, C Liu, ... arXiv preprint arXiv:2402.16153, 2024	3	2024
EAT: Self-Supervised Pre-Training with Efficient Audio Transformer W Chen, Y Liang, Z Ma, Z Zheng, X Chen arXiv preprint arXiv:2401.03497, 2024	3	2024
Fast-Hubert: an Efficient Training Framework for Self-Supervised Speech Representation Learning G Yang, Z Ma, Z Zheng, Y Song, Z Niu, X Chen 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023	3	2023
Front-end adapter: Adapting front-end input of speech based self-supervised learning for speech recognition X Chen, Z Ma, C Tang, Y Wang, Z Zheng ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	3	2023
Improving few-shot learning for talking face system with tts data augmentation Q Chen, Z Ma, T Liu, X Tan, Q Lu, K Yu, X Chen ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	3	2023
Towards universal speech discrete tokens: A case study for asr and tts Y Yang, F Shen, C Du, Z Ma, K Yu, D Povey, X Chen ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	2	2024
emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation Z Ma, Z Zheng, J Ye, J Li, Z Gao, S Zhang, X Chen arXiv preprint arXiv:2312.15185, 2023	2	2023
Unsupervised Active Learning: Optimizing Labeling Cost-Effectiveness for Automatic Speech Recognition Z Zheng, Z Ma, Y Wang, X Chen Proc. Interspeech 2023, 2023	2	2023
Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation Z Liang, Z Song, Z Ma, C Du, K Yu, X Chen Proc. Interspeech 2023, 2023	2	2023
Hourglass-AVSR: Down-Up Sampling-Based Computational Efficiency Model for Audio-Visual Speech Recognition F Yu, H Wang, Z Ma, S Zhang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	1	2024
MuPT: A Generative Symbolic Music Pretrained Transformer X Qu, Y Bai, Y Ma, Z Zhou, KM Lo, J Liu, R Yuan, L Min, X Liu, T Zhang, ... arXiv preprint arXiv:2404.06393, 2024	1	2024
Exploring effective distillation of self-supervised speech models for automatic speech recognition Y Wang, C Tang, Z Ma, Z Zheng, X Chen, WQ Zhang 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-6, 2023	1	2023

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–20

Zitate pro Jahr

Doppelte Zitate

Zusammengeführte Zitate

Koautor hinzufügenKoautoren

Folgen

Zitiert von

Koautoren