Folgen
Zhengkun Tian
Zhengkun Tian
Meituan Inc.
Bestätigte E-Mail-Adresse bei meituan.com
Titel
Zitiert von
Zitiert von
Jahr
Add 2022: the first audio deep synthesis detection challenge
J Yi, R Fu, J Tao, S Nie, H Ma, C Wang, T Wang, Z Tian, Y Bai, C Fan, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
1222022
Gated recurrent fusion with joint training framework for robust end-to-end speech recognition
C Fan, J Yi, J Tao, Z Tian, B Liu, Z Wen
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 198-209, 2020
782020
Self-attention transducers for end-to-end speech recognition
Z Tian, J Yi, J Tao, Y Bai, Z Wen
Interspeech 2019, 4395--4399, 2019
772019
Synchronous Transformers for End-to-End Speech Recognition
Z Tian, J Yi, Y Bai, J Tao, S Zhang, Z Wen
ICASSP 2020, 2019
712019
Spike-triggered non-autoregressive transformer for end-to-end speech recognition
Z Tian, J Yi, J Tao, Y Bai, S Zhang, Z Wen
arXiv preprint arXiv:2005.07903, 2020
592020
Fast end-to-end speech recognition via non-autoregressive models and cross-modal knowledge transferring from BERT
Y Bai, J Yi, J Tao, Z Tian, Z Wen, S Zhang
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1897-1911, 2021
552021
Half-truth: A partially fake audio detection dataset
J Yi, Y Bai, J Tao, H Ma, Z Tian, C Wang, T Wang, R Fu
arXiv preprint arXiv:2104.03617, 2021
522021
Listen attentively, and spell once: Whole sentence generation via a non-autoregressive architecture for low-latency speech recognition
Y Bai, J Yi, J Tao, Z Tian, Z Wen, S Zhang
arXiv preprint arXiv:2005.04862, 2020
432020
Adversarial transfer learning for punctuation restoration
J Yi, J Tao, Y Bai, Z Tian, C Fan
arXiv preprint arXiv:2004.00248, 2020
422020
Learn spelling from teachers: Transferring knowledge from language models to sequence-to-sequence speech recognition
Y Bai, J Yi, J Tao, Z Tian, Z Wen
arXiv preprint arXiv:1907.06017, 2019
372019
A Time Delay Neural Network with Shared Weight Self-Attention for Small-Footprint Keyword Spotting.
Y Bai, J Yi, J Tao, Z Wen, Z Tian, C Zhao, C Fan
INTERSPEECH, 2190-2194, 2019
342019
Rnn-transducer with language bias for end-to-end mandarin-english code-switching speech recognition
S Zhang, J Yi, Z Tian, J Tao, Y Bai
2021 12th International Symposium on Chinese Spoken Language Processing …, 2021
262021
A large-scale Chinese multimodal NER dataset with speech clues
D Sui, Z Tian, Y Chen, K Liu, J Zhao
Proceedings of the 59th Annual Meeting of the Association for Computational …, 2021
252021
Continual learning for fake audio detection
H Ma, J Yi, J Tao, Y Bai, Z Tian, C Wang
arXiv preprint arXiv:2104.07286, 2021
252021
Focal Loss for Punctuation Prediction.
J Yi, J Tao, Z Tian, Y Bai, C Fan
Interspeech, 721-725, 2020
212020
Deep imitator: Handwriting calligraphy imitation via deep attention networks
B Zhao, J Tao, M Yang, Z Tian, C Fan, Y Bai
Pattern Recognition 104, 107080, 2020
192020
Fully automated end-to-end fake audio detection
C Wang, J Yi, J Tao, H Sun, X Chen, Z Tian, H Ma, C Fan, R Fu
Proceedings of the 1st International Workshop on Deepfake Detection for …, 2022
162022
Decoupling pronunciation and language for end-to-end code-switching automatic speech recognition
S Zhang, J Yi, Z Tian, Y Bai, J Tao
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
142021
Fsr: Accelerating the inference process of transducer-based models by applying fast-skip regularization
Z Tian, J Yi, Y Bai, J Tao, S Zhang, Z Wen
arXiv preprint arXiv:2104.02882, 2021
142021
Hybrid autoregressive and non-autoregressive transformer models for speech recognition
Z Tian, J Yi, J Tao, S Zhang, Z Wen
IEEE Signal Processing Letters 29, 762-766, 2022
132022
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–20