Folgen
Jiangyan Yi
Titel
Zitiert von
Zitiert von
Jahr
Add 2022: the first audio deep synthesis detection challenge
J Yi, R Fu, J Tao, S Nie, H Ma, C Wang, T Wang, Z Tian, Y Bai, C Fan, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
1212022
Gated recurrent fusion with joint training framework for robust end-to-end speech recognition
C Fan, J Yi, J Tao, Z Tian, B Liu, Z Wen
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 198-209, 2020
782020
Self-attention transducers for end-to-end speech recognition
Z Tian, J Yi, J Tao, Y Bai, Z Wen
INTERSPEECH, 2019
772019
Synchronous transformers for end-to-end speech recognition
Z Tian, J Yi, Y Bai, J Tao, S Zhang, Z Wen
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
712020
Continuous multimodal emotion prediction based on long short term memory recurrent neural network
J Huang, Y Li, J Tao, Z Lian, Z Wen, M Yang, J Yi
Proceedings of the 7th Annual Workshop on Audio/Visual Emotion Challenge, 11-18, 2017
702017
Self-attention based model for punctuation prediction using word and speech embeddings
J Yi, J Tao
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
682019
Language-adversarial transfer learning for low-resource speech recognition
J Yi, J Tao, Z Wen, Y Bai
IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (3), 621-630, 2018
602018
Spike-triggered non-autoregressive transformer for end-to-end speech recognition
Z Tian, J Yi, J Tao, Y Bai, S Zhang, Z Wen
INTERSPEECH, 2020
592020
Fast end-to-end speech recognition via non-autoregressive models and cross-modal knowledge transferring from BERT
Y Bai, J Yi, J Tao, Z Tian, Z Wen, S Zhang
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1897-1911, 2021
552021
Half-truth: A partially fake audio detection dataset
J Yi, Y Bai, J Tao, Z Tian, C Wang, T Wang, R Fu
INTERSPEECH, 2021
512021
CTC regularized model adaptation for improving LSTM RNN based multi-accent mandarin speech recognition
J Yi, Z Wen, J Tao, H Ni, B Liu
Journal of Signal Processing Systems 90, 985-997, 2018
482018
Listen attentively, and spell once: Whole sentence generation via a non-autoregressive architecture for low-latency speech recognition
Y Bai, J Yi, J Tao, Z Tian, Z Wen, S Zhang
INTERSPEECH, 2020
432020
Adversarial transfer learning for punctuation restoration
J Yi, J Tao, Y Bai, Z Tian, C Fan
arXiv preprint arXiv:2004.00248, 2020
422020
ADD 2023: the Second Audio Deepfake Detection Challenge
J Yi, J Tao, R Fu, X Yan, C Wang, T Wang, CY Zhang, X Zhang, Y Zhao, ...
arXiv preprint arXiv:2305.13774, 2023
382023
End-to-end post-filter for speech separation with deep attention fusion features
C Fan, J Tao, B Liu, J Yi, Z Wen, X Liu
IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1303-1314, 2020
382020
Learn spelling from teachers: Transferring knowledge from language models to sequence-to-sequence speech recognition
Y Bai, J Yi, J Tao, Z Tian, Z Wen
INTERSPEECH, 2019
372019
Speech emotion recognition using semi-supervised learning with ladder networks
J Huang, Y Li, J Tao, Z Lian, M Niu, J Yi
2018 First Asian conference on affective computing and intelligent …, 2018
362018
A Time Delay Neural Network with Shared Weight Self-Attention for Small-Footprint Keyword Spotting.
Y Bai, J Yi, J Tao, Z Wen, Z Tian, C Zhao, C Fan
INTERSPEECH, 2190-2194, 2019
342019
Distilling Knowledge from an Ensemble of Models for Punctuation Prediction.
J Yi, J Tao, Z Wen, Y Li
Interspeech, 2779-2783, 2017
342017
End-to-end continuous emotion recognition from video using 3D ConvLSTM networks
J Huang, Y Li, J Tao, Z Lian, J Yi
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
322018
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–20