Temporal action detection using a statistical language model A Richard, J Gall Proceedings of the IEEE conference on computer vision and pattern …, 2016 | 232 | 2016 |
Weakly supervised action learning with RNN based fine-to-coarse modeling A Richard, H Kuehne, J Gall Proceedings of the IEEE conference on Computer Vision and Pattern …, 2017 | 196 | 2017 |
When will you do what? - Anticipating temporal occurrences of activities Y Abu Farha, A Richard, J Gall Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 139 | 2018 |
NeuralNetwork-Viterbi: A framework for weakly supervised video learning A Richard, H Kuehne, A Iqbal, J Gall Proceedings of the IEEE conference on Computer Vision and Pattern …, 2018 | 112 | 2018 |
Weakly supervised learning of actions from transcripts H Kuehne, A Richard, J Gall Computer Vision and Image Understanding 163, 78-89, 2017 | 108 | 2017 |
Mean-normalized stochastic gradient for large-scale deep learning S Wiesler, A Richard, R Schlüter, H Ney 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 80 | 2014 |
Action sets: Weakly supervised action segmentation without ordering constraints A Richard, H Kuehne, J Gall Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018 | 78 | 2018 |
A hybrid RNN-HMM approach for weakly supervised temporal action segmentation H Kuehne, A Richard, J Gall IEEE Transactions on Pattern Analysis and Machine Intelligence 42 (4), 765-779, 2018 | 59 | 2018 |
A bag-of-words equivalent recurrent neural network for action recognition A Richard, J Gall Computer Vision and Image Understanding 156, 79-91, 2017 | 56 | 2017 |
RASR/NN: The RWTH neural network toolkit for speech recognition S Wiesler, A Richard, P Golik, R Schlüter, H Ney 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 54 | 2014 |
MeshTalk: 3D face animation from speech using cross-modality disentanglement A Richard, M Zollhöfer, Y Wen, F De la Torre, Y Sheikh Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 42 | 2021 |
Audio- and gaze-driven facial animation of codec avatars A Richard, C Lea, S Ma, J Gall, F De la Torre, Y Sheikh Proceedings of the IEEE/CVF winter conference on applications of computer …, 2021 | 37 | 2021 |
Neural synthesis of binaural speech from mono audio A Richard, D Marković, ID Gebru, S Krenn, GA Butler, F De la Torre, Y Sheikh International Conference on Learning Representations, 2021 | 26 | 2021 |
Conditional diffusion probabilistic model for speech enhancement YJ Lu, ZQ Wang, S Watanabe, A Richard, C Yu, Y Tsao ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 23 | 2022 |
Mining YouTube - A dataset for learning fine-grained action concepts from webly supervised video data H Kuehne, A Iqbal, A Richard, J Gall arXiv preprint arXiv:1906.01012, 2019 | 16 | 2019 |
Implicit HRTF modeling using temporal convolutional networks ID Gebru, D Marković, A Richard, S Krenn, GA Butler, F De la Torre, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 15 | 2021 |
A critical evaluation of stochastic algorithms for convex optimization S Wiesler, A Richard, R Schlüter, H Ney 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 10 | 2013 |
Deep impulse responses: Estimating and parameterizing filters with deep networks A Richard, P Dodds, VK Ithapu ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 8 | 2022 |
Audio-visual speech codecs: Rethinking audio-visual speech enhancement by re-synthesis K Yang, D Marković, S Krenn, V Agrawal, A Richard Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 6 | 2022 |
Enhancing temporal action localization with transfer learning from action recognition A Iqbal, A Richard, J Gall Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019 | 6 | 2019 |