Follow
Georg Heigold
Georg Heigold
Research Scientist, Google Inc.
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
An image is worth 16x16 words: Transformers for image recognition at scale
A Dosovitskiy, L Beyer, A Kolesnikov, D Weissenborn, X Zhai, ...
arXiv preprint arXiv:2010.11929, 2020
378722020
Vivit: A video vision transformer
A Arnab, M Dehghani, G Heigold, C Sun, M Lučić, C Schmid
Proceedings of the IEEE/CVF international conference on computer vision …, 2021
19572021
End-to-end text-dependent speaker verification
G Heigold, I Moreno, S Bengio, N Shazeer
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
7662016
Object-centric learning with slot attention
F Locatello, D Weissenborn, T Unterthiner, A Mahendran, G Heigold, ...
Advances in neural information processing systems 33, 11525-11538, 2020
6892020
Small-footprint keyword spotting using deep neural networks
G Chen, C Parada, G Heigold
2014 IEEE international conference on acoustics, speech and signal …, 2014
6642014
Multilingual acoustic models using distributed deep neural networks
G Heigold, V Vanhoucke, A Senior, P Nguyen, MA Ranzato, M Devin, ...
2013 IEEE international conference on acoustics, speech and signal …, 2013
3882013
An empirical study of learning rates in deep neural networks for speech recognition
A Senior, G Heigold, MA Ranzato, K Yang
2013 IEEE international conference on acoustics, speech and signal …, 2013
2182013
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. arXiv preprint 2021
A Dosovitskiy, L Beyer, A Kolesnikov, D Weissenborn, X Zhai, ...
arXiv preprint arXiv:2010.11929, 2010
2062010
Word embeddings for speech recognition.
S Bengio, G Heigold
Interspeech, 1053-1057, 2014
1842014
Sequence discriminative distributed training of long short-term memory recurrent neural networks
H Sak, O Vinyals, G Heigold, A Senior, E McDermott, R Monga, M Mao
entropy 15 (16), 17-18, 2014
1682014
Conditional object-centric learning from video
T Kipf, GF Elsayed, A Mahendran, A Stone, S Sabour, G Heigold, ...
arXiv preprint arXiv:2111.12594, 2021
1652021
Asynchronous optimization for sequence training of neural networks
G Heigold, E McDermott, VO Vanhoucke, AW Senior, MAU Bacchiani
US Patent 10,019,985, 2018
1492018
The RWTH Aachen University open source speech recognition system
D Rybach, C Gollan, G Heigold, B Hoffmeister, J Lööf, R Schlüter, H Ney
Tenth Annual Conference of the International Speech Communication Association, 2009
1492009
Speech recognition process
G Heigold, PAP Nguyen, M Weintraub, VO Vanhoucke
US Patent 8,775,177, 2014
1152014
A linguistic evaluation of rule-based, phrase-based, and neural MT engines
A Burchardt, V Macketanz, J Dehdari, G Heigold, P Jan-Thorsten, ...
The Prague bulletin of mathematical linguistics 108 (1), 159, 2017
1092017
GMM-free DNN acoustic model training
A Senior, G Heigold, M Bacchiani, H Liao
2014 IEEE international conference on acoustics, speech and signal …, 2014
872014
Cross-lingual, character-level neural morphological tagging
R Cotterell, G Heigold
arXiv preprint arXiv:1708.09157, 2017
802017
The RWTH 2007 TC-STAR evaluation system for european English and Spanish.
J Lööf, C Gollan, S Hahn, G Heigold, B Hoffmeister, C Plahl, D Rybach, ...
Interspeech, 2145-2148, 2007
782007
A Gaussian mixture model layer jointly optimized with discriminative features within a deep neural network architecture
E Variani, E McDermott, G Heigold
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
772015
Asynchronous stochastic optimization for sequence training of deep neural networks
G Heigold, E McDermott, V Vanhoucke, A Senior, M Bacchiani
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
772014
The system can't perform the operation now. Try again later.
Articles 1–20