Mitchell Stern
Verified email at berkeley.edu
Title
Cited by
Year
The marginal value of adaptive gradient methods in machine learning
AC Wilson, R Roelofs, M Stern, N Srebro, B Recht
Advances in Neural Information Processing Systems 30, 2017
Cited by: 922, Year: 2017
Adafactor: Adaptive learning rates with sublinear memory cost
N Shazeer, M Stern
International Conference on Machine Learning, 4596-4604, 2018
Cited by: 331, Year: 2018
Abstract syntax networks for code generation and semantic parsing
M Rabinovich, M Stern, D Klein
arXiv preprint arXiv:1704.07535, 2017
Cited by: 311, Year: 2017
Insertion transformer: Flexible sequence generation via insertion operations
M Stern, W Chan, J Kiros, J Uszkoreit
International Conference on Machine Learning, 5976-5985, 2019
Cited by: 173, Year: 2019
A minimal span-based neural constituency parser
M Stern, J Andreas, D Klein
arXiv preprint arXiv:1705.03919, 2017
Cited by: 165, Year: 2017
Stochastic cubic regularization for fast nonconvex optimization
N Tripuraneni, M Stern, C Jin, J Regier, MI Jordan
Advances in Neural Information Processing Systems 31, 2018
Cited by: 140, Year: 2018
Kernel feature selection via conditional covariance minimization
J Chen, M Stern, MJ Wainwright, MI Jordan
Advances in Neural Information Processing Systems 30, 2017
Cited by: 65, Year: 2017
Kermit: Generative insertion-based modeling for sequences
W Chan, N Kitaev, K Guu, M Stern, J Uszkoreit
arXiv preprint arXiv:1906.01604, 2019
Cited by: 57, Year: 2019
What's going on in neural constituency parsers? an analysis
D Gaddy, M Stern, D Klein
arXiv preprint arXiv:1804.07853, 2018
Cited by: 54, Year: 2018
Imitation attacks and defenses for black-box machine translation systems
E Wallace, M Stern, D Song
arXiv preprint arXiv:2004.15015, 2020
Cited by: 47, Year: 2020
Blockwise parallel decoding for deep autoregressive models
M Stern, N Shazeer, J Uszkoreit
Advances in Neural Information Processing Systems 31, 2018
Cited by: 45, Year: 2018
Effective inference for generative neural parsing
M Stern, D Fried, D Klein
arXiv preprint arXiv:1707.08976, 2017
Cited by: 45, Year: 2017
Improving neural parsing by disentangling model combination and reranking effects
D Fried, M Stern, D Klein
arXiv preprint arXiv:1707.03058, 2017
Cited by: 37, Year: 2017
Dynamic posted-price mechanisms for the blockchain transaction-fee market
MVX Ferreira, DJ Moroz, DC Parkes, M Stern
Proceedings of the 3rd ACM conference on Advances in Financial Technologies …, 2021
Cited by: 11, Year: 2021
Semantic scaffolds for pseudocode-to-code generation
R Zhong, M Stern, D Klein
arXiv preprint arXiv:2005.05927, 2020
Cited by: 10, Year: 2020
Insertion-deletion transformer
L Ruis, M Stern, J Proskurnia, W Chan
arXiv preprint arXiv:2001.05540, 2020
Cited by: 10, Year: 2020
An empirical study of generation order for machine translation
W Chan, M Stern, J Kiros, J Uszkoreit
arXiv preprint arXiv:1910.13437, 2019
Cited by: 9, Year: 2019
Towards end-to-end in-image neural machine translation
E Mansimov, M Stern, M Chen, O Firat, J Uszkoreit, P Jain
arXiv preprint arXiv:2010.10648, 2020
Cited by: 3, Year: 2020
Generating neural network outputs using insertion commands
W Chan, MT Stern, N Kitaev, K Gu, JD Uszkoreit
US Patent App. 16/883,772, 2020
Cited by: 1, Year: 2020
Parallel decoding using transformer models
NM Shazeer, JD Uszkoreit, MT Stern
US Patent App. 16/682,611, 2020
Cited by: 1, Year: 2020