Nathan Lambert

Cited by

	All	Since 2019
Citations	1483	1477
h-index	16	16
i10-index	27	27

560

280

140

420

20182019202020212022202320245 16 43 120 201 546 545

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Roberto CalandraProfessor, TU Dresden, Centre for Tactile Internet with Human-in-the-Loop (CeTI)Verified email at tu-dresden.de
Kristofer PISTERUC BerkeleyVerified email at berkeley.edu
Daniel S. DrewUniversity of UtahVerified email at utah.edu
Tom ZickHarvardVerified email at berkeley.edu
Thomas Krendl GilbertNew York Academy of SciencesVerified email at nyas.org
Brandon AmosMetaVerified email at fb.com
Sarah DeanCornellVerified email at cornell.edu
Luis PinedaResearch Engineer, Facebook AI ResearchVerified email at fb.com
Craig B. SchindlerUniversity of California, BerkeleyVerified email at berkeley.edu
Lydia LeeSandia National LaboratoriesVerified email at sandia.gov

Nathan Lambert

Research Scientist, Allen AI

Verified email at allenai.org - Homepage

Reinforcement Learning Machine Learning Robotics Responsible AI


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
[Github] Diffusers: State-of-the-art diffusion models P von Platen, S Patil, A Lozhkov, P Cuenca, N Lambert, K Rasul, ... https://github.com/huggingface/diffusers, 2022	219*	2022
Low Level Control of a Quadrotor with Deep Model-Based Reinforcement Learning N Lambert, DS Drew, J Yaconelli, R Calandra, S Levine, KSJ Pister IEEE Robotics and Automation Letters 4 (4), 4224-4230, 2019	163	2019
Zephyr: Direct distillation of lm alignment L Tunstall, E Beeching, N Lambert, N Rajani, K Rasul, Y Belkada, ... arXiv preprint arXiv:2310.16944, 2023	153	2023
Open LLM Leaderboard E Beeching, C Fourrier, N Habib, S Han, N Lambert, N Rajani, ... URL https://huggingface. co/spaces/HuggingFaceH4/open_llm_leaderboard, 2023	119	2023
On the importance of hyperparameter optimization for model-based reinforcement learning B Zhang, R Rajan, L Pineda, N Lambert, A Biedenkapp, K Chua, F Hutter, ... International Conference on Artificial Intelligence and Statistics, 4015-4023, 2021	101	2021
Objective Mismatch in Model-based Reinforcement Learning N Lambert, B Amos, O Yadan, R Calandra Learning for Dynamics and Control (L4DC), 2020	88	2020
Toward controlled flight of the ionocraft: a flying microrobot using electrohydrodynamic thrust with onboard sensing and no moving parts D Drew, N Lambert, C Schindler, K Pister IEEE Robotics and Automation Letters 3 (4), 2807-2813, 2018	75	2018
[Blog] Illustrating reinforcement learning from human feedback (RLHF) N Lambert, L Castricato, L von Werra, A Havrilla https://hf.co/blog/rlhf, 2022	70*	2022
[Github] Trl: Transformer reinforcement learning L von Werra, Y Belkada, L Tunstall, E Beeching, T Thrush, N Lambert https://github.com/lvwerra/trl, 2020	59*	2020
Mbrl-lib: A modular library for model-based reinforcement learning L Pineda, B Amos, A Zhang, NO Lambert, R Calandra arXiv preprint arXiv:2104.10159, 2021	48	2021
Camels in a changing climate: Enhancing lm adaptation with tulu 2 H Ivison, Y Wang, V Pyatkin, N Lambert, M Peters, P Dasigi, J Jang, ... arXiv preprint arXiv:2311.10702, 2023	46	2023
Learning generalizable locomotion skills with hierarchical reinforcement learning T Li, N Lambert, R Calandra, F Meier, A Rai IEEE International Conference on Robotics and Automation (ICRA), 413-419, 2020	45	2020
The challenges of exploration for offline reinforcement learning N Lambert, M Wulfmeier, W Whitney, A Byravan, M Bloesch, V Dasagi, ... arXiv preprint arXiv:2201.11861, 2022	38	2022
Reward reports for reinforcement learning TK Gilbert, N Lambert, S Dean, T Zick, A Snoswell, S Mehta Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 84-130, 2023	28	2023
Learning Accurate Long-term Dynamics for Model-based Reinforcement Learning N Lambert, A Wilcox, H Zhang, K Pister, R Calandra IEEE Conference on Decision and Control (CDC), 2880-2887, 2021	25	2021
Olmo: Accelerating the science of language models D Groeneveld, I Beltagy, P Walsh, A Bhagia, R Kinney, O Tafjord, AH Jha, ... arXiv preprint arXiv:2402.00838, 2024	20	2024
Investigating compounding prediction errors in learned dynamics models N Lambert, K Pister, R Calandra arXiv preprint arXiv:2203.09637, 2022	16	2022
[HuggingFace] H4 Stack Exchange Preference Dataset N Lambert, NR Lewis Tunstall, T Thrush https://huggingface.co/datasets/HuggingFaceH4/stack-exchange-preferences, 2023	15*	2023
Stackllama: An rl fine-tuned llama model for stack exchange question and answering E Beeching, Y Belkada, K Rasul, L Tunstall, L von Werra, N Rajani, ... URL https://huggingface.co/blog/stackllama, 2023	14	2023
Measuring data M Mitchell, AS Luccioni, N Lambert, M Gerchick, A McMillan-Major, ... arXiv preprint arXiv:2212.05129, 2022	14	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors