Nathan Lambert

Zitiert von

	Alle	Seit 2019
Zitate	1385	1379
h-index	16	16
i10-index	24	24

560

280

140

420

20182019202020212022202320245 16 43 120 201 549 446

Öffentlicher Zugriff

Alle anzeigen

2 Artikel

0 Artikel

verfügbar

nicht verfügbar

Basierend auf Fördermandaten

Koautoren

Roberto CalandraProfessor, TU Dresden, Centre for Tactile Internet with Human-in-the-Loop (CeTI)Bestätigte E-Mail-Adresse bei tu-dresden.de
Kristofer PISTERUC BerkeleyBestätigte E-Mail-Adresse bei berkeley.edu
Daniel S. DrewUniversity of UtahBestätigte E-Mail-Adresse bei utah.edu
Tom ZickHarvardBestätigte E-Mail-Adresse bei berkeley.edu
Thomas Krendl GilbertNew York Academy of SciencesBestätigte E-Mail-Adresse bei nyas.org
Brandon AmosMetaBestätigte E-Mail-Adresse bei fb.com
Sarah DeanCornellBestätigte E-Mail-Adresse bei cornell.edu
Luis PinedaResearch Engineer, Facebook AI ResearchBestätigte E-Mail-Adresse bei fb.com
Craig B. SchindlerUniversity of California, BerkeleyBestätigte E-Mail-Adresse bei berkeley.edu
Lydia LeeSandia National LaboratoriesBestätigte E-Mail-Adresse bei sandia.gov

Folgen

Nathan Lambert

Research Scientist, Allen AI

Bestätigte E-Mail-Adresse bei allenai.org - Startseite

Reinforcement Learning Machine Learning Robotics Responsible AI


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
[Github] Diffusers: State-of-the-art diffusion models P von Platen, S Patil, A Lozhkov, P Cuenca, N Lambert, K Rasul, ... https://github.com/huggingface/diffusers, 2022	210*	2022
Low Level Control of a Quadrotor with Deep Model-Based Reinforcement Learning N Lambert, DS Drew, J Yaconelli, R Calandra, S Levine, KSJ Pister IEEE Robotics and Automation Letters 4 (4), 4224-4230, 2019	162	2019
Zephyr: Direct distillation of lm alignment L Tunstall, E Beeching, N Lambert, N Rajani, K Rasul, Y Belkada, ... arXiv preprint arXiv:2310.16944, 2023	132	2023
Open LLM Leaderboard E Beeching, C Fourrier, N Habib, S Han, N Lambert, N Rajani, ... URL https://huggingface. co/spaces/HuggingFaceH4/open_llm_leaderboard, 2023	107	2023
On the importance of hyperparameter optimization for model-based reinforcement learning B Zhang, R Rajan, L Pineda, N Lambert, A Biedenkapp, K Chua, F Hutter, ... International Conference on Artificial Intelligence and Statistics, 4015-4023, 2021	100	2021
Objective Mismatch in Model-based Reinforcement Learning N Lambert, B Amos, O Yadan, R Calandra Learning for Dynamics and Control (L4DC), 2020	86	2020
Toward controlled flight of the ionocraft: a flying microrobot using electrohydrodynamic thrust with onboard sensing and no moving parts D Drew, N Lambert, C Schindler, K Pister IEEE Robotics and Automation Letters 3 (4), 2807-2813, 2018	74	2018
[Blog] Illustrating reinforcement learning from human feedback (RLHF) N Lambert, L Castricato, L von Werra, A Havrilla https://hf.co/blog/rlhf, 2022	69*	2022
[Github] Trl: Transformer reinforcement learning L von Werra, Y Belkada, L Tunstall, E Beeching, T Thrush, N Lambert https://github.com/lvwerra/trl, 2020	55*	2020
Mbrl-lib: A modular library for model-based reinforcement learning L Pineda, B Amos, A Zhang, NO Lambert, R Calandra arXiv preprint arXiv:2104.10159, 2021	47	2021
Learning generalizable locomotion skills with hierarchical reinforcement learning T Li, N Lambert, R Calandra, F Meier, A Rai IEEE International Conference on Robotics and Automation (ICRA), 413-419, 2020	44	2020
Camels in a changing climate: Enhancing lm adaptation with tulu 2 H Ivison, Y Wang, V Pyatkin, N Lambert, M Peters, P Dasigi, J Jang, ... arXiv preprint arXiv:2311.10702, 2023	38	2023
The challenges of exploration for offline reinforcement learning N Lambert, M Wulfmeier, W Whitney, A Byravan, M Bloesch, V Dasagi, ... arXiv preprint arXiv:2201.11861, 2022	37	2022
Reward reports for reinforcement learning TK Gilbert, N Lambert, S Dean, T Zick, A Snoswell, S Mehta Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 84-130, 2023	28	2023
Learning Accurate Long-term Dynamics for Model-based Reinforcement Learning N Lambert, A Wilcox, H Zhang, K Pister, R Calandra IEEE Conference on Decision and Control (CDC), 2880-2887, 2021	24	2021
Investigating compounding prediction errors in learned dynamics models N Lambert, K Pister, R Calandra arXiv preprint arXiv:2203.09637, 2022	16	2022
Stackllama: An rl fine-tuned llama model for stack exchange question and answering E Beeching, Y Belkada, K Rasul, L Tunstall, L von Werra, N Rajani, ... URL https://huggingface.co/blog/stackllama, 2023	14	2023
[HuggingFace] H4 Stack Exchange Preference Dataset N Lambert, NR Lewis Tunstall, T Thrush https://huggingface.co/datasets/HuggingFaceH4/stack-exchange-preferences, 2023	13*	2023
Measuring data M Mitchell, AS Luccioni, N Lambert, M Gerchick, A McMillan-Major, ... arXiv preprint arXiv:2212.05129, 2022	13	2022
[Blog] Stable Diffusion with 🧨 Diffusers S Patil, P Cuenca, N Lambert, P von Platen Hugging Face–The AI community building the future. https://huggingface.co …, 2022	13*	2022

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–20

Zitate pro Jahr

Doppelte Zitate

Zusammengeführte Zitate

Koautor hinzufügenKoautoren

Folgen

Zitiert von

Koautoren