diff --git a/index.md b/index.md index 9944c81..7ade052 100644 --- a/index.md +++ b/index.md @@ -72,7 +72,7 @@ Large language models (LLMs) have revolutionized a wide range of domains. In par | Oct 28 | **Towards a unified framework of Neural and Symbolic Decision Making**
Yuandong Tian, Meta AI (FAIR)
Slides [Original Recording](https://bcourses.berkeley.edu/courses/1535641/external_tools/90481) [Edited Video](https://www.youtube.com/live/wm9-7VBpdEo) | - [Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping](https://arxiv.org/abs/2402.14083)
- [Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces](https://arxiv.org/abs/2410.09918v1)
- [Composing Global Optimizers to Reasoning Tasks via Algebraic Objects in Neural Nets](https://arxiv.org/abs/2410.01779)
- [SurCo: Learning Linear Surrogates For Combinatorial Nonlinear Optimization Problems](https://arxiv.org/abs/2210.12547) | | Nov 4 | **Project GR00T: A Blueprint for Generalist Robotics**
Jim Fan, NVIDIA
Slides [Original Recording](https://bcourses.berkeley.edu/courses/1535641/external_tools/90481) [Edited Video](https://www.youtube.com/live/Qhxr0uVT2zs) | - [Voyager: An Open-Ended Embodied Agent with Large Language Models](https://voyager.minedojo.org/)
- [Eureka: Human-Level Reward Design via Coding Large Language Models](https://eureka-research.github.io/)
- [DrEureka: Language Model Guided Sim-To-Real Transfer](https://eureka-research.github.io/dr-eureka/) | | Nov 11 | **No Class - Veterans Day** | | -| Nov 18 | **Open-Source and Science in the Era of Foundation Models**
Percy Liang, Stanford University
Slides | - [Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risks of Language Models](https://arxiv.org/abs/2408.08926) | +| Nov 18 | **Open-Source and Science in the Era of Foundation Models**
Percy Liang, Stanford University
Slides [Original Recording](https://bcourses.berkeley.edu/courses/1535641/external_tools/90481) [Edited Video](https://www.youtube.com/live/f3KKx9LWntQ) | - [Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risks of Language Models](https://arxiv.org/abs/2408.08926) | | Nov 25 | **Measuring Agent capabilities and Anthropic's RSP**
Ben Mann, Anthropic | - [Announcing our updated Responsible Scaling Policy](https://www.anthropic.com/news/announcing-our-updated-responsible-scaling-policy)
- [Developing a computer use model](https://www.anthropic.com/news/developing-computer-use) | | Dec 2 | **LLM Agent Safety**
Dawn Song, UC Berkeley | |