What Everyone in Technical Alignment is Doing and Why: Anthropic, OpenAI, DeepMind Safety, Conjecture

Wednesday 25 January 2023
19:00 21:00

Google Calendar ICS

We’ll be discussing what everyone in practical safety is doing and why, using Thomas Larsen’s LessWrong article as a jumping off point. There’s a lot to get through, so we’ll be focusing just on the four biggest players: Anthropic, OpenAI, DeepMind Safety, and Conjecture. We’ll try to identify common themes (like LLM alignment) to see where the community is allocating its resources.

What Everyone in Technical Alignment is Doing and Why: Anthropic, OpenAI, DeepMind Safety, Conjecture

Agent Incentives: A Causal Perspective

What Everyone in Technical Alignment is Doing and Why: CHAI, CAIS, Sam Bowman and MIRI

AI Safety 東京