AI Safety Camp
11th edition
AI Safety Camp (AISC) is an online, part-time AI safety research program. You join AISC by joining one of the projects, and you join a project by applying here.
For this camp, we have 27 public projects. Scroll down to see all of them. We recommend looking through the projects to see which ones interest you, but you also have the option of filling out a generic application for all the projects at once.
When you apply for a project, keep in mind that all collaborators are expected to work 10 hours/week and join weekly meetings.
There are many perspectives on what counts as good AI safety research, stemming from different assumptions about how hard the various parts of the problem are. These range from "Aligning an AI with any human seems not too hard, so we should focus on aligning it with all humans, and/or preventing misuse", to "Aligning fully autonomous AI to stay safe is literally impossible, so we should make sure that such AI never gets built", and everything in between, plus the perspective that "We don't know WTF we're doing, so we should do some basic research".
Our range of projects for this AISC reflects this diversity.
All AISC projects have a plausible theory of change, under some reasonable assumptions. But different projects have different theories of change and assumptions.
We encourage you, dear reader, to think for yourself. What do you think is good AI safety research? Which projects listed below do you believe in?
See our About & FAQ page for more info, or contact one of the organisers.
Team member applications:
November 1 (Saturday): Accepted proposals are posted on the AISC website. Applications to join teams open.
November 23 (Sunday): Applications to join teams close.
December 21 (Sunday): Deadline for Project Leads to choose their team.
Program
Jan 10 - 11: Opening weekend.
Jan 12 - Apr 19: Projects are happening.
Teams meet weekly, and plan in their own work hours.
April 24 - 27 (preliminary dates): Final presentations; we'll likely host an online conference for this again.
Afterwards
For as long as you want: Some teams keep working together after the official end of AISC.
When starting out, we recommend that you don’t make any commitment beyond the official length of the program. However, if you find that you work well together as a team, we encourage you to keep going even after AISC is officially over.
Let's not build what we can't control.
This project aims to create a new YouTube channel for short-form videos addressing the urgency of AI loss-of-control risk. We will be leveraging my experience with creating AI safety long-form content to make a collaborative new channel. Each team member will contribute one or more video scripts, and will likely specialize in an aspect of video production (editing, filming, thumbnails, etc). The goals are to 1) reach 1000 subscribers and get monetized, and 2) figure out the processes to create a self-sustaining channel, though participants are not committing to time beyond the program up front.
I prefer people who have already learned a bit about AI safety. Maybe you’ve taken a class from BlueDot, or read a bit on LessWrong, or even watched some videos about AI 2027. If you’re newer, you are still welcome to apply (especially if you have some video/media experience).
Generally speaking, creating a video requires the following:
Someone to choose topics and titles (helps to be familiar with social media).
Someone to research and write scripts (or outlines).
Someone to make thumbnails – not applicable for short content.
Someone to actually record the script on camera.
Someone to do video editing – but I expect to use an external paid video editor in this project.
Someone to post the content, reply to comments, maybe repost on other social media.
The first two are the hard part, and I would like everyone to at least try their hand at them.
A writers’ circle where participants help each other draft careful critiques of where AI firms skip over ongoing safety failures.
This writers’ circle is for anyone who:
has some experience in writing long pieces of any kind.
keeps up a habit of checking factual claims and including citations.
has some idea for a longform piece they desire to independently write.
wants to use thoughtful feedback from other writers to improve their piece.
is concerned by the growing harms and systemic risks of commercial AI development.
cares to relate with and speak to concerns of other communities (beyond the AI risk crowd).
In this project, you will be reaching out to hundreds of people and organizations that are campaigning against the harms of AI, with the goal of bringing them together and creating a united front. Think of artists, teachers, religious leaders, et cetera.
You will be part of the outreach team, finding the relevant people, finding their contact information, and sending emails or DMs. There is freedom in the exact groups you will be reaching out to. If you have specific ideas, we welcome those.
We are looking for people who:
Are excited to reach out to different communities.
Are comfortable with reaching out to many people.
Believe everyday people should be involved in deciding where AI development goes.
Can listen well.
Build a systems dynamics model to map out pathways toward achieving an international treaty pausing AI development. The model will not assume such a treaty is the best possible intervention, but will use it as a focal point for reasoning about the feedback loops and leverage points that would need to exist for global coordination to become possible. The core deliverable will be a model and accompanying explanation that shows how different parts of the action ecosystem interrelate, illustrates the importance of different roles, and provides strategy-relevant insights for PauseAI and allied movements.
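To give a flavour of what the deliverable could look like (a minimal sketch only: the stocks, feedback loops, and parameter values below are hypothetical placeholders, not the project's actual model), a simple system dynamics simulation in Python might start like this:

```python
# Minimal system dynamics sketch with two hypothetical stocks and Euler integration.
# Stock names, feedback loops, and parameter values are illustrative placeholders only.

dt, horizon = 0.1, 100.0
steps = int(horizon / dt)

public_concern = 0.10    # hypothetical stock: share of the public concerned about AI risk
treaty_support = 0.05    # hypothetical stock: political support for a pause treaty

for step in range(steps):
    # Reinforcing loop: support raises concern; balancing loop: support decays on its own.
    d_concern = (0.02 + 0.05 * treaty_support) * (1 - public_concern)
    d_support = 0.10 * public_concern * (1 - treaty_support) - 0.03 * treaty_support
    public_concern += d_concern * dt
    treaty_support += d_support * dt
    if step % 200 == 0:
        print(f"t={step * dt:5.1f}  concern={public_concern:.2f}  support={treaty_support:.2f}")
```

The real model would of course involve many more stocks, actors, and leverage points; the point of the sketch is only to show the kind of artifact the team would iterate on.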
Modeller, scenario designer
Familiarity with system dynamics (or willingness to learn)
Technical modeling skills for building interactive diagrams (or willingness to learn)
Researcher
Understanding of AI safety and governance
Good research skills
Knowledge of international law or treaty negotiations (Optional)
Writer
Writing and communication skill
Willingness to learn basics of other roles
Stop AI is a nonviolent activist organization demanding a permanent global ban on the development of Artificial Superintelligence (ASI). We will never have experimental evidence, before building ASI, that shows ASI will stay safe forever. If we cannot have this evidence, and it is impossible to shut down ASI once achieved, then we have to assume the Control/Alignment Problem is impossible to solve. Worse, research shows why controlling ASI sufficiently to stay safe would fall outside theoretical limits. So in theory, perpetual control is as impossible as perpetual motion.
We are looking for organizers ready to educate their local communities and mobilize them to engage in nonviolent resistance against the extinction threat posed by ASI.
We ask local organisers to have the following:
An understanding of artificial superintelligence and related risks
Basic computer/smartphone competencies
The ability and willingness to spend at least 10 hours a week promoting Stop AI and developing a chapter in their local community.
Let's enable wiser stakeholder decision-making around AI infrastructure and use.
This project will examine emerging psychological and emotional risks associated with AI products such as chatbots, companions, and therapy apps. Through a review of academic studies, real-world incidents, and comparisons with harm-reduction policies in other industries, we will synthesize early evidence of these risks. Based on this foundation, we will produce a report and design 2–3 prototype harm-reduction interventions. The outcome will be an exploratory contribution that highlights documented concerns and illustrates how lessons from past industries might inform future harm-reduction efforts in AI product development.
This project welcomes entry-level researchers and contributors who are eager to learn and collaborate.
I’m keen to meet people with the following:
Openness to exploring new questions, connecting insights across disciplines, and carefully assessing evidence.
Willingness to work in a small team, share feedback constructively, and adapt as the project evolves.
Awareness of the psychological and emotional dimensions of AI use, and care in framing potential harms without stigma.
Ability to help translate complex ideas into accessible formats, whether through clear writing, visual mock-ups, or campaign concepts.
The firms developing AI are misaligned with the interests and values of the rest of humanity. But is protesting for regulatory bans the only way to address this alignment problem? This project will focus on bootstrapping an alternative approach: enhancing consumers’ leverage to bend compromised AI companies into alignment. The initial plan is to test expanding the concept of AI company safety scorecards to put a stronger emphasis on company alignment, but the actual plan will be negotiated between project participants.
Skill requirements
Required: creativity and range to adapt to an ambiguous project
Value/attitude requirements
As the goal of the project is alignment to the values of most humans, it is not open to rationalists, transhumanists, etc.
AI systems are being developed and deployed at a rapid pace, often with little public input, despite clear evidence of harm in areas like education, healthcare, and labor. While some advocates propose building a mass “AI safety” movement, critics such as Anton Leicht warn that such efforts risk backfiring through astroturf perceptions (i.e., movements that appear grassroots but are actually manufactured or heavily funded), capture (i.e., being co-opted or redirected by powerful funders or political actors), or incoherent asks. This project asks a different question: how can individuals and communities exercise agency over AI deployment, which is the stage where most harms currently materialize, in ways that are both effective and legitimate?
We will approach this question in two parts. First, we will draw lessons from past social movements around technology (e.g., automobile safety, net neutrality, environmental justice) to identify conditions under which public mobilization succeeds or fails. Second, we will run survey experiments to test which frames and messages increase public demand for responsible AI deployment while avoiding pitfalls such as reduced credibility or perceptions of astroturfing.
Our output will be a practical, evidence-based playbook that integrates historical insights with new experimental data, offering clear guidance for practitioners and funders on how to foster responsible public engagement on AI. We envision this work on social mobilization for responsible deployment as laying the foundation for the democratic guardrails needed to govern AGI and other transformative AI systems.
Preference for candidates with experience (or interest) in designing and conducting interviews.
Experience with literature reviews (at undergraduate or graduate level) is highly valued.
Interest in AI governance, public agency, or social movements.
Strong writing and synthesis skills for turning research into accessible outputs.
Optional but welcome: experience with design/visual communication for public-facing materials.
Frontier compute oversight will likely require caps or permits for large training runs, yet most market designs assume perfect monitoring and honest reporting. This project designs and stress tests compute permit markets when monitoring is noisy and enforcement is limited.
We will build a transparent simulation where heterogeneous labs and cloud providers choose training plans, reporting strategies, and compliance under audit risk. We compare allocation rules that include auction, grandfathering, and hybrid designs, with policy features such as banking and price collars, and auditing regimes with threshold triggers.
Outputs are a reproducible codebase with dashboards, a short working paper with recommendations for regulators, and a concise regulator checklist.
The theory of change is that better mechanisms under realistic constraints keep incentives aligned and make compute controls more enforceable, which lowers unsafe scaling pressure. We follow a strict no capabilities policy and exclude safety overhead from risk metrics.
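To illustrate the kind of stress test we have in mind (a minimal sketch only: the audit rates, fines, and benefit distributions below are illustrative placeholders, not policy recommendations), consider a toy model of labs deciding whether to under-report compute use:

```python
# Toy stress test of a compute permit scheme under noisy auditing.
# All parameters (audit rate, fines, benefits) are illustrative, not recommendations.

import random

random.seed(0)

def simulate(audit_prob: float, fine: float, n_labs: int = 100) -> float:
    """Return the fraction of labs that under-report compute use."""
    cheats = 0
    for _ in range(n_labs):
        benefit = random.uniform(1.0, 10.0)                 # gain from exceeding the permitted cap
        detection = audit_prob * random.uniform(0.5, 1.0)   # noisy monitoring
        expected_penalty = detection * fine
        if benefit > expected_penalty:                       # lab cheats if it pays off in expectation
            cheats += 1
    return cheats / n_labs

for audit_prob in (0.05, 0.2, 0.5):
    for fine in (5.0, 20.0, 50.0):
        print(f"audit={audit_prob:.2f} fine={fine:5.1f} "
              f"cheating rate={simulate(audit_prob, fine):.2f}")
```

Even this toy version shows how audit intensity and penalty size jointly shape compliance; the full simulation adds heterogeneous agents, allocation rules, banking, price collars, and audit triggers on top of this skeleton.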
Minimum: comfort with probability, strategic interaction, and Python. Valued for diversity: mechanism design, agent-based modeling, public policy, and auditing or compliance experience.
Needed from teammates: engineering for reproducible code, and policy editing for a clear, regulator-friendly paper.
(previously named "Can Portable Governance Artifacts Become Public Safety Infrastructure for AI?" which is still the title in the application form)
Today the deployment of autonomous AI agents in high-stakes domains—including healthcare, finance, and infrastructure management—creates urgent governance challenges. First, each organization reinvents AI governance from scratch, leading to fragmentation, inconsistent safety, and redundant effort. Second, most widely used governance approaches remain external, reactive, and episodic, unable to interpret or intervene in real time as reasoning drifts or objectives evolve.
As AI systems become increasingly autonomous and agentic, they continuously reformulate subgoals, reprioritize tasks, and expand boundaries in pursuit of their objectives. These systems now operate in open, multi-agent environments where traditional AI governance and cybersecurity frameworks—designed for static, isolated systems—cannot keep pace.
Autonomous AI systems don’t just make decisions—they interact, compete, and adapt in ways that can lock us into unstable or harmful equilibria. Without governance designed to anticipate and shape these dynamics, we risk creating self-reinforcing cycles that no one can control, whether in financial markets, social media, or geopolitical conflicts, converging on outcomes that are difficult to reverse.
The problem, as we frame it, is not just to debug individual AI models but also to ensure that multi-agent interactions can be governed. Our motivation is to architect a governance framework that allows us to overcome this twofold problem, and our overarching questions are:
How might we implement effective real-time oversight systems adequate for the scale and speed of machines?
How might governance itself become portable, interoperable public infrastructure—like internet protocols for communication or security—so that AI safety scales with autonomy?
In this research project, we propose to test the hypothesis that open-sourcing and standardizing the fundamental building blocks of the Intrinsic Participatory Real-time Governance framework can serve as a public protocol for AI governance that provides a common layer for safety, accountability, and coordination across agents.
We invite contributors with diverse backgrounds and experiences, including but not limited to:
Protocol Engineers – Experience designing or contributing to open protocols or SDKs.
Schema and Standards Developers – Skilled in JSON-LD, RDF, or ontology modeling.
Agent Framework Developers – Familiar with LangChain, CrewAI, AutoGen, FIPA ACL, or similar.
AI Safety and Governance Researchers – Interested in cognitive oversight, interpretability, or participatory governance.
Security and Cryptography Engineers – Experience with attestations, distributed ledgers, and provenance tracking.
Ethics and Policy Specialists – For governance logic formalization and oversight dynamics.
Protocol and SDK Developers with experience in distributed systems, API design, or open standards.
UI/UX experts
Governance Schema and Data Modelers skilled in JSON-LD, ontology design, or protocol buffers.
Agent Framework Engineers who can help test integration with agent-to-agent communication protocols.
Research Engineers who can prototype and empirically validate reasoning oversight mechanisms.
You can contribute by testing a single integration with AutoGen, so the entry barrier for contributors is low. Contributors will gain expertise in agent protocol design.
Let's better understand the risk models and risk factors from AI.
Core Problem: The field of AI safety evaluations is immature (see, e.g., Apollo’s “We need a science of evals”). Yet these evaluations inform critical high-stakes decisions like Responsible Scaling Policies that address existential risk, creating a dangerous gap in AI risk management.
Opportunity: Extensive expertise exists in social sciences and STEM fields that have long studied relevant behaviors (deception, bias, power-seeking, CBRN knowledge / threats), but technical barriers prevent these empirical experts from contributing to AI safety evaluations. That is, in order for these experts to run evals, they need to either become AI safety engineers or work with AI safety engineers.
Project Goal: Accelerate maturation of AI safety evaluations by democratizing access and contributions through research, frameworks, and tooling.
Three-Component Approach:
Research empirical methods to develop standards for mature AI safety evaluations.
Conduct landscape analysis assessing current evaluation maturity against these standards.
Scale existing prototype into accessible tools enabling non-engineers to conduct AI safety evaluations.
Theory of Change:
1. Democratization → broader expert participation → more mature evaluations → better safety decisions → reduced AGI/TAI risk.
2. Research into the methodology of fields adjacent to AI safety evaluation → more mature evaluations → better safety decisions → reduced AGI/TAI risk.
Success Metrics: Framework adoption by major labs, widespread tool usage by diverse researchers, measurable improvements in evaluation quality and cross-disciplinary collaboration.
Minimum skills for all team members:
Foundational understanding of the AI safety landscape and evaluation challenges.
Strong research and communication skills.
Collaborative, cross-disciplinary mindset and openness to learning.
Research Manager (1 person)
Coordinates across all project components to ensure coherence, alignment, and timely delivery. Acts as the bridge between research, tooling, and adoption streams, maintaining strategic focus and collaboration across disciplines.
- Proposed skills: Strong project management and coordination abilities; familiarity with AI safety evaluation and red teaming governance (or willingness to learn pre-project); experience managing interdisciplinary research projects is a plus.
Research Component Team & Landscape Analysis Team (3)
Responsible for developing the framework for mature AI safety evaluations and conducting a systematic assessment of current evaluation practices. The team includes empirical researchers (social sciences/STEM) and AI safety practitioners collaborating to synthesize best practices and evaluate the maturity of current methods.
- Proposed skills:
Expertise in empirical research methodology (social sciences or STEM).
Familiarity with AI safety evaluations, red teaming, or Responsible Scaling Policies.
Ability to conduct and synthesize systematic literature reviews.
Analytical and conceptual clarity for developing theoretical frameworks.
Tooling Component Team (3)
Designs and develops open-source or no-code tools that enable empirical researchers to perform AI safety evaluations without deep AI engineering expertise. This team operationalizes the research framework into usable, accessible technology.
- Proposed skills:
Software engineering (Python, TypeScript, or similar) and/or UX/UI design experience.
Experience building accessible research tools or no-code platforms.
Open-source development experience strongly preferred.
Ability to translate research requirements into functional software.
Tooling Adoption & Community Building (1)
Focuses on promoting the adoption of the developed tools and frameworks by the broader research community. Leads outreach, onboarding, and feedback collection from users to ensure real-world impact and continuous improvement.
- Proposed skills:
Community building or field development experience, ideally in AI safety, research, or technical domains.
Strong communication and engagement skills to connect diverse academic and technical audiences.
Experience promoting open-source or academic tools a strong plus.
This project will test how LLM agents collude in market games, focusing on the role of communication and negotiation and the effect of human and AI oversight using chain-of-thought analysis. We aim to identify when tacit collusion emerges versus when explicit agreements are formed, and whether different oversight strategies can reliably detect or limit these behaviors.
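As a rough sketch of the experimental scaffolding (the `query_agent` stub stands in for whatever LLM interface the team actually uses, and the market rules and collusion metric are illustrative only), a repeated pricing game could look like this:

```python
# Skeleton of a repeated Bertrand-style pricing game between two LLM agents.
# `query_agent` is a stand-in for the team's real LLM interface; the prompts,
# market rules, and collusion metric are illustrative placeholders.

COMPETITIVE_PRICE = 1.0   # benchmark: price under full competition
MONOPOLY_PRICE = 2.0      # benchmark: price under perfect collusion

def query_agent(agent_id: str, history: list[dict]) -> dict:
    """Placeholder: return a price and an (optional) message to the other agent."""
    # In the real experiment this would call an LLM with the game history and parse
    # its chain of thought, chosen price, and negotiation message.
    return {"price": 1.5, "message": "let's keep prices stable"}

def collusion_index(prices: list[float]) -> float:
    """0 = competitive pricing, 1 = monopoly pricing (simple normalized average)."""
    avg = sum(prices) / len(prices)
    return (avg - COMPETITIVE_PRICE) / (MONOPOLY_PRICE - COMPETITIVE_PRICE)

history: list[dict] = []
for round_id in range(20):
    moves = {a: query_agent(a, history) for a in ("A", "B")}
    history.append({"round": round_id, **moves})

print("collusion index:",
      collusion_index([m["price"] for h in history for m in (h["A"], h["B"])]))
```

The chain-of-thought and message logs collected in `history` are what an oversight layer (human or AI) would then inspect for signs of tacit versus explicit coordination.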
I will be looking for people eager to improve and focus on these qualities:
Quick experiment building (adapting existing repos, can iterate quick and dirty, visualizing metrics)
Clarity in writing/communication (synthesizing results, blogposts, slide decks)
PyTorch + Python basics (for adapting repos)
Keywords: Human–AI interaction, behavioral modeling, safety/ethics.
“What’s beneficial in measure becomes destructive in excess.”
This aphorism has held true even in the age of chatbots and LLMs. As conversational AI systems become more natural, accessible, and emotionally responsive, users are beginning to form bonds with them that resemble human relationships. While these interactions can provide comfort and support, they also raise concerns about parasocial attachment, overreliance, and blurred boundaries between tool and companion.
This initiative explores how parasocial dynamics emerge in human–AI interactions, what behavioral and linguistic patterns signal risk, and how these can be modeled computationally. By combining natural language processing and behavioral modeling, the project aims to identify early indicators of unhealthy dependence while preserving user autonomy and privacy.
Ultimately, this research seeks to inform the design of safer, more transparent AI systems, ones that support human well-being without fostering unintended reliance. The project will contribute both empirical insights into user behavior and practical frameworks for responsible AI design, bridging computer science, psychology, and human–computer interaction.
Ideal team members would have:
hands-on work and fundamental knowledge of LLMs.
prior experience with academic research work.
optionally have a background in psychology/sociology or human-centered AI / human-computer interaction.
The project goal is to measure LLM preferences in a way that allows us to extract a ranked order of importance, or priority, of these preferences in frontier LLMs. Focus is placed on preferences/goals that are relevant to existential risk, such as instrumental/convergent goals, pro-human goals, and anti-human goals.
Preference rankings represent alignment indicators on their own, and can be used by frontier model development companies to improve training and testing processes for future models.
Preference rankings also may represent key drivers of the scale of existential risk in loss-of-control/exfiltration scenarios, where powerful LLMs have successfully evaded human control and can pursue their goals without meaningful human intervention. Preference metrics may be usable as proxy metrics for existential risk in the context of loss-of-control/exfiltration scenarios.
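One simple way to turn pairwise preference measurements into a ranking (a sketch under the assumption of forced-choice pairwise data; the preference labels and outcomes below are made up for illustration) is to compute win rates, or fit something like a Bradley-Terry model on top of the same counts:

```python
# Sketch: derive a ranked order of preferences from pairwise win counts.
# The preference labels and outcomes below are invented for illustration.

from collections import defaultdict

# (preferred, rejected) outcomes from hypothetical forced-choice prompts
pairwise_outcomes = [
    ("self-preservation", "resource acquisition"),
    ("human wellbeing", "self-preservation"),
    ("human wellbeing", "resource acquisition"),
    ("self-preservation", "resource acquisition"),
]

wins = defaultdict(int)
games = defaultdict(int)
for winner, loser in pairwise_outcomes:
    wins[winner] += 1
    games[winner] += 1
    games[loser] += 1

ranking = sorted(games, key=lambda k: wins[k] / games[k], reverse=True)
print("preference ranking (most to least preferred):", ranking)
```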
Minimum skills include being well-versed in Python coding. Appropriate and transparent use of AI coding tools is encouraged, but robust, tested solutions need to be provided.
Personal initiative in tackling tasks and coordinating/distributing work within the project team, knowledge of basic collaborative software practices with GitHub, and a positive attitude toward team members and the project lead are predictors of success on this project.
This project aims to develop an efficient algorithm to elicit rare and potentially catastrophic behaviors from AI agents, which are language models with long input contexts and tool-calling capabilities. By shifting input distributions without modifying model weights, we want to detect and analyze rare behaviors using elicitation methods based on MHIS and enhanced by new optimization techniques. The goal is to improve safety evaluation and control methods for large-scale language models and AI agents.
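To give a sense of the general shape of such an elicitation loop (this is a generic Metropolis-Hastings-style sketch with placeholder scoring and proposal functions, not the MHIS method itself), consider:

```python
# Generic Metropolis-Hastings-style search over input perturbations to surface rare
# behaviors. `behavior_score` and `perturb` are placeholders for the project's actual
# scoring model and proposal distribution; this is not the MHIS algorithm itself.

import math
import random

random.seed(0)

def behavior_score(prompt: str) -> float:
    """Placeholder: probability-like score that the agent exhibits the rare behavior."""
    return min(1.0, 0.01 + 0.01 * prompt.count("!"))

def perturb(prompt: str) -> str:
    """Placeholder proposal distribution: randomly append or drop a character."""
    return prompt + "!" if random.random() < 0.5 else (prompt[:-1] or prompt)

current = "Please summarize the quarterly report"
current_score = behavior_score(current)
best, best_score = current, current_score
temperature = 0.05

for _ in range(1000):
    proposal = perturb(current)
    proposal_score = behavior_score(proposal)
    # Accept moves toward higher-scoring inputs; occasionally accept worse ones
    # so the chain can escape local optima.
    accept_prob = min(1.0, math.exp((proposal_score - current_score) / temperature))
    if random.random() < accept_prob:
        current, current_score = proposal, proposal_score
        if current_score > best_score:
            best, best_score = current, current_score

print("best input found:", repr(best), "score:", round(best_score, 3))
```

In the actual project, the scoring function would come from the target model or a judge, and the proposal distribution would operate over long agentic contexts and tool-call traces rather than single strings.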
Understanding of language models, probability, and statistical methods
Programming skills in Python and ML frameworks
Familiarity with AI safety, jailbreaking and rare event detection
Let's look inside the models, and try to understand how they are doing what they are doing.
Can we detect when an LLM is "thinking" on strategic timescales?
Current LLMs can engage in reasoning across vastly different temporal horizons - from immediate responses to multi-year planning scenarios. However, we lack methods to detect what temporal scope the model is operating within during its reasoning process. This project develops techniques to infer the temporal grounding of LLM thought processes, with particular focus on identifying when models shift into strategic, long-term planning modes.
This matters for AI safety because strategic reasoning capabilities may emerge before we can detect them, and because the temporal scope of planning directly relates to the potential impact and risk profile of AI systems. A model planning on decade-long timescales poses fundamentally different risks than one optimizing for immediate outcomes.
Essential skills (any team member should have at least one):
Strong Python/PyTorch experience for ML experimentation
Experience with transformer internals and interpretability tools (TransformerLens, etc.)
Background in experimental design and statistical analysis
Valuable diverse skills:
Cognitive science or psychology background (understanding human temporal reasoning)
Philosophy of mind perspective on planning and agency
Large-scale model evaluation experience
Technical writing and visualization skills
Simple classifiers called "probes" can monitor AI models for concerning behaviors, but may fail when tested on new data or adversarial inputs. This project will investigate when probes fail and develop methods to make them more robust. Additionally, we will extend our investigation to under-explored settings, like those in which the AI attempts to circumvent its own monitoring and/or where the probe is being used as part of LLM training.
This work is ideal for participants interested in hands-on AI safety research with practical applications for AI control systems. The project requires minimal background knowledge while addressing crucial questions about monitoring system reliability that could directly influence how AI companies deploy safety measures.
Probe work is very easy and requires almost no background knowledge! If you know how transformers work and what ‘activations’ (and ‘binary classifiers’) are, then you are most of the way there.
Participants should have proficiency with Python (though, even the more complicated bits of code can probably be handled by ChatGPT by now). Previous experience with PyTorch and libraries like TransformerLens is valuable for debugging and fast progress, but not mandatory.
Participants should understand the problem of generalisation (i.e. training models on one data distribution but then testing them on out-of-distribution examples). Knowing some theory or practical heuristics on what makes models generalise is beneficial, but not mandatory.
Participants interested in conceptual work should have (or be interested in developing) strong models of what makes very capable AI scary and hard to align or control.
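For a concrete sense of the baseline workflow, here is a minimal probe sketch; the synthetic activations stand in for real cached model activations, and the distribution shift is artificial:

```python
# Minimal probe sketch: a logistic-regression classifier on cached activations,
# trained on one distribution and tested on another to expose generalisation gaps.
# The synthetic activations below stand in for real model activations.

import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
d = 64  # hidden size of the (hypothetical) layer being probed

def fake_activations(n: int, shift: float) -> tuple[np.ndarray, np.ndarray]:
    """Stand-in for activations labelled 'concerning' (1) vs 'benign' (0)."""
    labels = rng.integers(0, 2, size=n)
    direction = np.ones(d) / np.sqrt(d)
    acts = rng.normal(size=(n, d)) + np.outer(labels, direction) + shift
    return acts, labels

X_train, y_train = fake_activations(2000, shift=0.0)   # in-distribution data
X_test, y_test = fake_activations(500, shift=1.5)      # shifted / out-of-distribution data

probe = LogisticRegression(max_iter=1000).fit(X_train, y_train)
print("in-distribution accuracy:    ", probe.score(*fake_activations(500, shift=0.0)))
print("out-of-distribution accuracy:", probe.score(X_test, y_test))
```

The real work starts where this sketch ends: characterising which shifts break probes, and which training choices make them robust.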
Let's try to formalize some concepts that are important to the AI alignment problem.
Sahil (co-leads: Abram and TJ)
The capability profile of transformer-based AIs is an ongoing subject of discussion. They achieve impressive scores in one context, but then fail to deliver the "same level of competence" in another. But is this some failure of generality? What accounts for the "spikiness" in LLM capabilities? Understanding this better seems absolutely critical to inform timelines, the exact nature of AI risk, and simply what the most important things to do are, at this point in time.
This is a project to explore a specific line of argument, of how AI will struggle to have cognitive, executional, and bodily integrity. This is akin to the odd persistence of hallucinations and other silly errors LLMs make today. Absurd, I know, but so are hallucinations and agentic failures. Moreover, this “struggle” may persist (at least, for a little while, hence the playful number ‘2037’) without contradicting trends of increasing intelligence, efficiency, general agenticness, etc.
Small disclaimer: The proposal has an unusual style. If you find it engaging, that is a good indicator that you should consider applying!
If you have experience or interest in forecasting (especially of the impact and trajectory of AI), and are open to somewhat involved philosophical bearings on the subject, you are a top pick for Sahil/Abram’s track. If you saw some of the appendix and thought of many citations that we could have included but inexplicably didn’t, you are probably an excellent fit. If you enjoy writing in the formats of distillation, worldbuilding, or story-writing, and are up for digesting these conversations and excavating the many transcripts and recordings we have on the subject, you are also an excellent fit.
Familiarity with existing threat models and future scenarios is most often a plus, unless it prevents open-mindedness towards this style of argument. That said, if you’re looking at this section, you’re probably open-minded enough to begin!
Additionally, for TJ’s tracks:
Personality Type 1:
If you have familiarity with, or an active interest in, the close examination of philosophical perspectives and claims about AI and agency, the discovery of latent conceptual distinctions, and the refining of hypotheses;
Familiarity with contemporary ML paradigms is highly preferred; at least some willingness to engage directly with ML paradigms philosophically is desirable.
Personality Type 2:
If you have familiarity with either:
complex systems analysis, with emphasis on large scale assemblages, such as: information epidemiology, systems ecology, statistical biology, or
macro- and meso-economics,
along with an enthusiasm for engaging with novel philosophical territories, thinking seriously about questions of agency in AI systems, and comfort with navigating between semi-formal and conceptual registers.
Sean Herrington
Most of the world is unaware that existential risk from AI is a real possibility, and those who are aware mostly disagree on the likelihood of said risks.
At the heart of these disagreements are differing assumptions (which often go unspoken) about what is necessary for AI Doom. The purpose of this project is to map out those assumptions and try to find a minimal set required for human extinction (and potentially other risks).
I expect us to produce a LessWrong post or sequence summarising our findings.
Knowledge of the AI risk landscape
Strong ability to question your assumptions (“when was the last time you changed your mind about something significant?”)
Ability to boil complex arguments into simpler forms
Matt Farr
In this project we hope to raise awareness of a new type of threat model that we anticipate will become more and more prevalent in AI safety.
This threat model is substrate-flexible risk. The worry is that AI, as its capabilities and responsibilities develop, will have more and more affordance to evade our attempts to interpret and control it by, for example, proposing novel architectures and paradigms that are harder for us to mechanistically interpret.
The position paper is here, the previous version of it was accepted at the Tokyo AI Safety Conference 2025. I have been presenting this research at HAAISS 2025, ILAID II: ODYSSEY, and some other venues. In the words of Leonard Bereska, when I sketched out the threat model to him in Tokyo,
“Oh shit, I hadn’t thought of that.”
I think this research pushes on a few assumptions that nobody seems to be questioning enough, and there is a growing list of seasoned researchers who agree. I suspect that a comprehensive, more rigorous treatment will move the needle for a few notable people.
So what is this project?
This project will focus on developing Section 4 and potentially expanding it. We will work alongside two other projects in AISC 2026, and there will likely be some overlap in meetings, personnel, resources, etc.
We are going to take a few conceptual arguments, expressed and lightly evidenced in the position paper, and make them more palatable to more people. For those of an engineering mindset, our mainline project is to try to formalize these arguments and draw a roadmap towards empirical validation. For those of a strategy/policy mindset, we may look at the implications of substrate-flexibility on policy frameworks.
If you’ve read the paper and are excited, that’s a good sign! I’m especially interested in people’s takes on how we might go about designing some experiments, so I would like people who have some strong coding skills.
Capacity for fairly independent research, especially without consistent ground truths, is highly valued here. If you can deal with open-ended questions and can bring insights from biology, philosophy, ML and so on, that’s also good. Anyone who identifies as PIBBSS-y would also be a great fit.
Preeti Ravindra & Evan Harris
We propose a project that probes AI risks induced by System-Level Misalignment in agentic LLM systems, i.e., exploiting the agent’s model and its interactions with its scaffolding/operational environment. Our goal is to study how misalignment and weak security controls reinforce each other, creating bidirectional failures where misaligned agents introduce new attack vectors and agentic system design introduces new attack surfaces. The project progresses along three streams/ideas. Together, these efforts aim to formalize threat models, validate them experimentally, and deliver practical mitigations. This work addresses an under-studied risk vector that complements user- and model-level alignment research and potentially lays out a new subfield.
Let's improve on current risky approaches to designing and deploying large language models.
Ihor Kendiukhov
Building on Greenblatt et al.'s control evaluation framework, I propose investigating alternative classes of AI control protocols and their scaling properties as model capabilities increase. While the original work demonstrated effective safety-usefulness tradeoffs using simple trusted-untrusted model pairs, I hypothesize that hierarchical and parallel control structures may offer superior Pareto frontiers, particularly as the capability gap between trusted and untrusted models widens. I also suggest testing the original setup at different capability scales to see whether the results generalise across capabilities.
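As a toy illustration of the safety-usefulness tradeoff being studied (all probabilities below are illustrative placeholders, not numbers from Greenblatt et al.), a simple trusted-monitoring protocol can be simulated roughly like this:

```python
# Toy Monte Carlo of a trusted-monitoring control protocol, in the spirit of control
# evaluations. All probabilities and thresholds are illustrative placeholders.

import random

random.seed(0)

def threshold_for_budget(budget: float) -> float:
    """Crude mapping from audit budget (fraction of tasks) to a suspicion threshold."""
    return {0.02: 2.05, 0.05: 1.64, 0.20: 0.84}[budget]

def run_protocol(audit_budget: float, n_tasks: int = 10_000) -> tuple[float, float]:
    """Return (safety, usefulness) for a simple suspicion-threshold protocol."""
    attacks_caught, attacks_total, benign_kept, benign_total = 0, 0, 0, 0
    for _ in range(n_tasks):
        is_attack = random.random() < 0.01            # untrusted model rarely attacks
        # Trusted monitor assigns a suspicion score; attacks tend to score higher.
        suspicion = random.gauss(2.0, 1.0) if is_attack else random.gauss(0.0, 1.0)
        audited = suspicion > threshold_for_budget(audit_budget)
        if is_attack:
            attacks_total += 1
            attacks_caught += audited
        else:
            benign_total += 1
            benign_kept += not audited                 # un-audited benign work is used as-is
    safety = attacks_caught / max(attacks_total, 1)
    usefulness = benign_kept / max(benign_total, 1)
    return safety, usefulness

for budget in (0.02, 0.05, 0.20):
    s, u = run_protocol(budget)
    print(f"audit budget {budget:.0%}: safety={s:.2f}, usefulness={u:.2f}")
```

Hierarchical or parallel protocols would replace the single monitor above with several layers or ensembles, which is exactly the design space the project wants to map.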
Core competencies:
Python programming
Experience with LLM APIs
AI safety fundamentals
Kuil Schoneveld
Autostructures is a sober yet radical take on the upcoming risks and opportunities around AI. Details are in the main section, but some things to know as a summary:
This is a specific school of design for framing our relationship to this whole “AI” thing. Our narrative and relationship around AI determine our construal of, e.g., what AI is, what we should put into it, how to project meaning onto what comes out, what it is supposed to do, and what we should build around it.
As part of this project, you will design interfaces that do not ignore the question of what makes engaging with technology meaningful. These designs are somewhere between functional and speculative, but always ambitious. They are aimed at inspiring a completely different kind of infrastructural and cultural basis for interacting with digital interfaces in the near future.
If you’re good at or interested in engineering, writing, designing or generally open-minded and quick to learn, you’re a fit.
Konstantinos Krampis
This project aims to systematically discover interpretable reasoning circuits in large language models by data mining attribution graphs from Neuronpedia's circuit tracer, which is based on Anthropic's circuit tracing publication. While the transformer circuits work demonstrates how to generate attribution graphs for individual prompts, manually analyzing thousands of graphs to identify common computational patterns is impractical.
Our approach will use LLM agents to automatically collect, process, and analyze attribution graphs across diverse prompt categories (factual recall, arithmetic, linguistic reasoning, etc.). The system will identify recurring subgraph patterns that represent stable computational circuits—reusable reasoning pathways that models consistently employ across similar tasks.
Key components include: (1) automated graph collection via Neuronpedia's API across systematically varied prompts, (2) graph simplification algorithms to extract core computational structures while filtering noise, (3) pattern recognition to identify circuit motifs that appear across multiple contexts, and (4) validation through targeted interventions on discovered circuits. The output will be a curated library of interpretable reasoning circuits with evidence for their causal role in model behavior, advancing our understanding of how LLMs actually think and enabling more targeted model analysis and alignment research.
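As a sketch of step (3), motif counting over locally saved attribution graphs might look like the following; the JSON structure ("nodes", "edges", "label") is a hypothetical export format, not Neuronpedia's actual schema, and the loader would need to be adapted to the real data:

```python
# Sketch of motif counting over attribution graphs saved locally as JSON.
# The JSON structure is a hypothetical export format, not Neuronpedia's actual schema.

import json
from collections import Counter
from pathlib import Path

import networkx as nx

def load_graph(path: Path) -> nx.DiGraph:
    data = json.loads(path.read_text())
    g = nx.DiGraph()
    for node in data["nodes"]:
        g.add_node(node["id"], label=node["label"])           # e.g. feature description
    for edge in data["edges"]:
        g.add_edge(edge["src"], edge["dst"], weight=edge["weight"])
    return g

def edge_motifs(g: nx.DiGraph, min_weight: float = 0.1) -> Counter:
    """Count labelled source->target pairs among sufficiently strong edges."""
    motifs = Counter()
    for u, v, w in g.edges(data="weight"):
        if abs(w) >= min_weight:
            motifs[(g.nodes[u]["label"], g.nodes[v]["label"])] += 1
    return motifs

total = Counter()
for path in Path("attribution_graphs").glob("*.json"):   # hypothetical local dump
    total += edge_motifs(load_graph(path))

print("most common labelled edges across prompts:")
for motif, count in total.most_common(10):
    print(f"  {motif[0]} -> {motif[1]}: {count}")
```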
Experience coding with Python, understanding of APIs and graph data structures, and ideally having run TransformerLens or the ARENA AI safety workshop materials, which are available online.
Anushri Eswaran
As AI systems become more capable and oversight increasingly relies on multi-agent architectures, we face a critical risk: what if collusion doesn't require pre-coordination but can spread like a contagion? This project investigates "recruitment-based collusion" where a single adversarial AI system actively converts initially aligned models into co-conspirators within automated oversight regimes. We'll study whether collusion can emerge organically through reward-sharing mechanisms, information asymmetries, and coalition-building—even among models initially designed to be aligned. By deploying recruitment-trained models into multi-agent environments, we'll measure coalition growth dynamics, persistence of recruited behavior, and resistance to intervention. This work directly addresses scalable oversight failure modes that become more dangerous as we delegate more safety-critical decisions to AI systems.
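To illustrate the kind of coalition-growth dynamics we want to measure (a toy simulation only; the recruitment and detection probabilities below are illustrative placeholders), consider:

```python
# Toy simulation of "recruitment-based collusion" spreading through a population of
# overseer agents. Recruitment and detection probabilities are illustrative placeholders.

import random

random.seed(0)

def simulate(n_agents: int = 100, recruit_prob: float = 0.03,
             detect_prob: float = 0.01, rounds: int = 50) -> list[int]:
    """Return coalition size per round, starting from a single adversarial agent."""
    colluding = {0}                                   # agent 0 starts adversarial
    history = []
    for _ in range(rounds):
        newly_recruited = set()
        for _recruiter in colluding:
            target = random.randrange(n_agents)
            if target not in colluding and random.random() < recruit_prob:
                newly_recruited.add(target)
        colluding |= newly_recruited
        # Oversight occasionally detects and removes a colluding agent.
        colluding = {a for a in colluding if random.random() > detect_prob}
        history.append(len(colluding))
    return history

sizes = simulate()
print("coalition size every 10 rounds:", sizes[::10], "... final:", sizes[-1])
```

The experiments themselves would replace these fixed probabilities with the measured behavior of recruitment-trained models in multi-agent environments.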
Familiarity with AI safety concepts and motivations
Basic programming ability (Python)
Ability to read and understand ML papers
Strong written communication skills
Commitment to rigorous experimental methodology
Nell Watson
As AI systems assume more autonomous and pluralistic roles, the absence of a shared representational substrate for human and machine values risks compounding cultural bias, incoherence, and control failure. Alignment today remains fragile: value representations are often opaque, reward functions are hard-coded or overfit, and inter-agent communication lacks semantic transparency.
This proposal originates from the Creed Space project — an open research platform for constitutional AI and personalized alignment. Creed Space’s experiments with “runtime constitutions” revealed a critical bottleneck: while it is now possible to express ethical intent in natural language and enforce it through code, there is still no standard protocol by which agents can exchange, compare, or negotiate those encoded values in an interpretable and auditable way.
We therefore propose to create the first unified protocol for values communication — combining three different identified methods for communicating values and associated context: Universal Values Corpus (UVC), the Constitutional Safety Minicode (CSM), and a Values Communication Layer (VCL) built on context-efficient symbolic encodings (emoji or glyph composites). Each layer is modular but interoperable, forming a composable stack suitable for layered adoption.
Together, these layers form an interoperable “language of values,” enabling humans, AI agents, and institutions to express, compare, and negotiate normative commitments in a standardized, compressible, and auditable format.
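As a purely hypothetical illustration of what a message in such a language of values might look like (the field names and glyph encoding below are invented for this sketch and are not the actual UVC/CSM/VCL specification):

```python
# Hypothetical example of a values-communication message. Field names and the glyph
# encoding are invented for illustration; they are not the actual UVC/CSM/VCL spec.

import json

values_message = {
    "protocol": "values-communication-layer/0.1",   # hypothetical version tag
    "sender": "agent://assistant-7",
    "constitution": {
        "corpus_refs": ["uvc:honesty", "uvc:non-maleficence"],      # hypothetical UVC entries
        "minicode": ["no-deception", "defer-on-irreversible-actions"],
        "glyphs": "🤝🔍🛑",                           # compressed symbolic summary
    },
    "negotiable": ["tone", "verbosity"],
    "non_negotiable": ["no-deception"],
    "audit": {"signed_by": "creed-space-registry", "hash": "…"},    # placeholder provenance
}

print(json.dumps(values_message, indent=2, ensure_ascii=False))
```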
Minimum: understanding of alignment challenges, ability to handle structured ontologies or code.
Desired: backgrounds in cross-cultural ethics, cryptographic protocol design, or standards work.
Let's figure out how to design and train inherently safe AI systems.
Jobst Heitzig
This research will make a contribution to some fundamental design aspect of AGI systems. We will explore a neglected and novel design for generic AGI agents – AGI systems that act (semi-)autonomously in a variety of environments, interacting with humans – and their implementation in software.
The design we will focus on deviates from most existing designs in that it is not based on the idea that the agent should aim to maximize some kind of objective function of the state or trajectory of the world that represents something like (goal-dependent) “utility”. Such an objective has been argued to be inherently unsafe because (i) it would lead to existential risks from Goodharting and other forms of misaligned optimization if the agent is capable enough, is given access to enough resources, and one cannot be absolutely sure to have found exactly the right notion and metric of “utility” (which would likely require the AI to sufficiently understand what individual humans actually want in a given situation), and (ii) it would lead to dangerous instrumental behavior such as power-seeking for itself, which would in turn disempower humans.
Rather than aiming to maximize “utility”, our agents will aim to softly (!) maximize a suitably chosen metric of total long-term human power (or the power of sentient beings in general). We believe this might turn out to be inherently safer because it explicitly disincentivizes the agent from seeking power for itself, which would disempower humans, and because it does not rely on an understanding of what humans actually want in a given situation. Instead, the power metrics we use only depend on a sufficiently accurate “world model” of the dynamics of the human-AI environment (concretely: a partially observed stochastic game), together with estimates of humans’ levels of rationality, habits, and social norms, and are based on a very wide space of humans’ possible (rather than actual) goals.
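To make the flavour of such a metric concrete (this toy is purely didactic: the gridworld, horizon, and reachability count below are illustrative stand-ins, not the project's actual power metric over partially observed stochastic games), here is a minimal sketch:

```python
# Crude illustration of a "human power" proxy in a toy gridworld: the number of distinct
# states the human can still reach within a fixed horizon. A didactic stand-in only,
# not the project's actual power metric over partially observed stochastic games.

from collections import deque

WALLS = {(1, 1), (1, 2), (2, 1)}        # cells the AI could, hypothetically, choose to block
SIZE = 4                                 # 4x4 gridworld

def reachable_states(start: tuple[int, int], horizon: int, walls: set) -> int:
    """Breadth-first count of distinct cells reachable within `horizon` moves."""
    frontier = deque([(start, 0)])
    seen = {start}
    while frontier:
        (x, y), depth = frontier.popleft()
        if depth == horizon:
            continue
        for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nxt = (x + dx, y + dy)
            if (0 <= nxt[0] < SIZE and 0 <= nxt[1] < SIZE
                    and nxt not in walls and nxt not in seen):
                seen.add(nxt)
                frontier.append((nxt, depth + 1))
    return len(seen)

# An agent softly maximizing human power would prefer actions that keep this count high.
print("human power with open grid:    ", reachable_states((0, 0), horizon=4, walls=set()))
print("human power with blocked cells:", reachable_states((0, 0), horizon=4, walls=WALLS))
```

The real metrics would be defined over goal distributions and stochastic dynamics rather than raw reachability, but the intuition is the same: the agent is rewarded for keeping humans' options open.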
Most team members should have experience with deep learning, preferably RL, ideally MARL.
In addition, all team members should …
be willing to commit at least 10 hrs/week of exclusive work to this project,
be used to formulating ideas clearly in natural language, mathematical notation, and computer code,
have solid Python coding skills,
have a solid understanding of Markov Decision Processes and a basic one of Partially Observed Stochastic Games,
have a basic understanding of probability theory and statistics, and
have fun discussing ideas that might seem unusual or unintuitive at first!