News

September 27, 2024
 – 
Research

Do models really internalize our preferences?

Apart Research's newest paper (written with academics from the University of Oxford, Cambridge, and Cynch.ai) examines whether models actually internalize human preferences. Why does this matter? Because if an LLM’s behavior diverges from human feedback, unintended consequences may arise.
September 13, 2024
 – 
Events

Can startups be impactful in AI safety?

This post details the top projects from our technical AI safety startups hackathon where researchers and entrepreneurs joined from across the world.
August 24, 2024
 – 
AI Security

Where we are on for-profit AI safety

Read about how Big Tech's AI race leaves safety in the dust, non-profits struggle to keep up, and the challenges for-profit AI safety ventures must overcome to leverage resources and make a real impact.
July 23, 2024
 – 
Research

Finding Deception in Language Models

This June, Apart Research and Apollo Research joined forces to host the Deception Detection Hackathon, bringing together students, researchers and engineers from around the world to tackle one of the most pressing challenges in AI safety: Preventing AI from deceiving humans.
June 20, 2024
 – 
Events

Code Red LLM Evaluations Hackathon Wrap Up (METR and Apart)

A few months ago, Apart, in collaboration with METR, ran the Code Red Hackathon to engage talent across the world in impactful AI safety research. Our 128 participants submitted more than 200 project ideas, 100 detailed task specifications, and more than 20 complete implementations! In this post, we also get an exclusive interview with one of the winners.
June 13, 2024
 – 
Research

Results from the AI x Democracy Research Sprint

We ran a 3-day research sprint on AI governance, motivated by the need to demonstrate the risks AI poses to democracy in support of AI governance work. Here we share the 4 winning projects, but many of the other 21 entries were also incredibly interesting, and we suggest you take a look.
May 17, 2024
 – 
Guides

The ultimate guide to AI safety research hackathons

Research hackathons are an amazing way to dive into a new field, collaborate with passionate people, and create impactful projects in just a short weekend. Based on experience organizing and participating in several AI safety hackathons with Apart Research, here are some key tips to help you get the most out of your hackathon experience:
April 19, 2024
 – 
Events

Join us at the AI x Democracy research hackathon

Participate online or in person on the weekend of May 3rd to 5th in an exciting and intense AI safety research hackathon focused on demonstrating and extrapolating risks to democracy from real-life threat models. We invite researchers, cybersecurity professionals, and governance experts to join, but it is open to everyone, and we will introduce starter code templates to help you kickstart your team's projects. Join at apartresearch.com/event/ai-democracy.
March 18, 2024
 – 
Events

Join the AI Evaluation Tasks Bounty Hackathon with METR

In this collaboration between METR and Apart, you get the chance to contribute directly to model evaluations research. Take part in the Code Red Hackathon, where you can earn money, connect with experts, and help create tasks to evaluate frontier AI systems.
March 1, 2024
 – 
Guides

How to organize a research hackathon

Organizing a hackathon can bring a unique and exciting energy to people interested in AI safety research! This post summarizes how you can organize a successful hackathon.
February 1, 2024
 – 
AI Security

For-profit AI Safety

AI development attracts more than $67 billion in yearly investments, contrasting sharply with the $250 million allocated to AI safety. This gap suggests there's a large opportunity for AI safety to tap into the commercial market. The big question then is, how do you close that gap?
January 23, 2024
 – 
Guides

Taking your next steps after a research hackathon

Your journey into the world of AI safety is definitely not over when the research hackathon ends! Besides the chance to join the Apart Lab Fellowship, we have collected a bunch of resources here for you to dive even deeper into the field!
December 12, 2023
 – 
Community

Why organize a research hackathon?

There are many reasons to run a hackathon. One of the main ones is that hackathons are an amazing way to engage local groups in AI security research and create a sense of community. Participants get practical research experience and can show their finished projects to potential employers and colleagues. It's also a really fun way to spend a weekend.
July 13, 2023
 – 
Guides

Updated quickstart guide for mechanistic interpretability

Written by Neel Nanda, who previously worked on mechanistic interpretability under Chris Olah at Anthropic and is currently a researcher on the DeepMind mechanistic interpretability team.
February 22, 2023
 – 
Events

Results from the Scale Oversight hackathon

Check out the top projects from the "Scale Oversight" hackathon hosted in February 2023: Playing games with LLMs, scaling of prompt specificity, and more.
January 2, 2023
 – 
Events

Results from the AI testing hackathon

See the winning projects from the AI testing hackathon held in December 2022: Trojan networks, unsupervised latent knowledge representation, and token loss trajectories to target interpretability methods.
November 21, 2022
 – 
Events

Results from the language model hackathon

See the winning projects from the language model hackathon hosted in November 2022: GPT-3 shows sycophancy, OpenAI's flagging is biased, and truthfulness is sensitive to prompt design.
November 17, 2022
 – 
Events

Results from the interpretability hackathon

Read the winning projects from the interpretability hackathon hosted in November 2022: Automatic interpretability, backup backup name mover heads, and "loud facts" in memory editing.
October 4, 2024
 – 
Community

Apart News: Agents, Submissions & Spain

Apart News is our newsletter to keep you up-to-date.
September 27, 2024
 – 
Community

Apart News: New Research, NeurIPS Papers & Team Offsite

Apart News is our newsletter to keep you up-to-date.
September 20, 2024
 – 
Community

Apart News: o1, Awards & Singapore

Apart News is our newsletter to keep you up-to-date.
September 13, 2024
 – 
Community

Apart News: AI Startups, India & Concordia

Apart News is our newsletter to keep you up-to-date.