
Deception Detection Hackathon: Can we prevent AI from deceiving humans?

June 28, 2024 7:00 PM to July 1, 2024 3:00 AM (UTC)
This event is finished. It occurred between June 28, 2024 and July 1, 2024.

Join us for the Deception Detection Hackathon: Ensuring Trustworthy AI

As artificial intelligence systems become increasingly advanced and capable, it is crucial that we develop robust methods to detect and prevent deceptive behavior. The potential for AI to mislead and manipulate humans poses significant risks to society, and we must proactively address these challenges to ensure a future where AI remains safe, transparent, and trustworthy.

We invite you to participate in the Deception Detection Hackathon, a collaborative event aimed at developing innovative techniques and benchmarks to identify and mitigate deceptive behavior in AI systems. Over the course of a weekend, you'll work alongside researchers, developers, and experts in AI safety to create solutions that can help prevent AI from deceiving humans.

Why Deception Detection Matters

Deception in AI, a severely under-explored concept, occurs when an AI system is capable of deceiving a user, either because a malicious actor designed it to or because its goals are misaligned. Such systems may appear aligned with users' and humans' values during training and evaluation but pursue malign objectives once deployed, potentially causing harm or undermining trust in AI.

Examples of such work can be found in:

To mitigate these risks, we must develop robust deception detection methods that can identify instances of strategic deception, make headway on understanding AI capabilities for deception, and prevent AI systems from misleading humans. By participating in this hackathon, you'll contribute to the critical task of ensuring that AI remains transparent, accountable, and aligned with human values.

What to Expect at the Deception Detection Hackathon

During the hackathon, you'll have the opportunity to:

  • Learn from experts in AI safety, deceptive alignment, and strategic deception
  • Collaborate with a diverse group of participants to ideate and develop deception detection techniques
  • Create benchmarks and evaluation methods to assess the effectiveness of deception detection approaches
  • Compete for prizes and recognition for the most innovative and impactful solutions
  • Network with like-minded individuals passionate about ensuring the safety and trustworthiness of AI

Whether you're an AI researcher, developer, or enthusiast, this hackathon provides a unique platform to apply your skills and knowledge to address one of the most pressing challenges in AI safety.

Join us in late June for a weekend of collaboration, innovation, and problem-solving as we work together to prevent AI from deceiving humans. Stay tuned for more details on the exact dates, format, and registration process.

Don't miss this opportunity to contribute to the development of trustworthy AI systems and help shape a future where AI and humans can work together safely and transparently. Let's hack for a deception-free AI future!

📍 Registered jam sites

Besides remote and virtual participation, our amazing organizers also host local hackathon locations where you can meet up in person and connect with others in your area.
Register the first event below!

🏠 Register a location

The in-person events for the Apart Sprints are run by passionate individuals just like you! We organize the schedule, speakers, and starter templates, and you can focus on engaging your local research, student, and engineering community. Read more about organizing.
