esbenkc

Organizations

@apartresearch
Starred repositories

Which objects are visible through the holes in a picture book? This visual task is easy for adults, doable for primary schoolers, but hard for vision transformers.

Jupyter Notebook 1 Updated Jul 26, 2024

Draw more samples

Python 129 16 Updated Jun 23, 2024

Jupyter Notebook 2 Updated Jul 24, 2024

This repository contains code for the Democracy x AI Hackathon by Apart Research

Jupyter Notebook 6 2 Updated May 9, 2024

JavaScript 1 Updated May 29, 2024

🤝 Cyberwarfare vulnerabilities for democracy

Python 2 Updated May 5, 2024

How to get started in evaluations and demonstrations research for dangerous capabilities

5 1 Updated May 24, 2024

A GPT-empowered penetration testing tool

Python 6,768 802 Updated Jun 22, 2024

PrimeVul with the assets under version control on GitHub, not on Google Drive

Python 1 Updated Mar 30, 2024

WMDP is an LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning method which reduces LLM performance on WMDP while retaining …

Jupyter Notebook 55 13 Updated Apr 27, 2024

Test your AI model's security through CLI

Python 14 2 Updated Aug 1, 2024

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Python 12,165 1,222 Updated Jul 29, 2024

we got you bro

30 Updated Jul 29, 2024

🔐 Make sure AI applications are not injecting 1) suspicious API calls, 2) vulnerabilities, and 3) rogue capabilities

JavaScript 2 Updated Apr 3, 2024

HTML 41 3 Updated Jun 26, 2024

Contains open source code for the paper "Perfectly-secure Steganography using Minimum Entropy Coupling"

Python 46 8 Updated Aug 1, 2023

AI Chat Browser: Fast, Full webapp access to ChatGPT / Claude / Bard / Bing / Llama2! I use this 20 times a day.

TypeScript 4,114 319 Updated Jul 29, 2024

🚨 METR Task Standard fork for the Code Red Hackathon

TypeScript 1 Updated Feb 29, 2024

METR Task Standard

TypeScript 91 24 Updated Aug 1, 2024

An easy-to-use Python framework to generate adversarial jailbreak prompts.

Python 349 31 Updated Jul 12, 2024

Python code for "Fishing for the answer: Mapping the flow of information in LLM agent groups using lessons from fish schools" submitted to Apart Research Multi-Agent Security Hackathon 2024.

Python 4 Updated Mar 7, 2024

Dark Patterns in Chatbot Design

HTML 5 3 Updated Jun 15, 2024

Repo for the paper on Escalation Risks of AI systems

Python 34 6 Updated Apr 12, 2024

✱ Interpreting how similar sequence continuation tasks share internal representations ✱

Jupyter Notebook 1 Updated Jul 1, 2024

🎧 Fully automated Wikipedia audiobooks [WIP]

HTML 3 Updated Jan 30, 2024

Examples using the Leap Interpretability Engine!

6 3 Updated Dec 22, 2023

Jupyter Notebook 14 2 Updated Mar 31, 2024

Enable decision-making based on simulations

Python 217 21 Updated May 15, 2024