Skip to content
View esbenkc's full-sized avatar

Organizations

@apartresearch
Block or Report

Block or report esbenkc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Draw more samples

Python 94 12 Updated Jun 23, 2024
Jupyter Notebook 2 Updated May 27, 2024

This repository contains code for the Democracy x AI Hackathon by Apart Research

Jupyter Notebook 4 2 Updated May 9, 2024
JavaScript 1 Updated May 29, 2024

🀝 Cyberwarfare vulnerabilities for democracy

Python 2 Updated May 5, 2024

How to get started in evaluations and demonstrations research for dangerous capabilities

5 1 Updated May 24, 2024

A GPT-empowered penetration testing tool

Python 6,626 788 Updated Jun 22, 2024

PrimeVul with the assets under version control on github, not on google drive

Python 1 Updated Mar 30, 2024

WMDP is a LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning method which reduces LLM performance on WMDP while retaining …

Jupyter Notebook 50 10 Updated Apr 27, 2024

Test your AI model's security through CLI

Python 13 2 Updated Jun 21, 2024

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Python 11,777 1,168 Updated Jun 23, 2024

we got you bro

28 Updated Apr 1, 2024

πŸ” Make sure AI applications are not injecting 1) suspicious API calls, 2) vulnerabilities, and 3) rogue capabilities

JavaScript 2 Updated Apr 3, 2024
HTML 32 3 Updated Apr 6, 2024

Contains open source code for the paper "Perfectly-secure Steganography using Minimum Entropy Coupling"

Python 44 8 Updated Aug 1, 2023

AI Chat Browser: Fast, Full webapp access to ChatGPT / Claude / Bard / Bing / Llama2! I use this 20 times a day.

TypeScript 4,076 320 Updated Jun 10, 2024

🚨 METR Task Standard fork for the Code Red Hackathon

TypeScript 1 Updated Feb 29, 2024

METR Task Standard

TypeScript 76 23 Updated Jun 21, 2024

An easy-to-use Python framework to generate adversarial jailbreak prompts.

Python 302 27 Updated Apr 25, 2024

Python code for "Fishing for the answer: Mapping the flow of information in LLM agent groups using lessons from fish schools" submitted to Apart Research Multi-Agent Security Hackathon 2024.

Python 4 Updated Mar 7, 2024

Dark Patterns in Chatbot Design

HTML 4 3 Updated Jun 15, 2024

Repo for the paper on Escalation Risks of AI systems

Python 33 6 Updated Apr 12, 2024

✱ Interpreting how similar sequence continuation tasks share internal representations ✱

Jupyter Notebook 1 Updated Jun 12, 2024

🎧 Fully automated Wikipedia audiobooks [WIP]

HTML 3 Updated Jan 30, 2024

Examples using the Leap Interpretability Engine!

6 3 Updated Dec 22, 2023
Jupyter Notebook 13 2 Updated Mar 31, 2024

Enable decision-making based on simulations

Python 216 20 Updated May 15, 2024
Python 1 Updated Feb 12, 2024
Python 2,450 296 Updated May 19, 2024
Next