JFChi
🎯 Focusing


A trivial programmatic Llama 3 jailbreak. Sorry Zuck!

Python 446 50 Updated Apr 28, 2024

Papers and resources related to the security and privacy of LLMs 🤖

Python 306 20 Updated Jun 30, 2024

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 7,995 564 Updated Jul 6, 2024

ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors

Python 95 5 Updated Jun 25, 2024

We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20 via OpenAI’s APIs.

Python 201 17 Updated Feb 23, 2024

Building a quick conversation-based search demo with Lepton AI.

TypeScript 7,511 964 Updated Jun 22, 2024

🐙 Guides, papers, lectures, notebooks and resources for prompt engineering

MDX 45,978 4,422 Updated Jul 7, 2024

LlamaIndex is a data framework for your LLM applications

Python 33,352 4,668 Updated Jul 7, 2024

Multilingual safety benchmark for Large Language Models

16 1 Updated Oct 13, 2023

Collection of scripts to aggregate image data for the purposes of training an NSFW Image Classifier

Shell 12,197 2,877 Updated Jan 21, 2024

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Python 3,734 331 Updated Jul 5, 2024

An unofficial implementation of AutoDAN attack on LLMs (arXiv:2310.15140)

Python 23 6 Updated Feb 8, 2024

Set of tools to assess and improve LLM security.

Python 2,142 355 Updated Jul 3, 2024

All the projects related to Llama

Jupyter Notebook 353 69 Updated May 29, 2024

Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought, Instruction-Tuning and Multimodality.

1,264 68 Updated Jun 30, 2024

Awesome-LLM: a curated list of Large Language Models

15,991 1,279 Updated Jul 6, 2024

New ways of breaking app-integrated LLMs

Jupyter Notebook 1,739 111 Updated Jun 17, 2023

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

Python 957 77 Updated Aug 16, 2023

Deal or No Deal? End-to-End Learning for Negotiation Dialogues

Python 1,376 276 Updated May 4, 2020

FFB: A Fair Fairness Benchmark for In-Processing Group Fairness Methods.

Python 23 2 Updated May 10, 2024
Python 10 2 Updated May 25, 2023

Official implementation of the TabPFN paper (https://arxiv.org/abs/2207.01848) and the tabpfn package.

Python 1,134 104 Updated Apr 28, 2024

Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates.

Jupyter Notebook 1,423 89 Updated Jul 7, 2024

OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning.

Python 519 27 Updated Oct 3, 2023

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Python 8,119 819 Updated Jul 7, 2024

Robust machine learning for responsible AI

Python 438 52 Updated Mar 19, 2024

Text perturbation methods to evaluate the robustness of NLP models

Python 21 2 Updated Oct 6, 2021
Python 4 2 Updated Feb 7, 2023

[ICML 2021] Towards Understanding and Mitigating Social Biases in Language Models

Python 56 8 Updated Nov 2, 2022