Skip to content
View khipp's full-sized avatar
  • Germany
  • 07:39 (UTC +02:00)

Highlights

  • Pro
Block or Report

Block or report khipp

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

LLM101n: Let's build a Storyteller

12,800 557 Updated Jun 28, 2024

The official implementation of the ICML 2024 paper "MemoryLLM: Towards Self-Updatable Large Language Models"

Python 29 1 Updated Jun 22, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 128,356 25,466 Updated Jul 1, 2024

Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.

Rust 39,184 2,028 Updated Jun 30, 2024

Official implementation of Goldfish Loss: Mitigating Memorization in Generative LLMs

Python 53 4 Updated Jun 24, 2024

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 20,948 2,105 Updated Jun 29, 2024

Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.

Python 897 65 Updated Jun 27, 2024

LiveBench: A Challenging, Contamination-Free LLM Benchmark

Python 123 10 Updated Jun 28, 2024

"On the Privacy Risks of Algorithmic Recourse". Martin Pawelczyk, Himabindu Lakkaraju* and Seth Neel*. In International Conference on Artificial Intelligence and Statistics (AISTATS), PMLR, 2023.

Jupyter Notebook 5 Updated Mar 26, 2023

[ICML 2024] Selecting High-Quality Data for Training Language Models

Python 108 9 Updated Jun 20, 2024

A framework for few-shot evaluation of language models.

Python 5,706 1,521 Updated Jul 1, 2024

An Open Robustness Benchmark for Jailbreaking Language Models [arXiv 2024]

Python 109 12 Updated Jun 13, 2024

[ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning

Python 71 4 Updated May 23, 2024

Source code for paper "Sampling-based Pseudo-Likelihood for Membership Inference Attacks".

Python 4 1 Updated Apr 18, 2024

Enhancing small language models with LLM generated counterfactuals.

Python 5 Updated Sep 15, 2023
Python 10 3 Updated Mar 13, 2024
Jupyter Notebook 12 Updated Feb 21, 2024

Min-K%++: Improved baseline for detecting pre-training data of LLMs https://arxiv.org/abs/2404.02936

Python 21 2 Updated Jun 10, 2024

PAL: Proxy-Guided Black-Box Attack on Large Language Models

Python 37 3 Updated Jun 2, 2024
Python 31 5 Updated Jun 13, 2024

A Comprehensive Assessment of Trustworthiness in GPT Models

Python 225 50 Updated Jun 19, 2024

We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20 via OpenAI’s APIs.

Python 195 17 Updated Feb 23, 2024

Weak-to-Strong Jailbreaking on Large Language Models

Python 55 7 Updated Feb 21, 2024

Python package for measuring memorization in LLMs.

Jupyter Notebook 79 11 Updated May 10, 2024

A comprehensive toolbox for model inversion attacks and defenses, which is easy to get started.

Python 97 3 Updated Jun 22, 2024
Python 2 Updated Feb 21, 2024

Few-Shot Membership Inference Attacks

Python 1 Updated Apr 24, 2023

Implementation of "Membership Inference Attacks against Language Models via Neighbourhood Comparison" by Justus Mattern, Fatemehsadat Mireshghallah, Zhijing Jin, Bernhard Schölkopf, Mrinmaya Sachan…

Python 1 Updated Nov 13, 2023

Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confidence Estimation"

Python 20 Updated May 8, 2023
Next