Skip to content
View ZFancy's full-sized avatar
💥
Focusing
💥
Focusing

Block or report ZFancy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
448 results for source starred repositories
Clear filter

We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20 via OpenAI’s APIs.

Python 239 28 Updated Feb 23, 2024

[NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"

10 Updated Oct 30, 2024

This repo contains papers, books, tutorials and resources on Riemannian optimization.

14 1 Updated Nov 8, 2024

Using Explanations as a Tool for Advanced LLMs

Python 50 1 Updated Sep 11, 2024

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥

987 56 Updated Sep 23, 2024

source code for NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection"

Python 18 3 Updated Oct 4, 2024
Python 66 10 Updated Nov 13, 2023

[NeurIPS 2024] Official implementation for "AgentPoison: Red-teaming LLM Agents via Memory or Knowledge Base Backdoor Poisoning"

Python 55 5 Updated Aug 3, 2024

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curatio…

Python 1,854 161 Updated Nov 7, 2024

Code for the paper 🌳 Tree Search for Language Model Agents

Python 138 17 Updated Jul 25, 2024

agent q - oss advanced reasoning and learning for autonomous ai agents

Python 341 72 Updated Sep 26, 2024

VisualWebArena is a benchmark for multimodal agents.

Python 236 45 Updated Nov 9, 2024

[ICML2024] Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast

Python 88 12 Updated Mar 26, 2024

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 1,035 91 Updated May 8, 2024

Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"

Python 83 11 Updated Mar 22, 2024

A programming framework for agentic AI 🤖

Jupyter Notebook 33,051 4,811 Updated Nov 10, 2024
Python 2 1 Updated Oct 25, 2024
Python 15 Updated Nov 4, 2024

[CVPR 2024] Official Repository for "Efficient Test-Time Adaptation of Vision-Language Models"

Python 61 5 Updated Jul 15, 2024

https://arxiv.org/pdf/2402.18025

22 1 Updated Aug 27, 2024

Repo for the research paper "Aligning LLMs to Be Robust Against Prompt Injection"

Python 18 1 Updated Oct 29, 2024

Repository for research works and resources related to model reprogramming <https://arxiv.org/abs/2202.10629>

60 1 Updated Mar 15, 2024

Holistic evaluation of multimodal foundation models

Python 41 Updated Aug 11, 2024

[NeurIPS 2024] "What If the Input is Expanded in OOD Detection?"

Python 3 1 Updated Oct 25, 2024

PAIR.withgoogle.com and friend's work on interpretability methods

JavaScript 148 30 Updated Oct 29, 2024

The dataset and code for the ICLR 2024 paper "Can LLM-Generated Misinformation Be Detected?"

Shell 51 5 Updated Nov 9, 2024

"Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?"

Python 58 4 Updated Oct 11, 2024
Next