ZFancy

Follow

💥

Focusing

Jianing Zhu ZFancy

💥

Focusing

Follow

CS Ph.D. Student @ HKBU & TMLR Group

46 followers · 101 following

Department of Computer Science, HKBU
Hong Kong
15:20 (UTC +08:00)
https://zfancy.github.io/

Achievements

Achievements

Stars

448 results for source starred repositories

LLM-Tuning-Safety / LLMs-Finetuning-Safety

We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20 via OpenAI’s APIs.

Python 239 28 Updated Feb 23, 2024

tmlr-group / NoisyRationales

[NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"

10 Updated Oct 30, 2024

andyjm3 / Awesome-Riemannian-Optimization

This repo contains papers, books, tutorials and resources on Riemannian optimization.

14 1 Updated Nov 8, 2024

JacksonWuxs / UsableXAI_LLM

Using Explanations as a Tool for Advanced LLMs

Python 50 1 Updated Sep 11, 2024

MetaGLM / zhipuai-sdk-python-v4

Python 165 20 Updated Sep 4, 2024

zchoi / Awesome-Embodied-Agent-with-LLMs

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥

987 56 Updated Sep 23, 2024

deeplearning-wisc / haloscope

source code for NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection"

Python 18 3 Updated Oct 4, 2024

arobey1 / smooth-llm

Python 66 10 Updated Nov 13, 2023

BillChan226 / AgentPoison

[NeurIPS 2024] Official implementation for "AgentPoison: Red-teaming LLM Agents via Memory or Knowledge Base Backdoor Poisoning"

Python 55 5 Updated Aug 3, 2024

BAAI-Agents / Cradle

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curatio…

Python 1,854 161 Updated Nov 7, 2024

kohjingyu / search-agents

Code for the paper 🌳 Tree Search for Language Model Agents

Python 138 17 Updated Jul 25, 2024

sentient-engineering / agent-q

agent q - oss advanced reasoning and learning for autonomous ai agents

Python 341 72 Updated Sep 26, 2024

web-arena-x / visualwebarena

VisualWebArena is a benchmark for multimodal agents.

Python 236 45 Updated Nov 9, 2024

sail-sg / Agent-Smith

[ICML2024] Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast

Python 88 12 Updated Mar 26, 2024

uclaml / SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 1,035 91 Updated May 8, 2024

ucl-dark / llm_debate

Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"

Python 83 11 Updated Mar 22, 2024

microsoft / autogen

A programming framework for agentic AI 🤖

Jupyter Notebook 33,051 4,811 Updated Nov 10, 2024

warriors-30 / SCT

Python 2 1 Updated Oct 25, 2024

sail-sg / Meta-Unlearning

Python 15 Updated Nov 4, 2024

kdiAAA / TDA

[CVPR 2024] Official Repository for "Efficient Test-Time Adaptation of Vision-Language Models"

Python 61 5 Updated Jul 15, 2024

collin-burns / discovering_latent_knowledge

Python 252 36 Updated Mar 2, 2024

LeiLiLab / LingoLLM

https://arxiv.org/pdf/2402.18025

22 1 Updated Aug 27, 2024

facebookresearch / SecAlign

Repo for the research paper "Aligning LLMs to Be Robust Against Prompt Injection"

Python 18 1 Updated Oct 29, 2024

IBM / model-reprogramming

Repository for research works and resources related to model reprogramming <https://arxiv.org/abs/2202.10629>

60 1 Updated Mar 15, 2024

pliang279 / HEMM

Holistic evaluation of multimodal foundation models

Python 41 Updated Aug 11, 2024

ZBox1005 / CoVer

[NeurIPS 2024] "What If the Input is Expanded in OOD Detection?"

Python 3 1 Updated Oct 25, 2024

PAIR-code / interpretability

PAIR.withgoogle.com and friend's work on interpretability methods

JavaScript 148 30 Updated Oct 29, 2024

llm-misinformation / llm-misinformation

The dataset and code for the ICLR 2024 paper "Can LLM-Generated Misinformation Be Detected?"

Shell 51 5 Updated Nov 9, 2024

hy-zhao23 / Explainability-for-Large-Language-Models

106 12 Updated Jan 15, 2024

Luckfort / CD

"Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?"

Python 58 4 Updated Oct 11, 2024