Skip to content
View Fake10086's full-sized avatar
🎯
Focusing
🎯
Focusing
Block or Report

Block or report Fake10086

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 18 Updated May 9, 2024

Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.

HTML 176 77 Updated Feb 7, 2024

ViT Prisma is a mechanistic interpretability library for Vision Transformers (ViTs).

Python 145 12 Updated Jul 18, 2024

This is the first released survey paper on hallucinations of large vision-language models (LVLMs). To keep track of this field and continuously update our survey, we maintain this repository of rel…

21 3 Updated Jul 26, 2024

[ICML 2024] Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning

Python 40 5 Updated May 12, 2024

LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images

Python 276 14 Updated Jul 30, 2024

Efficient Multimodal Large Language Models: A Survey

198 7 Updated May 31, 2024

[ECCV 2024] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models

Python 177 8 Updated Jul 4, 2024

A comprehensive survey on Internal Consistency and Self-Feedback in Large Language Models, including theoretical frameworks, task classifications, evaluation methods, future research directions and…

TeX 25 1 Updated Jul 23, 2024
Jupyter Notebook 11 2 Updated Jun 19, 2023

Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey

257 13 Updated May 16, 2024

Easy multi-task learning with HuggingFace Datasets and Trainer

Python 40 3 Updated Jun 8, 2024

The trainer for HF to record losses of different tasks and objectives.

Python 2 1 Updated Jul 10, 2024

Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning

Python 137 12 Updated Feb 6, 2024

Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".

Python 57 11 Updated Jul 1, 2024

Sparse autoencoders

Python 217 26 Updated Jul 30, 2024

Training Sparse Autoencoders on Language Models

HTML 260 81 Updated Jul 31, 2024

Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).

HTML 111 27 Updated Jul 27, 2024

[CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs

Python 113 2 Updated Jul 23, 2024

The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Levy. EMNLP, 2021.

Python 76 5 Updated Sep 5, 2021

Creative interactive views of any dataset.

Python 819 42 Updated Feb 25, 2024

The attention heads in the Transformer architecture possess a variety of capabilities. This is a carefully compiled list that summarizes the diverse functions of the attention heads.

52 3 Updated Jul 31, 2024

My solutions to DLFC - Deep Learning: Foundations and Concepts

31 7 Updated Jul 18, 2024

This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer Models by Reordering their Sublayers.

Python 55 2 Updated Jan 1, 2021
Next