View dwzhu-pku's full-sized avatar

Highlights

  • Pro

Organizations

@PKU-TANGENT

Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs

Python 12 1 Updated Jul 2, 2024

awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc.

87 5 Updated Jun 28, 2024

LOFT: A 1 Million+ Token Long-Context Benchmark

85 2 Updated Jun 21, 2024

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

1,193 63 Updated Jul 3, 2024
Python 9 Updated Jun 23, 2024

Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement

8 Updated Jun 17, 2024

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and…

Python 6,249 1,224 Updated Jul 3, 2024

Efficient retrieval head analysis with triton flash attention that supports topK probability

Jupyter Notebook 12 Updated Jun 15, 2024
Jupyter Notebook 108 1 Updated Jul 3, 2024

Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models

115 4 Updated Jun 12, 2024

The repo for In-context Autoencoder

Jupyter Notebook 55 2 Updated May 11, 2024

✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

295 8 Updated Jun 18, 2024

[ACL 2024] A Prospector of Long-Dependency Data for Large Language Models

Python 38 1 Updated Jun 11, 2024
Python 10 1 Updated May 30, 2024

This is the official implementation of "CAPE: Context-Adaptive Positional Encoding for Length Extrapolation"

Python 6 Updated May 23, 2024

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Python 327 18 Updated Jul 2, 2024

🐚 OpenDevin: Code Less, Make More

Python 28,282 3,246 Updated Jul 3, 2024

BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.

Jupyter Notebook 109 11 Updated Jun 28, 2024

Sequence Parallel Attention for Long Context LLM Model Training and Inference

Python 205 7 Updated Jun 27, 2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

2,850 107 Updated Jun 26, 2024

The official repo for "LLoCo: Learning Long Contexts Offline"

Python 101 9 Updated Jun 15, 2024

Go ahead and axolotl questions

Python 6,811 748 Updated Jul 2, 2024

open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality

Python 120 10 Updated Jun 19, 2024

personal website

Ruby 1 Updated Apr 24, 2024
Python 138 4 Updated May 1, 2024

LongHeads: Multi-Head Attention is Secretly a Long Context Processor

Python 23 1 Updated Apr 8, 2024

The official Meta Llama 3 GitHub site

Python 22,789 2,387 Updated Jul 3, 2024

Official implementation for the paper "LongEmbed: Extending Embedding Models for Long Context Retrieval"

Python 100 6 Updated Apr 26, 2024