Stars
Distributed platform for building autonomic network functions.
Official implementation for DiffusionVG: Exploring Iterative Refinement with Diffusion Models for Video Grounding
Everything for the Paper: 'Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing'
This is the repo for the paper Shepherd -- A Critic for Language Model Generation
The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey"
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Official pytorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023)
Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models
Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset
Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"
A large-scale information-rich web dataset, featuring millions of real clicked query-document labels
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
The official repository for "Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation", Shengyao Zhuang, Houxing Ren, Linjun Shou, Jian Pei, Ming Gong, …
Learning to Tokenize for Generative Retrieval (NeurIPS 2023)
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
16-fold memory access reduction with nearly no loss
A library for advanced large language model reasoning
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)
Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions
TruthfulQA: Measuring How Models Imitate Human Falsehoods
[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark
[ACL 2024] LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks