Highlights
- Pro
Stars
OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.
PyTorch code and models for V-JEPA self-supervised learning from video.
Implementation of Diffusion Transformer (DiT) in JAX
π€ LeRobot: Making AI for Robotics more accessible with end-to-end learning
ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate
LLM Agora, debating between open-source LLMs to refine the answers
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
Solutions provided to Chip Huyen's Machine Learning Interview Book with GPT
LAVIS - A One-stop Library for Language-Vision Intelligence
[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering
Official repo for CVPR 2022 (Oral) paper: Revisiting the "Video" in Video-Language Understanding. Contains code for the Atemporal Probe (ATP).
γICLR 2024π₯γ Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
QLoRA: Efficient Finetuning of Quantized LLMs
A curated list of awesome vision and language resources (still under construction... stay tuned!)
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
π Jekyll theme for building a personal site, blog, project documentation, or portfolio.
Train transformer language models with reinforcement learning.
A collection of design patterns/idioms in Python
π Create and deploy a dynamic portfolio by just providing your GitHub username.
End-to-End Object Detection with Transformers
Curated list of awesome tools, demos, docs for ChatGPT and GPT-3