Highlights
- Pro
Stars
A Gradio app that transcribes YouTube videos using audio extraction and OpenAI’s Whisper model.
This project aims to automatically translate and summarize Huggingface's daily papers into Korean using ChatGPT.
Summary of key papers and blogs about diffusion models to learn about the topic. Detailed list of all published diffusion robotics papers.
A high-throughput and memory-efficient inference and serving engine for LLMs
Open-MAGVIT2: Democratizing Autoregressive Visual Generation
Visualizing the DROID dataset using Rerun
Fast and memory-efficient exact attention
A Benchmark for Evaluating Generalization for Robotic Manipulation
[ECCV 2024 Award Candidate] PointLLM: Empowering Large Language Models to Understand Point Clouds
[CoRL 2024] Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
🔨🔨🔨(mmplot)used to draw graphs of multiple index parameters such as algorithm accuracy and speed of multiple deep learning models.
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
Tools for merging pretrained large language models.
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
DORA (Dataflow-Oriented Robotic Architecture) is middleware designed to streamline and simplify the creation of AI-based robotic applications. It offers low latency, composable, and distributed dat…
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
Reaching LLaMA2 Performance with 0.1M Dollars
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
A framework for few-shot evaluation of language models.