Stars
VQA counting task with higher object numbers
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
✨✨Latest Advances on Multimodal Large Language Models
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
CUDA implementation of Extended Long Short-Term Memory (xLSTM) with C++ and PyTorch ports
MaggiR / MAFC
Forked from google-deepmind/long-form-factuality
Multimodal automated fact-checking [WIP]
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)
A holistic way of understanding how LLaMA and its components run in practice, with code and detailed documentation.
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
Standalone evaluation scripts and starter code for the ICDAR 2023 DUDE competition
Official repository of the paper "GraFIQs: Face Image Quality Assessment Using Gradient Magnitudes"
Language Models as Multi-Modal Query Planners
LILO: Library Induction with Language Observations
Code for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".
🤯 Mindstorm in Natural Language-based Societies of Mind
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.
Code for "Analyzing Modular Approaches for Visual Question Decomposition" (EMNLP 2023)