Lists (3)
Sort Last updated
Stars
Official inference repo for FLUX.1 models
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
[ECCV 2024, Oral] Self-Supervised Video Desmoking for Laparoscopic Surgery
Official pytorch implementation of ZiRa, a method for incremental vision language object detection (IVLOD),which has been accepted by NeurIPS 2024.
Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree
NeurIPS 2024 Paper: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
A natural language interface for computers
aider is AI pair programming in your terminal
Official repository of Agent Attention (ECCV2024)
A PyTorch implementation of Lookaround optimizer (Lookaround optimizer: $k$ steps around, 1 step average)
[CVPR-2023] Official Code for DART: Diversify-Aggregate-Repeat Training Improves Generalization of Neural Networks
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
State-of-the-art Parameter-Efficient MoE Fine-tuning Method
澎湃新闻,新浪新闻,腾讯新闻,搜狐新闻,新闻联播,泰晤士报,纽约时报,BBCNews,旨在爬取所有新闻门户网站的新闻,禁止将所得数据商用!
[CVPR 2020] Transform and Tell: Entity-Aware News Image Captioning
Official repository for "FakeSV: A Multimodal Benchmark with Rich Social Context for Fake News Detection on Short Video Platforms", AAAI 2023.
MINT-1T: A one trillion token multimodal interleaved dataset.
This repository contains the csv files and documentation for a dataset of 10,258 local news outlets in the U.S. and their social media handles.
Official resource for paper Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models (ACL 2024)
Train transformer language models with reinforcement learning.
An Incremental Learning, Continual Learning, and Life-Long Learning Repository
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched