Lists (1)
Sort Name ascending (A-Z)
Stars
Run Segment Anything Model 2 on a live video stream
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
szxysdt / LLM-TPU
Forked from sophgo/LLM-TPURun RWKV models in sophgo BM1684X
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for c…
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model
Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning
MambaOut: Do We Really Need Mamba for Vision?
[Arxiv 2024] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models"
[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"
🦜🔗 Build context-aware reasoning applications
Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"
SAM with text prompt
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
[CVPR 2023 Highlight] This is the official implementation of "Stitchable Neural Networks".
Official Code for DragGAN (SIGGRAPH 2023)