Run Segment Anything Model 2 on a live video stream

Jupyter Notebook 106 17 Updated Sep 3, 2024

Muggled SAM: Segmentation without the magic

Python 20 2 Updated Sep 6, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 10,460 826 Updated Aug 21, 2024

Run RWKV models on Sophgo BM1684X

Python 2 Updated Aug 25, 2024

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

C++ 1,385 90 Updated Aug 7, 2024

An RWKV management and startup tool: fully automated, only 8MB. It also provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for c…

TypeScript 5,022 477 Updated Aug 30, 2024

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 31,992 2,382 Updated Sep 6, 2024

[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model

Python 309 11 Updated Jul 9, 2024

Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding

Python 183 11 Updated Aug 11, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,162 279 Updated May 4, 2024

MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

Python 320 18 Updated Aug 7, 2024

MambaOut: Do We Really Need Mamba for Vision?

Python 1,950 33 Updated Jun 6, 2024
Python 9 2 Updated May 16, 2023

[arXiv 2024] Official PyTorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models"

Python 136 4 Updated Apr 7, 2024

Zero-cost improvement approach in Diffusion U-ViT

Python 4 Updated Sep 3, 2024

[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"

Python 744 42 Updated Aug 5, 2024

Multi-modality pre-training

Python 467 36 Updated May 8, 2024

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 92,062 14,653 Updated Sep 8, 2024

Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"

Python 1,353 86 Updated May 31, 2023

SAM with text prompt

Jupyter Notebook 1,524 167 Updated Aug 1, 2024
Python 5 Updated Aug 20, 2023

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"

PostScript 17,554 2,144 Updated Feb 4, 2024

[CVPR 2023 Highlight] This is the official implementation of "Stitchable Neural Networks".

Python 238 11 Updated Mar 28, 2023

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,649 3,437 Updated May 18, 2024