Skip to content
View jerett's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Insta360
  • shenzhen,china

Block or report jerett

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 9,164 817 Updated Aug 7, 2024

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

Python 2,961 276 Updated Sep 26, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 47,525 5,618 Updated Sep 18, 2024

real time face swap and one-click video deepfake with only a single image

Python 39,684 5,797 Updated Nov 8, 2024

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Python 850 58 Updated Nov 4, 2024

[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.

Python 3,345 545 Updated Aug 15, 2024
C++ 9 1 Updated Aug 17, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 29,836 4,504 Updated Nov 9, 2024
Python 22 3 Updated Jul 20, 2017

Eclipse iceoryx™ - true zero-copy inter-process-communication

C++ 1,676 391 Updated Oct 29, 2024

LLM101n: Let's build a Storyteller

29,858 1,631 Updated Aug 1, 2024

User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)

TypeScript 21,583 2,175 Updated Nov 7, 2024
Python 977 128 Updated Oct 3, 2022

LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images

Python 318 15 Updated Oct 8, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,171 861 Updated Jul 1, 2024

video stabilization implementation of "A Non-linear filter for gyroscope-based video stabalization"

Python 51 19 Updated Jul 4, 2022

PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models

Python 242 11 Updated Jan 2, 2024

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 6,837 1,255 Updated Dec 6, 2023

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 35,317 4,028 Updated Nov 7, 2024

We wirte a filtflit function in java . The filtflit's output is the same as it's in Matlab .

C++ 43 19 Updated Jan 20, 2022

DSP IIR realtime filter library written in C++

C++ 639 141 Updated Aug 12, 2024

[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers

Python 171 8 Updated Sep 24, 2023

🚀 Power Your World with AI - Explore, Extend, Empower.

JavaScript 6,483 475 Updated Sep 17, 2024

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Jupyter Notebook 936 42 Updated Aug 12, 2024

Official Code for MotionCtrl [SIGGRAPH 2024]

Python 1,322 70 Updated Sep 20, 2024

This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!

Jupyter Notebook 4,803 501 Updated Jan 29, 2024

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,531 1,023 Updated Nov 6, 2024

[SIGGRAPH 2022 Journal Track] AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars

Python 1,072 94 Updated Feb 15, 2023

(IJCAI 2023) Sph2Pob: Boosting Object Detection on Spherical Images with Planar Oriented Boxes Methods

Python 5 Updated Aug 23, 2023

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 2,969 216 Updated Sep 25, 2024
Next