Skip to content
View GewelsJI's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro
Block or Report

Block or report GewelsJI

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

[CVPR 24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA) , links for downloading the trained model checkpoints, and example notebooks / gra…

Python 175 4 Updated May 12, 2024

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Python 4,331 140 Updated Jul 22, 2024

🔥Deep Learning for Face Anti-Spoofing

507 64 Updated Jul 11, 2023

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 340 11 Updated Jul 16, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 23,416 3,336 Updated Jul 23, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,583 96 Updated Jul 6, 2024

Advances in recent large vision language models (LVLMs)

7 Updated Jul 7, 2024
Python 3 Updated Jun 5, 2024

(ECCV 2024) Open-Vocabulary Camouflaged Object Segmentation

Python 11 1 Updated Jul 4, 2024

[AAAI2024] Official implementation of SurgicalSAM

Python 62 9 Updated Jul 7, 2024

Efficient Multimodal Large Language Models: A Survey

187 5 Updated May 31, 2024

A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)

794 70 Updated Jul 12, 2024

Statistics and Visualization of acceptance information, main keyword of CVPR 2022 accepted papers for the main Computer Vision conferences (CVPR/ICCV/WACV...)

Python 6 1 Updated May 9, 2024

[ICML 2024] Official repository of the paper: "Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset"

Python 63 8 Updated Jun 8, 2024

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 12,035 827 Updated Jul 23, 2024

张量计算系列教程 (Tensor Computations Tutorials)

86 7 Updated Feb 12, 2024

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

TypeScript 11,292 1,002 Updated Jul 23, 2024

The medical imaging meta-learning toolbox allows to build models that learn to learn in a setting with diverse tasks. It also provides code for working with the MIMeta Dataset as well as simple bas…

Python 33 5 Updated May 9, 2024

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 3,422 277 Updated Jul 22, 2024

The official Meta Llama 3 GitHub site

Python 23,501 2,539 Updated Jul 23, 2024

Official PyTorch code of "Grounded Question-Answering in Long Egocentric Videos", accepted by CVPR 2024.

Python 42 1 Updated Jul 18, 2024

Collection of AWESOME vision-language models for vision tasks

2,040 182 Updated Jul 10, 2024

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 10,947 975 Updated Jul 23, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,378 433 Updated May 3, 2024

A Framework of Small-scale Large Multimodal Models

Python 507 44 Updated Jul 21, 2024

Code for the MedRAG toolkit

Python 130 24 Updated Jun 20, 2024

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Python 8,644 1,319 Updated Jul 21, 2024

Free ChatGPT 3.5 API for Javascript.

JavaScript 62 14 Updated Apr 25, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,105 277 Updated May 4, 2024

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Python 8,143 818 Updated Jul 23, 2024
Next