yuweihao
💭 I may be slow to respond.
  • National University of Singapore
  • Singapore


SGLang is a fast serving framework for large language models and vision language models.

Python 4,616 298 Updated Aug 27, 2024

[Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA models, now implemented with the base model AudioLDM2.

Python 27 Updated Jul 16, 2024

A More Fair and Comprehensive Comparison between KAN and MLP

Jupyter Notebook 112 4 Updated Aug 17, 2024

Vico: Compositional Video Generation as Flow Equalization

Python 44 2 Updated Jul 9, 2024

Reference implementation of Megalodon 7B model

Cuda 501 51 Updated Apr 18, 2024

Speeds up long-context LLM inference by computing attention with approximate, dynamic sparsity, which reduces pre-filling latency by up to 10x on an A100 while maintaining accuracy.

Python 670 22 Updated Aug 13, 2024

Explore the Limits of Omni-modal Pretraining at Scale

Python 78 3 Updated Jun 28, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Python 3,616 381 Updated Aug 23, 2024

Open-source evaluation toolkit for large vision-language models (LVLMs), supporting ~100 VLMs and 30+ benchmarks

Python 912 123 Updated Aug 27, 2024

Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models"

Jupyter Notebook 801 76 Updated Jun 22, 2024

Code for CVPR 2024 Oral "Neural Lineage"

Python 11 1 Updated Jun 18, 2024

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Python 1,273 81 Updated Aug 27, 2024

A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"

1,117 57 Updated Mar 14, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,160 45 Updated Aug 15, 2024

Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching

Python 52 5 Updated Jul 15, 2024

FinRobot: An Open-Source AI Agent Platform for Financial Applications using LLMs 🚀 🚀 🚀

Jupyter Notebook 1,367 195 Updated Aug 23, 2024

An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).

Python 3,776 338 Updated Aug 1, 2024

Receptive field as experts

5 Updated Feb 17, 2023

Official repository of MLLA

Python 159 6 Updated Jul 11, 2024

[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and its Applications

561 33 Updated Aug 23, 2024

Code for paper "Unsegment Anything by Simulating Deformation" (CVPR 2024)

Jupyter Notebook 20 1 Updated May 27, 2024

Experience lightning-fast (~1s) and accurate drag-based image editing

248 11 Updated Jul 17, 2024

Multilingual Medicine: Model, Dataset, Benchmark, Code

Python 152 7 Updated Apr 26, 2024

Adapting LLaMA Decoder to Vision Transformer

Python 25 Updated May 20, 2024

[SIGGRAPH'24] 2D Gaussian Splatting for Geometrically Accurate Radiance Fields

Python 1,811 106 Updated Aug 6, 2024

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 31,243 4,683 Updated Aug 27, 2024

[CVPR 2024] Code release for TransNeXt model

Python 336 16 Updated Jun 13, 2024

⚓️ Sailor: Open Language Models for South-East Asia

Python 93 7 Updated Jul 11, 2024
Python 69 4 Updated May 10, 2024