LLM training code for Databricks foundation models

Python 3,981 525 Updated Sep 27, 2024

On-device AI across mobile, embedded and edge for PyTorch

C++ 1,819 299 Updated Sep 27, 2024

Model components of the Llama Stack APIs

Python 1,628 174 Updated Sep 27, 2024

A curated list of reinforcement learning with human feedback resources (continually updated)

3,275 202 Updated Aug 30, 2024

PyTorch native quantization and sparsity for training and inference

Python 834 100 Updated Sep 27, 2024

PyZMQ: Python bindings for zeromq

Python 3,683 637 Updated Sep 22, 2024

High-resolution models for human tasks.

Python 4,041 211 Updated Sep 25, 2024
Python 5,683 420 Updated Sep 27, 2024

🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash attention v2.

Python 159 26 Updated Sep 26, 2024
Python 7,092 549 Updated Aug 12, 2024

Transformers with Arbitrarily Large Context

Python 619 48 Updated Aug 12, 2024

Ring attention implementation with flash attention

Python 538 41 Updated Sep 20, 2024

🟩⬜ Generates a snake game from a github user contributions graph and output a screen capture as animated svg or gif

TypeScript 4,249 1,048 Updated Jul 6, 2024

Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks

Python 1,081 154 Updated Sep 26, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 2,333 130 Updated Sep 24, 2024

The uncompromising Python code formatter

Python 38,667 2,430 Updated Sep 23, 2024

Pipeline Parallelism for PyTorch

Python 714 86 Updated Aug 21, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 132,650 26,434 Updated Sep 27, 2024

🐟 Python profile viewer

Python 1,354 33 Updated Aug 6, 2024

CUDA Templates for Linear Algebra Subroutines

C++ 5,428 916 Updated Sep 25, 2024

Material for gpu-mode lectures

Jupyter Notebook 2,558 254 Updated Sep 23, 2024

This project aims to share the technical principles behind large models along with hands-on experience.

HTML 9,346 915 Updated Sep 22, 2024

A native PyTorch Library for large model training

Python 2,230 164 Updated Sep 27, 2024

This repository contains the experimental PyTorch native float8 training UX

Python 212 20 Updated Aug 1, 2024

Sampling profiler for Python programs

Rust 12,510 413 Updated Sep 10, 2024

Over 250 terminal color schemes/themes for iTerm/iTerm2. Includes ports to Terminal, Konsole, PuTTY, Xresources, XRDB, Remmina, Termite, XFCE, Tilda, FreeBSD VT, Terminator, Kitty, MobaXterm, LXTer…

Shell 24,653 6,418 Updated Sep 25, 2024

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 12,078 847 Updated Sep 13, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance.

Python 5,617 438 Updated Sep 19, 2024
Python 321 19 Updated Sep 19, 2024

Release for Improved Denoising Diffusion Probabilistic Models

Python 3,158 480 Updated Jul 18, 2024