Skip to content
View ray-ng's full-sized avatar

Organizations

@WillFlare

Block or report ray-ng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

5,077 283 Updated Nov 8, 2024

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 5,841 525 Updated Oct 24, 2024

how to learn PyTorch and OneFlow

347 22 Updated Mar 22, 2024

how to optimize some algorithm in cuda.

Cuda 1,576 129 Updated Nov 10, 2024
C++ 1,599 301 Updated Oct 28, 2024

《Machine Learning Systems: Design and Implementation》- Chinese Version

TeX 4,072 434 Updated Apr 13, 2024

《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣

2,293 154 Updated Apr 22, 2024

Curated list of resources on testing distributed systems

HTML 2,490 225 Updated Oct 31, 2024

Repo for external large-scale work

Python 6,515 725 Updated Apr 27, 2024

Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023

Jupyter Notebook 2,569 581 Updated Oct 27, 2024
Python 177 43 Updated Nov 5, 2024

Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry lead…

Python 457 68 Updated Oct 28, 2024
Python 2,678 307 Updated Oct 31, 2024

Named Tensors for Legible Deep Learning in JAX

Python 153 11 Updated Nov 6, 2024

Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/

Python 2,104 141 Updated Oct 31, 2024

JAX-Toolbox

Jupyter Notebook 241 47 Updated Nov 10, 2024

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

Python 1,957 326 Updated Nov 8, 2024

A simple, performant and scalable Jax LLM!

Python 1,527 293 Updated Nov 11, 2024

Two implementations of ZeRO-1 optimizer sharding in JAX

Python 13 Updated Jun 11, 2023

JAX - A curated list of resources https://github.com/google/jax

1,544 132 Updated Jul 10, 2024

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Python 2,404 255 Updated Aug 13, 2024

Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax

Python 516 81 Updated Nov 8, 2024

Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes

Python 239 22 Updated May 12, 2023

Model parallel transformers in JAX and Haiku

Python 6,291 892 Updated Jan 21, 2023

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 29,902 4,516 Updated Nov 11, 2024

Simple, light-weight and easy-to-use asynchronous components

C++ 1,694 252 Updated Nov 7, 2024

A course to build distributed key-value service based on TiKV model

Go 3,441 1,023 Updated Oct 11, 2024

Alluxio, data orchestration for analytics and machine learning in the cloud

Java 6,861 2,938 Updated Nov 7, 2024

Tensor library for machine learning

C++ 11,183 1,032 Updated Nov 8, 2024
Next