Skip to content
View 2sin18's full-sized avatar
  • Alibaba Group
  • Beijing

Block or report 2sin18

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

PyTorch bindings for CUTLASS grouped GEMM.

Cuda 37 34 Updated Aug 21, 2024

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 948 44 Updated Jan 16, 2024
Python 1,145 163 Updated Aug 21, 2024

Mamba SSM architecture

Python 12,243 1,030 Updated Aug 15, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 13,060 1,052 Updated Jul 30, 2024

LLM training code for Databricks foundation models

Python 3,928 516 Updated Aug 24, 2024

Implementation of "SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks"

Python 735 76 Updated May 31, 2024

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 12,133 834 Updated Aug 13, 2024

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 3,513 265 Updated Aug 20, 2024

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 91,143 14,487 Updated Aug 24, 2024

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Python 10,808 1,154 Updated Jun 30, 2023

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 7,558 910 Updated Aug 23, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,287 4,021 Updated Jul 17, 2024

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

1,544 119 Updated Sep 19, 2023

Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI

Python 1,936 124 Updated Jul 22, 2024

Visualizer for neural network, deep learning and machine learning models

JavaScript 27,219 2,729 Updated Aug 23, 2024

Library for reading and writing large multi-dimensional arrays.

C++ 1,333 119 Updated Aug 20, 2024

NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.

Python 1,031 144 Updated Aug 9, 2024

I.Ming ( I.明體 / 一点明朝体 / 一點明體 )

890 39 Updated Apr 24, 2024

Noto CJK fonts

Shell 2,918 214 Updated Aug 19, 2024

cuDF - GPU DataFrame Library

C++ 8,166 879 Updated Aug 24, 2024

PerfKit Benchmarker (PKB) contains a set of benchmarks to measure and compare cloud offerings. The benchmarks use default settings to reflect what most users will see. PerfKit Benchmarker is licens…

Python 1,892 479 Updated Aug 23, 2024

Provides a set of benchmarks that can be used to measure the memory bandwidth performance of CPU's

C 76 10 Updated Apr 8, 2024

Large batch training of CTR models based on DeepCTR with CowClip.

Python 161 24 Updated Feb 8, 2023
Python 1,116 400 Updated Mar 6, 2019

A tensor-aware point-to-point communication primitive for machine learning

C++ 247 76 Updated Dec 17, 2022

An industrial deep learning framework for high-dimension sparse data

PureBasic 4,244 1,029 Updated Dec 8, 2022

Development repository for the Triton language and compiler

C++ 12,315 1,488 Updated Aug 24, 2024
Next