Skip to content
View Photooon's full-sized avatar
🏠
Studying
🏠
Studying
  • Tsinghua University
  • China

Block or report Photooon

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Minimalistic large language model 3D-parallelism training

Python 1,232 122 Updated Nov 4, 2024
Python 1 Updated Aug 14, 2024
Verilog 1 Updated Mar 4, 2024

A L1D Hardware prefetcher

Python 2 Updated Aug 5, 2024

Contrastive Language-Image Forensic Search allows free text searching through videos using OpenAI's machine learning model CLIP

JavaScript 451 48 Updated Mar 15, 2022

Tools for merging pretrained large language models.

Python 4,812 439 Updated Nov 5, 2024

Masked Structural Growth for 2x Faster Language Model Pre-training

Python 22 2 Updated Apr 28, 2024

一分钟私有部署zerotier-planet服务

Shell 2,448 463 Updated Nov 10, 2024
Jupyter Notebook 129 7 Updated Mar 12, 2024

Official implementation of "A Multi-level Framework for Accelerating Training Transformer Models""

Python 6 Updated Apr 15, 2024

LaTeX Thesis Template for Tsinghua University

TeX 4,595 1,080 Updated Nov 12, 2024

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 11,182 1,621 Updated Oct 26, 2024

Tools to download and cleanup Common Crawl data

Python 971 142 Updated Apr 25, 2023

A series of large language models developed by Baichuan Intelligent Technology

Python 4,090 295 Updated Nov 8, 2024

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 10,736 686 Updated Aug 14, 2024

Minecraft 1.3.2-1.15.2 Vanilla and FML CoreMod Development Tutorial.

167 10 Updated Jun 14, 2022

A 13B large language model developed by Baichuan Intelligent Technology

Python 2,983 236 Updated Sep 6, 2023

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Python 5,672 504 Updated Jul 18, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,549 4,055 Updated Jul 17, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 35,464 4,119 Updated Nov 15, 2024

Inference code for Llama models

Python 56,426 9,569 Updated Aug 18, 2024

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 20,478 4,165 Updated Nov 15, 2024

Caffe: a fast open framework for deep learning.

C++ 34,125 18,681 Updated Jul 31, 2024

Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.

Python 264 49 Updated Mar 31, 2023

DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference

Python 153 23 Updated Mar 25, 2022

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Python 15,543 3,502 Updated Jun 2, 2023

Code for SkipNet: Learning Dynamic Routing in Convolutional Networks (ECCV 2018)

Python 234 48 Updated Apr 11, 2019

A tool for extracting plain text from Wikipedia dumps

Python 3,751 967 Updated May 23, 2024

CodiMD - Realtime collaborative markdown notes on all platforms.

JavaScript 9,323 1,060 Updated Oct 30, 2024

An implementation of k-d tree

C++ 167 48 Updated Apr 5, 2024
Next