Skip to content
View sublimationAC's full-sized avatar
🎯
Focusing
🎯
Focusing
  • XDU & USYD
  • Xi'an

Block or report sublimationAC

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Awesome Lists for Tenure-Track Assistant Professors and PhD students. (助理教授/博士生生存指南)

Python 1,450 81 Updated Feb 1, 2024

A quick guide (especially) for trending instruction finetuning datasets

2,625 169 Updated Nov 28, 2023

Supercharge Your Model Training

Python 5,160 419 Updated Nov 13, 2024
Python 158 14 Updated Nov 13, 2023

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Python 558 45 Updated Mar 4, 2024

Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales

Python 30 1 Updated Jul 17, 2023

Transformer training code for sequential tasks

Python 609 60 Updated Sep 14, 2021

Standalone TFRecord reader/writer with PyTorch data loaders

Python 864 107 Updated Aug 20, 2024

An implementation of training for GPT2, supports TPUs

Python 1,422 334 Updated Dec 12, 2022

Open Academic Research on Improving LLaMA to SOTA LLM

Python 1,606 102 Updated Aug 30, 2023

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,376 1,866 Updated Apr 30, 2024
Python 297 22 Updated Apr 6, 2023

Example models using DeepSpeed

Python 6,079 1,036 Updated Nov 7, 2024

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,570 350 Updated Oct 17, 2024

飞桨大模型开发套件,提供大语言模型、跨模态大模型、生物计算大模型等领域的全流程开发工具链。

Python 441 162 Updated May 24, 2024

PaddleSlim is an open-source library for deep model compression and architecture search.

Python 1,562 345 Updated Nov 5, 2024

🎁[ChatGPT4MT] Towards Making the Most of ChatGPT for Machine Translation

Python 72 2 Updated Mar 25, 2024

🎁[ChatGPT4MTevaluation] ErrorAnalysis Prompt for MT Evaluation in ChatGPT

Python 88 3 Updated Jan 15, 2024

🎁[ChatGPT4NLU] A Comparative Study on ChatGPT and Fine-tuned BERT

Python 193 9 Updated Apr 17, 2023

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 8,319 1,482 Updated Nov 14, 2024

Repo for external large-scale work

Python 6,515 725 Updated Apr 27, 2024

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 12,646 861 Updated Nov 9, 2024

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 10,730 686 Updated Aug 14, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 47,600 5,627 Updated Sep 18, 2024
Python 607 64 Updated Aug 20, 2023

A procedural Blender pipeline for photorealistic training image generation

Python 2,826 451 Updated Oct 22, 2024

详细的C/C++编程规范指南,由360质量工程部编著,适用于桌面、服务端及嵌入式软件系统。

2,469 292 Updated Oct 19, 2024

🔮 ChatGPT Desktop Application (Mac, Windows and Linux)

Rust 52,890 5,955 Updated Aug 29, 2024

The official repo for [ECCV'22] "VSA: Learning Varied-Size Window Attention in Vision Transformers"

Python 157 9 Updated Mar 17, 2023
Next