Skip to content
View minj001's full-sized avatar
  • The University of Hong Kong
Block or Report

Block or report minj001

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Jupyter Notebook 1 2 Updated Jul 10, 2024

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 3,457 257 Updated Jul 25, 2024

YaFSDP: Yet another Fully Sharded Data Parallel

Python 796 37 Updated Jul 22, 2024

A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery

397 24 Updated Jun 20, 2024

Material for cuda-mode lectures

Jupyter Notebook 1,973 195 Updated Jun 13, 2024

Generative AI on AWS

Jupyter Notebook 401 161 Updated Jun 17, 2024

Video+code lecture on building nanoGPT from scratch

Python 3,125 394 Updated Jul 26, 2024

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

Python 1,683 271 Updated Jul 29, 2024

《跟我一起深度学习》

Python 163 25 Updated Jul 8, 2024
Python 152 13 Updated Jul 8, 2024

KAN for Vision Transformer

Python 161 9 Updated Jun 2, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 11,544 882 Updated May 23, 2024

Python 数据科学加速:Dask、Ray、Xorbits、mpi4py

Jupyter Notebook 27 6 Updated Jul 12, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 13,921 1,257 Updated Jul 28, 2024

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 9,798 1,410 Updated Jul 28, 2024

Materials for the course: AI ML & Analytics

Jupyter Notebook 5 1 Updated Apr 12, 2024

Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper

Python 109 10 Updated Jul 20, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 15,238 1,464 Updated Jul 26, 2024

Official implementation for "iTransformer: Inverted Transformers Are Effective for Time Series Forecasting" (ICLR 2024 Spotlight), https://openreview.net/forum?id=JePfAI8fah

Python 1,001 180 Updated Jul 2, 2024
Python 69 4 Updated Jul 12, 2024

Reference implementation of Megalodon 7B model

Cuda 499 51 Updated Apr 18, 2024

[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and it's Applications

542 30 Updated Jul 22, 2024

基于《cuda编程-基础与实践》(樊哲勇 著)的cuda学习之路。

Cuda 198 46 Updated Jan 15, 2024

Learn CUDA Programming, published by Packt

Cuda 957 224 Updated Dec 30, 2023

《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣

1,998 137 Updated Apr 22, 2024

This repository contains my homework for CS294-158 course presented in UC Berkeley

Jupyter Notebook 2 Updated Sep 19, 2023

Artificial Intelligence Principles and Techniques at Stanford 2023

Python 3 Updated Mar 13, 2024
Jupyter Notebook 1 Updated Jul 23, 2024

本项目旨在分享大模型相关技术原理以及实战经验。

HTML 8,141 796 Updated Jul 28, 2024

Hands-On Graph Neural Networks Using Python, published by Packt

Jupyter Notebook 660 180 Updated May 7, 2024
Next