Skip to content
View XkunW's full-sized avatar

Block or report XkunW

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

LLM training in simple, raw C/CUDA

Cuda 24,142 2,704 Updated Oct 2, 2024

Material for gpu-mode lectures

Jupyter Notebook 2,780 274 Updated Oct 21, 2024

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 7,854 1,086 Updated Oct 21, 2024

Inspect: A framework for large language model evaluations

Python 588 106 Updated Oct 21, 2024

Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.

Python 776 91 Updated Sep 24, 2024

LLM101n: Let's build a Storyteller

29,424 1,609 Updated Aug 1, 2024

Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models

Inform 7 44 1 Updated Jun 7, 2024

METR Task Standard

TypeScript 117 28 Updated Sep 27, 2024

[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues?

Python 1,848 318 Updated Oct 17, 2024

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Python 695 36 Updated Sep 24, 2024

Machine Learning Engineering Open Book

Python 11,449 690 Updated Oct 21, 2024

The LLM Evaluation Framework

Python 3,314 258 Updated Oct 20, 2024

Efficient LLM inference on Slurm clusters using vLLM.

Python 33 6 Updated Oct 16, 2024

Toy autograd engine in OCaml with Apple Accelerate backend

OCaml 30 Updated Jul 31, 2024

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…

Python 13,507 1,358 Updated Oct 21, 2024

Ongoing research training transformer models at scale

Python 10,344 2,316 Updated Oct 21, 2024

Universal LLM Deployment Engine with ML Compilation

Python 18,994 1,556 Updated Oct 21, 2024

Fast inference engine for Transformer models

C++ 3,339 289 Updated Oct 17, 2024

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 33,510 5,686 Updated Oct 21, 2024

What would you do with 1000 H100s...

Jupyter Notebook 892 52 Updated Jan 10, 2024

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,347 395 Updated Aug 19, 2024
Jupyter Notebook 153 45 Updated Apr 9, 2021

[CVPR 2020 & 2021 & 2022 & 2023] Agriculture-Vision Dataset, Prize Challenge and Workshop: A joint effort with many great collaborators to bring Agriculture and Computer Vision/AI communities toget…

202 32 Updated Jul 27, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 19,831 2,177 Updated Aug 12, 2024

Minimalistic large language model 3D-parallelism training

Python 1,180 113 Updated Oct 9, 2024

LLM finetuning in resource-constrained environments.

Python 40 8 Updated Jun 24, 2024

Refine high-quality datasets and visual AI models

Python 8,793 555 Updated Oct 21, 2024

GEO-Bench: Toward Foundation Models for Earth Monitoring

Python 85 5 Updated Oct 11, 2024
Next