Skip to content
View OWU-4f5755's full-sized avatar
  • Houston, TX
  • 16:29 (UTC -05:00)

Block or report OWU-4f5755

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results
Jupyter Notebook 8 1 Updated Nov 29, 2023

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 8,848 763 Updated Aug 19, 2024

Python bindings for the Transformer models implemented in C/C++ using GGML library.

C 1,779 135 Updated Jan 28, 2024

Python bindings for llama.cpp

Python 7,573 908 Updated Aug 28, 2024

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

Python 2,717 214 Updated Sep 30, 2023

Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?

Jupyter Notebook 762 27 Updated May 13, 2024

Structured Text Generation

Python 8,058 404 Updated Aug 23, 2024

LLM inference in C/C++

C++ 64,184 9,184 Updated Aug 28, 2024

LM Studio CLI

TypeScript 1,338 107 Updated Aug 22, 2024

Code for the book Deep Learning with PyTorch by Eli Stevens, Luca Antiga, and Thomas Viehmann.

Jupyter Notebook 4,655 1,978 Updated Jul 25, 2024

PyTorch tutorials.

Jupyter Notebook 8,070 4,001 Updated Aug 28, 2024

An AI search engine inspired by Perplexity

TypeScript 817 104 Updated Jun 24, 2024

Fast and memory-efficient exact attention

Python 13,136 1,186 Updated Aug 28, 2024

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 7,583 916 Updated Aug 28, 2024

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 31,786 2,370 Updated Aug 28, 2024

Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)

Python 57 6 Updated Aug 28, 2024

Retrieval and Retrieval-augmented LLMs

Python 6,611 474 Updated Aug 28, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 16,235 1,517 Updated Aug 28, 2024

LLM101n: Let's build a Storyteller

27,404 1,496 Updated Aug 1, 2024

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…

Jupyter Notebook 11,436 1,619 Updated Aug 28, 2024

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 1 Updated Aug 1, 2024

Bootstrap Kubernetes the hard way. No scripts.

40,330 13,831 Updated Aug 5, 2024

Large Language Model Text Generation Inference

Python 8,655 1,002 Updated Aug 28, 2024

End-to-End LLM Guide

Python 89 8 Updated Jul 2, 2024
Python 446 54 Updated Jul 22, 2024

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 25,199 2,693 Updated Aug 28, 2024

Morpheus Local Agents

Python 13 13 Updated Aug 18, 2024

A framework for few-shot evaluation of language models.

Python 6,246 1,649 Updated Aug 28, 2024

FRP Fork

Go 116 18 Updated Jun 27, 2024
Next