Skip to content
View iojw's full-sized avatar

Highlights

  • Pro

Organizations

@MovingBlocks @Terasology @KryptonChicken @web-at-berkeley

Block or report iojw

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Long context evaluation for large language models

Python 172 14 Updated Sep 26, 2024

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!

Python 2,937 225 Updated Aug 10, 2024

Python library for accurately querying username and email usage on online platforms

Python 1,471 185 Updated Mar 20, 2024

OpenAI-Compatible RESTful APIs for Amazon Bedrock

Python 221 45 Updated Aug 18, 2024

Tutorial for building LLM router

Python 146 13 Updated Jul 19, 2024

LLM101n: Let's build a Storyteller

28,911 1,582 Updated Aug 1, 2024
JavaScript 49 20 Updated Sep 22, 2024

CoreNet: A library for training deep neural networks

Python 6,937 539 Updated May 28, 2024

Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper

Python 121 11 Updated Jul 20, 2024
Python 1,562 137 Updated Sep 12, 2024

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 1,658 200 Updated Sep 21, 2024

Arena-Hard-Auto: An automatic LLM benchmark.

Jupyter Notebook 425 57 Updated Sep 4, 2024

Python bindings for FFmpeg - with complex filtering support

Python 9,904 885 Updated Aug 4, 2024

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 11,568 1,222 Updated Aug 21, 2024

Grok open release

Python 49,445 8,326 Updated Aug 30, 2024

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Python 1,385 110 Updated Sep 26, 2024

Making large AI models cheaper, faster and more accessible

Python 38,660 4,333 Updated Sep 26, 2024

A unified evaluation framework for large language models

Python 2,392 179 Updated Sep 12, 2024

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,283 1,007 Updated Sep 20, 2024

An extremely fast Python package and project manager, written in Rust.

Rust 21,065 622 Updated Sep 26, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,065 836 Updated Jul 1, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 5,300 380 Updated Sep 25, 2024

Python 3.8+ toolbox for submitting jobs to Slurm

Python 1,261 120 Updated Sep 18, 2024

pdb++, a drop-in replacement for pdb (the Python debugger)

Python 1,298 66 Updated Apr 15, 2024

✨✨Latest Advances on Multimodal Large Language Models

11,903 768 Updated Sep 25, 2024

VizTracer is a low-overhead logging/debugging/profiling tool that can trace and visualize your python code execution.

Python 4,917 368 Updated Aug 29, 2024

A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest AI framework ever.

Python 431 23 Updated Sep 23, 2024

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Python 2,118 209 Updated Sep 26, 2024

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Go 90,937 7,156 Updated Sep 26, 2024

HIP: C++ Heterogeneous-Compute Interface for Portability

C++ 3,703 528 Updated Sep 26, 2024
Next