Skip to content
View semantium's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report semantium

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Open-Sora: Democratizing Efficient Video Production for All

Python 21,011 1,997 Updated Jul 25, 2024

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

Python 1,472 138 Updated Jun 27, 2024

Convert PDF to markdown quickly with high accuracy

Python 15,053 802 Updated Jul 22, 2024

This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.

Python 964 122 Updated Jul 26, 2024

Go ahead and axolotl questions

Python 7,119 776 Updated Jul 31, 2024

Data files of German Decompounder for Apache Lucene / Apache Solr / Elasticsearch

101 17 Updated Sep 13, 2021

Agents Capable of Self-Editing Their Prompts / Python Code

Python 721 27 Updated Mar 15, 2024

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,716 395 Updated Jul 15, 2024

All things prompt engineering

Python 5,284 291 Updated Jun 4, 2024

Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.

Go 82,856 6,330 Updated Aug 1, 2024

Python bindings for llama.cpp

Python 7,347 880 Updated Aug 1, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 24,030 3,459 Updated Aug 1, 2024

Inference code for Mistral and Mixtral hacked up into original Llama implementation

Python 373 40 Updated Dec 9, 2023

Fine-tune mistral-7B on 3090s, a100s, h100s

Python 696 63 Updated Oct 11, 2023

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…

Jupyter Notebook 11,080 1,565 Updated Jul 31, 2024

A Kurtosis package for Python data engineers, deploying a Jupyter notebook along with a configurable set of databases, and a visualization tool (Streamlit)

Starlark 108 3 Updated Dec 4, 2023

Collection of Datasets for Legal Text Processing

77 4 Updated Jun 26, 2023

large language model for mastering data analysis using pandas

Python 44 1 Updated Oct 18, 2023

Open-source observability for your LLM application, based on OpenTelemetry

Python 1,630 130 Updated Aug 1, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 1,918 150 Updated May 23, 2024

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 1,516 173 Updated Jul 31, 2024

Zep: Long-Term Memory for ‍AI Assistants.

Go 2,202 330 Updated Jun 24, 2024

Awesome papers about unifying LLMs and KGs

1,759 132 Updated May 16, 2024

Run any open-source LLMs, such as Llama 3.1, Gemma, as OpenAI compatible API endpoint in the cloud.

Python 9,480 605 Updated Aug 1, 2024

Force-directed graph rendered on HTML5 canvas

JavaScript 1,491 242 Updated Jul 20, 2024

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

Python 5,984 570 Updated Aug 1, 2024

Large Language Model Text Generation Inference

Python 8,504 972 Updated Aug 1, 2024

Tune any FALCON in 4-bit

Python 468 53 Updated Sep 1, 2023
Next