Skip to content
View lbux's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report lbux

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Faster Pytorch bitsandbytes 4bit fp4 nn.Linear ops

Python 17 Updated Mar 16, 2024

The official implementation of the EMNLP 2023 paper LLM-FP4

Python 145 6 Updated Dec 15, 2023

ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.

Python 248 35 Updated Jun 19, 2024

Using Haystack to benchmark different RAG architectures over different datasets

Jupyter Notebook 5 Updated Jun 19, 2024

Accessible large language models via k-bit quantization for PyTorch.

Python 5,681 573 Updated Jun 19, 2024

A styling system for Flutter

Dart 458 23 Updated Jun 19, 2024

Tools for evaluation of RAG Chat Apps using Azure AI Evaluate SDK and OpenAI

Python 157 55 Updated Jun 5, 2024

Generic floating-point types in Python

Python 10 2 Updated Jun 19, 2024

A stand-alone implementation of several NumPy dtype extensions used in machine learning.

C++ 128 19 Updated May 22, 2024

PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware.

Python 88 10 Updated Dec 8, 2023

This repository contains the experimental PyTorch native float8 training UX

Python 182 18 Updated Jun 18, 2024

C library for the emulation of reduced-precision floating point types

C 43 13 Updated Apr 2, 2023

Efficient Retrieval Augmentation and Generation Framework

Python 1,063 91 Updated Jun 4, 2024

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 786 53 Updated Jun 18, 2024

MiniJinja is a powerful but minimal dependency template engine for Rust compatible with Jinja/Jinja2

Rust 1,397 79 Updated Jun 17, 2024

Adaptive floating-point based numerical format for resilient deep learning

Python 14 2 Updated Apr 11, 2022

Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.

Rust 38,385 1,973 Updated Jun 20, 2024

Warp is a modern, Rust-based terminal with AI built in so you and your team can build great software, faster.

20,057 331 Updated May 1, 2024

A multi-platform desktop application to evaluate and compare LLM models, written in Rust and React.

TypeScript 280 15 Updated May 19, 2024

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

Python 2,363 165 Updated Jun 19, 2024
Python 351 40 Updated Jun 15, 2024

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Python 1,023 126 Updated Jun 20, 2024

Vue3 + Pinia 仿抖音,Vue 在移动端的最佳实践 . Imitate TikTok ,Vue Best practices on Mobile

Vue 8,276 2,026 Updated Jun 6, 2024

Transform Windows 11's virtual SDR-in-HDR curve from piecewise sRGB to Gamma 2.2

JavaScript 302 9 Updated Feb 29, 2024

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

TypeScript 6,633 468 Updated Jun 19, 2024

Fact checking baseline combining dense retrieval and textual entailment

Jupyter Notebook 27 3 Updated Jan 6, 2024

Representation Engineering: A Top-Down Approach to AI Transparency

Jupyter Notebook 620 71 Updated May 5, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 18,964 2,429 Updated Jun 17, 2024

Simple 2D renderer for GUIs.

Rust 1 1 Updated Apr 24, 2024
Next