🐚 OpenDevin: Code Less, Make More

Python 30,071 3,462 Updated Aug 16, 2024

Library for fast text representation and classification.

HTML 25,793 4,706 Updated Mar 22, 2024

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024).

Python 902 46 Updated Aug 12, 2024

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 9,711 892 Updated Aug 15, 2024

Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.

Go 85,089 6,553 Updated Aug 15, 2024

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 9,380 936 Updated Aug 15, 2024

Just 1 minute of voice data can be used to train a good TTS model (few-shot voice cloning).

Python 31,009 3,551 Updated Aug 15, 2024

👨‍💻 An awesome, curated list of the best code LLMs for research.

834 45 Updated Jun 29, 2024

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batch sizes of 16-32 tokens.

Python 508 36 Updated Aug 15, 2024

Open-source vector similarity search for Postgres

C 11,186 502 Updated Aug 13, 2024

A complement to pgvector for high performance, cost efficient vector search on large workloads.

Rust 797 36 Updated Aug 12, 2024

Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of text-embedding models and frameworks.

Python 1,145 82 Updated Aug 15, 2024

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

Python 4,237 443 Updated Aug 15, 2024

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.

Python 1,553 182 Updated Aug 14, 2024

Use multiple proxies with Scrapy.

Python 731 156 Updated May 20, 2022

Fast and memory-efficient exact attention

Python 12,948 1,166 Updated Aug 15, 2024

Build and run containers leveraging NVIDIA GPUs

Go 2,081 228 Updated Aug 14, 2024

CUDA Python Low-level Bindings

Python 836 66 Updated Aug 15, 2024

An extremely fast Python package installer and resolver, written in Rust.

Rust 15,947 474 Updated Aug 15, 2024

Robust recipes to align language models with human and AI preferences

Python 4,352 374 Updated Aug 15, 2024

The easiest way to use Agentic RAG in any enterprise

TypeScript 3,035 310 Updated Aug 15, 2024

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.

Python 3,406 285 Updated Aug 7, 2024
Python 412 23 Updated Jul 29, 2024

An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).

Python 3,732 330 Updated Aug 1, 2024

Multilingual Sentence & Image Embeddings with BERT

Python 14,654 2,419 Updated Aug 15, 2024

Python dependency injection framework, inspired by Guice

Python 1,279 81 Updated Jul 10, 2024

Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.

TypeScript 25,372 4,330 Updated Aug 15, 2024

The little ASGI framework that shines. 🌟

Python 9,916 885 Updated Aug 12, 2024

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 24,639 5,081 Updated Aug 15, 2024