Block or Report
Block or report onexixi
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLists (2)
Sort Name ascending (A-Z)
Stars
Language
Sort by: Recently starred
The code used to train and run inference with the ColPali architecture.
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense embedding, sparse embedding, tensor, and full-text
Attribute (or cite) statements generated by LLMs back to in-context information.
ELISA (Emacs Lisp Information System Assistant) is a system designed to provide informative answers to user queries by leveraging a Retrieval Augmented Generation (RAG) approach.
Distributed LLM inference for mobile, desktop and server.
EfficientQAT: Efficient Quantization-Aware Training for Large Language Models
Copy a bunch of files into your clipboard to provide context for LLMs
OpenAI 接口接入适配,支持千帆大模型平台、讯飞星火大模型、腾讯混元以及MiniMax、Deep-Seek,等兼容OpenAI接口,仅单可执行文件,配置超级简单,一键部署,开箱即用. Seamlessly integrate with OpenAI and compatible APIs using a single executable for quick setup and depl…
A Comprehensive Toolkit for High-Quality PDF Content Extraction
SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.
Modified Beam Search with periodical restart
搜索、推荐、广告、用增等工业界实践文章收集(来源:知乎、Datafuntalk、技术公众号)
Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.
GraphRAG using Local LLMs with advanced Gradio UI
中文Mixtral-8x7B(Chinese-Mixtral-8x7B)
Turn any webpage into structured data using LLMs
This is a database of 300.000+ symbols containing Equities, ETFs, Funds, Indices, Currencies, Cryptocurrencies and Money Markets.
FinRL: Financial Reinforcement Learning. 🔥
Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vector…
AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models