Stars
Puzzles for learning Triton, play it with minimal environment configuration!
The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"
O1 Replication Journey: A Strategic Progress Report – Part I
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Entropy Based Sampling and Parallel CoT Decoding
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of differen…
FlagGems is an operator library for large language models implemented in Triton Language.
[NeurIPS'24 Spotlight] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filling on an A100 whil…
GoMate:RAG Framework within Reliable input,Trusted output
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophisticated proprietary systems like the GPT-4 Code Interpreter. …
A new markup-based typesetting system that is powerful and easy to learn.
SGLang is a fast serving framework for large language models and vision language models.
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.