Stars
A full-featured, hackable Next.js AI chatbot built by Vercel
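Use free LLM APIs together with your private-domain data to generate SFT training data (entirely free of charge); supports the training data formats of tools such as llamafactory
wangEditor, an open-source Web rich text editor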
Retrieval and Retrieval-augmented LLMs
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
Quantized Attention that achieves speedups of 2.1x and 2.7x compared to FlashAttention2 and xformers, respectively, without losing end-to-end metrics across various models.
The official implementation of 'FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models'
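C-OCR is an OCR project developed in-house by Ctrip, mainly covering recognition of travel-related documents and materials such as ID cards, passports, train tickets, and visas. The project consists of four parts: rejection, detection, recognition, and post-processing.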
A bibliography and survey of the papers surrounding o1
Ongoing research training transformer models at scale
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Layout analysis for Chinese and English documents
A Comprehensive Toolkit for High-Quality PDF Content Extraction
LightRAG: Simple and Fast Retrieval-Augmented Generation
Entropy Based Sampling and Parallel CoT Decoding
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSy…
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)