Stars
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4-bit quantization, LoRA and LLaMA-Adapter fine-tuning, and pre-training. Apache 2.0-licensed.
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Ongoing research training transformer models at scale
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine
ReelsMaker is a Python-based Streamlit application designed to create captivating faceless videos for social media platforms like TikTok and YouTube.
ROSA 🤖 is an AI Agent designed to interact with ROS1- and ROS2-based robotics systems using natural language queries. ROSA helps robot developers inspect, diagnose, understand, and operate robots.
This repository shares recipes on building Streamlit apps with various tools.
Human-AI collaboration to produce a news story about a meeting from its minutes or transcript
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
React UI + elegant infrastructure for AI Copilots, in-app AI agents, AI chatbots, and AI-powered Textareas 🪁
AI-Driven Research Assistant: An advanced multi-agent system for automating complex research processes. Leveraging LangChain, OpenAI GPT, and LangGraph, this tool streamlines hypothesis generation,…
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
This repository provides an advanced Retrieval-Augmented Generation (RAG) solution for complex question answering. It uses a sophisticated graph-based algorithm to handle these tasks.
Build resilient language agents as graphs.
Parse simple SQL statements into an abstract syntax tree (AST) with the visited tableList and convert it back to SQL
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
Daily updated LLM papers. Subscriptions welcome 👏 If you like it, give it a star 🌟
Work with remote image registries: retrieving information and images, and signing content
Official inference repo for FLUX.1 models
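LangChat: Java LLM/AI project supporting multiple AI providers (OpenAI / Gemini / Ollama / Azure / Zhipu / Alibaba Tongyi / Baidu Qianfan). An AI large-model product solution for the Java ecosystem, for quickly building enterprise-grade AI knowledge bases and AI chatbot applications.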
AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents