Nanjing University
Starred repositories
Easy, fast, and cheap pretraining, fine-tuning, and serving for everyone
Simple retrieval from LLMs at various context lengths to measure accuracy
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
Agent framework and applications built upon Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning 🔥 ⚡ 🌈
LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.
A framework for few-shot evaluation of language models.
AI driven development in your terminal. Designed for large, real-world tasks.
The implementation of the AAMAS'24 paper "Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation"
This is a repository for Hidden-utility Self-Play.
This project lets you use Cloudflare WARP+ through a subscription and automatically acquires traffic.
Firefly: a training tool for large models, supporting Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
🎉CUDA & C++ notes / hand-written CUDA kernels for large models / tech blog, updated irregularly: flash_attn, sgemm, sgemv, warp reduce, block reduce, dot product, elementwise, softmax, layernorm, rmsnorm, hist, etc.
📖A curated list of awesome LLM inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.
SoftVC VITS Singing Voice Conversion
One minute of voice data can also be used to train a good TTS model! (few-shot voice cloning)
A collection of MARL benchmarks based on TorchRL
Enhanced ChatGPT Clone: Features OpenAI, Assistants API, Azure, Groq, GPT-4 Vision, Mistral, Bing, Anthropic, OpenRouter, Vertex AI, Gemini, AI model switching, message search, langchain, DALL-E-3,…
Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and representation of player Elo.
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. SmartPlay is designed to be easy to use, and to support futu…
(NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation
AgentTuning: Enabling Generalized Agent Abilities for LLMs