Skip to content
View mansicer's full-sized avatar
🎆
coding
🎆
coding
  • Nanjing University

Highlights

  • Pro

Organizations

@LAMDA-RL
Block or Report

Block or report mansicer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Easy, fast, and cheap pretrain,finetune, serving for everyone

Python 208 31 Updated Jul 25, 2024

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 1,345 140 Updated Jul 19, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 3,398 357 Updated Jul 26, 2024

Agent framework and applications built upon Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Python 2,757 271 Updated Jul 22, 2024

TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning 🔥 ⚡ 🌈

Jupyter Notebook 1,262 257 Updated Feb 28, 2024

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

Python 495 60 Updated Jul 25, 2024

A framework for few-shot evaluation of language models.

Python 5,964 1,583 Updated Jul 26, 2024

AI driven development in your terminal. Designed for large, real-world tasks.

Go 10,059 704 Updated Jul 22, 2024

The implementation of the AAMAS'24 paper "Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation"

Python 3 Updated Mar 14, 2024

Overcooked human-AI experiment platform

Python 30 3 Updated Dec 21, 2023

This is a repository for Hidden-utility Self-Play.

JavaScript 27 1 Updated Jul 27, 2023

该项目可以让你通过订阅的方式使用Cloudflare WARP+,自动获取流量。This project enables you to use Cloudflare WARP+ through subscription, automatically acquiring traffic.

Python 8,286 1,130 Updated Jun 25, 2024

Firefly: 大模型训练工具,支持训练Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 5,343 482 Updated Jul 16, 2024

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 27,779 3,415 Updated Jul 26, 2024

🎉CUDA&C++ 笔记 / 大模型手撕CUDA / 技术博客,更新随缘: flash_attn、sgemm、sgemv、warp reduce、block reduce、dot product、elementwise、softmax、layernorm、rmsnorm、hist etc.

Cuda 916 87 Updated Jul 25, 2024

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

2,065 140 Updated Jul 25, 2024

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook 1,342 204 Updated Jul 26, 2024

An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.

Rust 68,978 7,645 Updated Jul 26, 2024

SoftVC VITS Singing Voice Conversion

Python 24,905 4,703 Updated Nov 11, 2023

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 29,767 3,439 Updated Jul 26, 2024

A collection of MARL benchmarks based on TorchRL

Python 199 23 Updated Jul 26, 2024

Enhanced ChatGPT Clone: Features OpenAI, Assistants API, Azure, Groq, GPT-4 Vision, Mistral, Bing, Anthropic, OpenRouter, Vertex AI, Gemini, AI model switching, message search, langchain, DALL-E-3,…

TypeScript 15,751 2,618 Updated Jul 26, 2024

Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and representation of player Elo.

Jupyter Notebook 183 12 Updated May 27, 2024

Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas

406 21 Updated Jul 8, 2024

RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models

433 14 Updated Apr 27, 2024

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 265,312 44,928 Updated Jul 15, 2024

SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. SmartPlay is designed to be easy to use, and to support futu…

Python 112 13 Updated Apr 11, 2024

(NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation

Python 193 31 Updated Nov 28, 2022

AgentTuning: Enabling Generalized Agent Abilities for LLMs

Python 1,298 91 Updated Oct 31, 2023
Next