mansicer

🎆

coding

sicer mansicer

🎆

coding

Doing machine learning research at @LAMDA-NJU and @LAMDA-RL

46 followers · 36 following

Nanjing University

Achievements

Highlights

Organizations

Block or Report

Block or report mansicer

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

allwefantasy / byzer-llm

Easy, fast, and cheap pretrain,finetune, serving for everyone

Python 208 31 Updated Jul 25, 2024

gkamradt / LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 1,345 140 Updated Jul 19, 2024

open-compass / opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 3,398 357 Updated Jul 26, 2024

QwenLM / Qwen-Agent

Agent framework and applications built upon Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Python 2,757 271 Updated Jul 22, 2024

TradeMaster-NTU / TradeMaster

TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning 🔥 ⚡ 🌈

Jupyter Notebook 1,262 257 Updated Feb 28, 2024

huggingface / lighteval

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

Python 495 60 Updated Jul 25, 2024

EleutherAI / lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python 5,964 1,583 Updated Jul 26, 2024

plandex-ai / plandex

AI driven development in your terminal. Designed for large, real-world tasks.

Go 10,059 704 Updated Jul 22, 2024

LAMDA-RL / ReDA

The implementation of the AAMAS'24 paper "Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation"

Python 3 Updated Mar 14, 2024

liyang619 / COLE-Platform

Overcooked human-AI experiment platform

Python 30 3 Updated Dec 21, 2023

samjia2000 / HSP

This is a repository for Hidden-utility Self-Play.

JavaScript 27 1 Updated Jul 27, 2023

vvbbnn00 / WARP-Clash-API

该项目可以让你通过订阅的方式使用Cloudflare WARP+，自动获取流量。This project enables you to use Cloudflare WARP+ through subscription, automatically acquiring traffic.

Python 8,286 1,130 Updated Jun 25, 2024

yangjianxin1 / Firefly

Firefly: 大模型训练工具，支持训练Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 5,343 482 Updated Jul 16, 2024

hiyouga / LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 27,779 3,415 Updated Jul 26, 2024

AffordableGenerativeAgents / Affordable-Generative-Agents

34 6 Updated May 1, 2024

DefTruth / CUDA-Learn-Notes

🎉CUDA&C++ 笔记 / 大模型手撕CUDA / 技术博客，更新随缘: flash_attn、sgemm、sgemv、warp reduce、block reduce、dot product、elementwise、softmax、layernorm、rmsnorm、hist etc.

Cuda 916 87 Updated Jul 25, 2024

DefTruth / Awesome-LLM-Inference

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

2,065 140 Updated Jul 25, 2024

tatsu-lab / alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook 1,342 204 Updated Jul 26, 2024

rustdesk / rustdesk

An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.

Rust 68,978 7,645 Updated Jul 26, 2024

svc-develop-team / so-vits-svc

SoftVC VITS Singing Voice Conversion

Python 24,905 4,703 Updated Nov 11, 2023

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 29,767 3,439 Updated Jul 26, 2024

facebookresearch / BenchMARL

A collection of MARL benchmarks based on TorchRL

Python 199 23 Updated Jul 26, 2024

danny-avila / LibreChat

Enhanced ChatGPT Clone: Features OpenAI, Assistants API, Azure, Groq, GPT-4 Vision, Mistral, Bing, Anthropic, OpenRouter, Vertex AI, Gemini, AI model switching, message search, langchain, DALL-E-3,…

TypeScript 15,751 2,618 Updated Jul 26, 2024

adamkarvonen / chess_llm_interpretability

Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and representation of player Elo.

Jupyter Notebook 183 12 Updated May 27, 2024

Neph0s / awesome-llm-role-playing-with-persona

Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas

406 21 Updated Jul 8, 2024

InteractiveNLP-Team / RoleLLM-public

RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models

433 14 Updated Apr 27, 2024

donnemartin / system-design-primer

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 265,312 44,928 Updated Jul 15, 2024

microsoft / SmartPlay

SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. SmartPlay is designed to be easy to use, and to support futu…

Python 112 13 Updated Apr 11, 2024

Div99 / IQ-Learn

(NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation

Python 193 31 Updated Nov 28, 2022

THUDM / AgentTuning

AgentTuning: Enabling Generalized Agent Abilities for LLMs

Python 1,298 91 Updated Oct 31, 2023

sicer mansicer

Highlights

Organizations

Block or report mansicer

Starred repositories

cpp20