Lists (2)
Sort Name ascending (A-Z)
Stars
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
小红书数据采集、网站图片、视频资源批量下载工具,颜值超高的数据采集工具(批量下载,视频提取,图片,去水印等)Telegram:https://t.me/+ZtLSwuIKTo44MDY1
This is the notes of the way of machine learning study. You may find something useful in it.
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
Codes of Paper "Learning 2D Invariant Affordance Knowledge for 3D Affordance Grounding"
[MM24 Oral] Identity-Driven Multimedia Forgery Detection via Reference Assistance
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A modular RL library to fine-tune language models to human preferences
A large-scale simulation framework for LLM inference
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
A Clash GUI based on tauri. Supports Windows, macOS and Linux.
A nanoGPT pipeline packed in a spreadsheet
Simplifying reinforcement learning for complex game environments
Examples and guides for using the OpenAI API
A curated list of Machine Learning Surveys, Tutorials and Books.
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
A Python tool to synchronize the clipboard (text and image) between macOS and Ubuntu in a local network.
Anthropic's educational courses