Stars
Notes, examples, and Python demos for the 2nd edition of the textbook "Machine Learning Refined" (published by Cambridge University Press).
[ICML 2021, Long Talk] Delving into Deep Imbalanced Regression
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
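text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models including Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.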
Python inventory optimization and simulation tools.
An extension of the PyMARL codebase that includes additional algorithms and environment support
HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.
ManimML is a project focused on providing animations and visualizations of common machine learning concepts with the Manim Community Library.
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and can retrieve information dynamically to do so.
[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.
[NeurIPS 2022] Open source code for reusing prior computational work in RL.
A library to generate LaTeX expressions from Python code.
A playbook for systematically maximizing the performance of deep learning models.
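A collection of NLP competition strategy implementations, baselines for various tasks, competition experience posts (current, past, and practice competitions), NLP conference dates, commonly followed media accounts, GPU recommendations, and more. Continuously updated.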
Solving Multi-Echelon Inventory Management Problems with Heuristic-Guided Deep Reinforcement Learning
This is the 4th place winning solution for the Kore 2022 simulation, a coding competition sponsored by Google.
Frontend for https://tx.eth.samczsun.com/
An investment portfolio of stocks is created using Long Short-Term Memory (LSTM) stock price prediction and optimized weights. The performance of this portfolio is better compared to an equally wei…
Numba tutorial for GTC 2018
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Value-Decomposition Networks For Cooperative Multi-Agent Learning
Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021 (spotlight)