Stars
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
Retrieval and Retrieval-augmented LLMs
pip install nb_log 各种日志handler和自动转化项目的任意print的效果。日志自动彩色炫酷,可点击控制台的日志自动精确跳转到pycharm的文件和行号。文件日志多进程切割安全。在10个最重要方面全方位超过loguru
A cloud-native vector database, storage for next generation AI applications
🦜🔗 Build context-aware reasoning applications
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系[email protected] 版权所有,违权必究 Tan 2018.06
Ultra Fast Deep Lane Detection With Hybrid Anchor Driven Ordinal Classification (TPAMI 2022)
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Hybrid A* Path Planner for the KTH Research Concept Vehicle
Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
Matplotlib styles for scientific plotting
This is the official implementation of Multi-Agent PPO (MAPPO).
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Massively Parallel Deep Reinforcement Learning. 🔥
A Comprehensive Reinforcement Learning Zoo for Simple Usage 🚀
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
Python Multi-Agent Reinforcement Learning framework
An elegant PyTorch deep reinforcement learning library.
A high-performance distributed training framework for Reinforcement Learning
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…
Code accompanying the paper Robust Asymmetric Learning in POMDPs