-
Visual-Instruction-Tuning Public
Forked from BAAI-DCAI/Visual-Instruction-TuningSVIT: Scaling up Visual Instruction Tuning
Python MIT License UpdatedJun 20, 2024 -
-
-
-
HALOs Public
Forked from ContextualAI/HALOsA library with extensible implementations of DPO, KTO, PPO, and other human-centered loss functions (HALOs).
Python Apache License 2.0 UpdatedJan 31, 2024 -
xtuner-ko Public
Forked from InternLM/xtunerAn efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM, Llama, Baichuan, Qwen, ChatGLM)
Python Apache License 2.0 UpdatedJan 21, 2024 -
-
llm-course-ko Public
Forked from mlabonne/llm-courseCourse to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Jupyter Notebook Apache License 2.0 UpdatedJan 15, 2024 -
octo Public
Forked from octo-models/octoOcto is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
Python MIT License UpdatedJan 8, 2024 -
IDC-Tutorials-ko Public
Forked from ImagingDataCommons/IDC-Tutorials번역
Jupyter Notebook BSD 3-Clause "New" or "Revised" License UpdatedDec 6, 2023 -
-
-
-
-
PaLM-rlhf-pytorch Public
Forked from lucidrains/PaLM-rlhf-pytorchImplementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Python MIT License UpdatedJan 8, 2023 -
circuit_training Public
Forked from google-research/circuit_trainingPython Apache License 2.0 UpdatedNov 2, 2022 -
-
agents Public
Forked from tensorflow/agentsTF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
Python Apache License 2.0 UpdatedOct 10, 2022 -
-
-
-
-
acme Public
Forked from google-deepmind/acmeA library of reinforcement learning components and agents
Python Apache License 2.0 UpdatedAug 21, 2022 -
-
Unity-Robotics-Hub Public
Forked from Unity-Technologies/Unity-Robotics-HubCentral repository for tools, tutorials, resources, and documentation for robotic simulation in Unity.
C# Apache License 2.0 UpdatedFeb 9, 2021 -
Deep-Multi-Agent-Reinforcement-Learning Public
Forked from seolhokim/Deep-Multi-Agent-Reinforcement-Learningdeep multi agent reinforcement learning tutorial book for intermediate
UpdatedFeb 6, 2021 -
examples Public
Forked from tensorflow/examplesTensorFlow examples
Jupyter Notebook Apache License 2.0 UpdatedFeb 5, 2021 -
coding-interview-university Public
Forked from jwasham/coding-interview-universityA complete computer science study plan to become a software engineer.
-
nn Public
Forked from labmlai/annotated_deep_learning_paper_implementations🧠 Minimal implementations of neural network architectures and layers in PyTorch with side-by-side notes
Python MIT License UpdatedJan 10, 2021 -