Skip to content
View kk-machine-learning's full-sized avatar
Block or Report

Block or report kk-machine-learning

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,168 390 Updated May 24, 2024

[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".

Shell 62 4 Updated May 28, 2024

Official implementation for the paper *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*

Jupyter Notebook 41 1 Updated Jul 24, 2024

Expert Specialized Fine-Tuning

Python 105 9 Updated Jul 11, 2024

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,263 115 Updated Jun 13, 2024

Scalable toolkit for efficient model alignment

Python 459 48 Updated Jul 31, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 35,959 4,419 Updated Jul 31, 2024
Python 14 4 Updated Dec 13, 2023

Stack trace visualizer

Perl 16,890 1,929 Updated Jul 14, 2024

A recipe for online RLHF.

Python 344 39 Updated Jun 20, 2024

Monitor and profiler powered by eBPF to monitor network traffic, and diagnose CPU and network performance.

Go 190 37 Updated Jul 30, 2024

APM, Application Performance Monitoring System

Java 23,563 6,475 Updated Jul 30, 2024

字节跳动 APM 团队预备招聘社群,来一起聊聊大厂面试经验、简历如何编写、技术……

110 4 Updated Sep 22, 2021

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python 1,848 175 Updated Jul 31, 2024

This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).

Python 488 62 Updated Sep 26, 2023

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

3,166 119 Updated Jun 26, 2024

CodeBERT

Python 2,099 431 Updated Jul 9, 2023

CodeUp: A Multilingual Code Generation Llama2 Model with Parameter-Efficient Instruction-Tuning on a Single RTX 3090

Jupyter Notebook 113 10 Updated Aug 3, 2023

The Serenity Operating System 🐞

C++ 29,907 3,157 Updated Jul 31, 2024

抄nemu的同学点个star好嘛

C 134 20 Updated Jul 20, 2016

A collection of gdb tips. 100 maybe just mean many here.

Go 2,995 709 Updated Oct 30, 2023

Exploit Development and Reverse Engineering with GDB Made Easy

Python 7,071 859 Updated Jul 30, 2024
Python 14 Updated Jan 21, 2021

Python debugger (debugpy) extension for VS Code.

TypeScript 48 18 Updated Jul 31, 2024

GEF (GDB Enhanced Features) - a modern experience for GDB with advanced debugging capabilities for exploit devs & reverse engineers on Linux

Python 6,708 718 Updated Jul 28, 2024

AutoDev - 🧙‍the AI-powered coding wizard . Put the most loved AutoDev AI assistant into your VSCode, and have things done quickly

TypeScript 198 33 Updated Jul 19, 2024
Python 11 7 Updated Mar 21, 2024

NJU EMUlator, a full system x86/mips32/riscv32/riscv64 emulator for teaching

C 823 174 Updated Jul 15, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 5,787 510 Updated May 31, 2024

An interpreter for RASP as described in the ICML 2021 paper "Thinking Like Transformers"

Python 277 25 Updated Mar 1, 2024
Next