Skip to content
View XXares's full-sized avatar

Block or report XXares

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Implementation of OpenAI paper with Simple Noise Scale on Fastai V2

Jupyter Notebook 19 1 Updated Apr 16, 2021

An approximate implementation of the OpenAI paper - An Empirical Model of Large-Batch Training for MNIST

Python 7 Updated Nov 19, 2022

[ICSE 2024 Industry Challenge Track] Official implementation of "ReposVul: A Repository-Level High-Quality Vulnerability Dataset".

Python 38 5 Updated Sep 25, 2024

Longitudinal Evaluation of LLMs via Data Compression

Python 25 Updated May 29, 2024

Codebase for Merging Language Models (ICML 2024)

Python 757 44 Updated May 5, 2024
Python 1,226 166 Updated Oct 15, 2024

RewardBench: the first evaluation tool for reward models.

Python 387 48 Updated Oct 11, 2024

Robust recipes to align language models with human and AI preferences

Python 4,583 397 Updated Oct 7, 2024

Comprehensive toolkit for Reinforcement Learning from Human Feedback (RLHF) training, featuring instruction fine-tuning, reward model training, and support for PPO and DPO algorithms with various c…

Python 114 11 Updated Mar 18, 2024

ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward experts. We released a collection of ModuleFormer-based Languag…

Python 217 14 Updated Apr 10, 2024

Reaching LLaMA2 Performance with 0.1M Dollars

Python 961 79 Updated Jul 23, 2024
Python 22 3 Updated Nov 26, 2022

Retrieval and Retrieval-augmented LLMs

Python 7,138 520 Updated Oct 10, 2024

Infinity is a high-throughput, low-latency REST API for serving text-embeddings, reranking models and clip

Python 1,357 105 Updated Oct 15, 2024

Imitate OpenAI with Local Models

Python 85 9 Updated Aug 27, 2024

[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs

Python 204 13 Updated Apr 22, 2024

An application providing a RESTful API similar to OpenAI Embedding, supporting BERT, SBERT, and CoSENT models for generating text embedding vectors.

Python 1 1 Updated Nov 9, 2023

Code for paper 'Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse'

Python 11 Updated Aug 2, 2024

Datasets, tools, and benchmarks for representation learning of code.

Jupyter Notebook 2,197 386 Updated Jan 31, 2022

Enhacing Code Pre-trained Models by Contrastive Learning

Python 28 8 Updated Mar 8, 2023

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 13,702 1,112 Updated Sep 24, 2024

[ICLR 2024 Spotlight] Code for the paper "Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy"

Python 64 9 Updated Jun 6, 2024
Python 170 12 Updated Oct 14, 2024

Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales

Python 30 1 Updated Jul 17, 2023

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 3,906 410 Updated Oct 15, 2024

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 982 48 Updated Jan 16, 2024

Ongoing research training transformer models at scale

Python 10,267 2,304 Updated Oct 14, 2024

CodeXGLUE

C# 1,534 364 Updated Apr 23, 2024
Next