Skip to content
View KarelDO's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report KarelDO

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Python 1,569 349 Updated Jul 22, 2024

Repository for prompt-decoding using LLMs (GPT3.5, GPT4, Vicuna, and Zephyr)

Python 281 36 Updated Jul 27, 2024
Python 406 25 Updated Jul 26, 2024

A machine learning benchmark of in-the-wild distribution shifts, with data loaders, evaluators, and default models.

Python 544 124 Updated Jan 26, 2024

Large Action Model framework to develop AI Web Agents

Python 5,128 444 Updated Jul 28, 2024

A dataset of atomic wikipedia edits containing insertions and deletions of a contiguous chunk of text in a sentence. This dataset contains ~43 million edits across 8 languages.

103 8 Updated May 6, 2019

Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.

Python 20 1 Updated Nov 28, 2023

Reference implementation for DPO (Direct Preference Optimization)

Python 1,908 147 Updated May 23, 2024

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Python 4,635 395 Updated May 9, 2024

Generative Representational Instruction Tuning

Jupyter Notebook 489 34 Updated Jul 24, 2024

In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.

Python 340 19 Updated Feb 13, 2024

Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions

Python 562 50 Updated Jul 25, 2024

Dataset of synthetic job ad sentences tagged with ESCO skills. From the paper Extreme Multi-Label Skill Extraction Training using Large Language Models.

2 Updated Jan 11, 2024

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 645 36 Updated May 30, 2024

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

Python 7,163 882 Updated Jul 28, 2024

Dataset used to evaluate Skill Extraction systems based on the ESCO skills taxonomy.

11 Updated Jul 18, 2024

SKILLSPAN: Competences as Spans for Skill Extraction from Job Postings

Perl 52 13 Updated Feb 14, 2024

The dataset used to evaluate JobBERT on the task of job title normalization.

21 2 Updated Sep 10, 2022

TFIDF / KNN based string matching

Python 46 12 Updated Apr 6, 2023

State-of-the-art efficient coreference. This repository contains the code for the CRAC-2023 paper "CAW-coref: Conjunction-Aware Word-level Coreference Resolution". Forked from the EMNLP-2021 paper …

Python 7 3 Updated Nov 2, 2023

Inspecting and Editing Knowledge Representations in Language Models

Python 104 5 Updated Jul 24, 2023

BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance.

Jupyter Notebook 39 4 Updated Jan 29, 2024

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 36,868 3,218 Updated May 7, 2024

High-speed download of LLaMA, Facebook's 65B parameter GPT model

Shell 4,166 423 Updated Jun 28, 2023

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 9,753 797 Updated Jun 10, 2024

Home of StarCoder: fine-tuning & inference!

Python 7,215 511 Updated Feb 27, 2024

Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"

Python 1,045 77 Updated Mar 7, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 15,230 1,463 Updated Jul 26, 2024

Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image …

Python 1,799 239 Updated Jul 26, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,203 4,017 Updated Jul 17, 2024
Next