Skip to content
View KarelDO's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report KarelDO

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Python 1,576 347 Updated Jul 22, 2024

Repository for prompt-decoding using LLMs (GPT3.5, GPT4, Vicuna, and Zephyr)

Python 281 36 Updated Aug 1, 2024
Python 412 25 Updated Jul 30, 2024

A machine learning benchmark of in-the-wild distribution shifts, with data loaders, evaluators, and default models.

Python 544 125 Updated Jan 26, 2024

Large Action Model framework to develop AI Web Agents

Python 5,146 445 Updated Aug 1, 2024

A dataset of atomic wikipedia edits containing insertions and deletions of a contiguous chunk of text in a sentence. This dataset contains ~43 million edits across 8 languages.

103 8 Updated May 6, 2019

Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.

Python 20 1 Updated Nov 28, 2023

Reference implementation for DPO (Direct Preference Optimization)

Python 1,918 150 Updated May 23, 2024

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Python 4,644 395 Updated May 9, 2024

Generative Representational Instruction Tuning

Jupyter Notebook 493 34 Updated Jul 24, 2024

In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.

Python 342 19 Updated Feb 13, 2024

Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions

Python 564 52 Updated Jul 29, 2024

Dataset of synthetic job ad sentences tagged with ESCO skills. From the paper Extreme Multi-Label Skill Extraction Training using Large Language Models.

2 Updated Jan 11, 2024

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 651 36 Updated May 30, 2024

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

Python 7,165 882 Updated Aug 1, 2024

Dataset used to evaluate Skill Extraction systems based on the ESCO skills taxonomy.

11 Updated Jul 18, 2024

SKILLSPAN: Competences as Spans for Skill Extraction from Job Postings

Perl 52 13 Updated Feb 14, 2024

The dataset used to evaluate JobBERT on the task of job title normalization.

21 2 Updated Sep 10, 2022

TFIDF / KNN based string matching

Python 46 12 Updated Apr 6, 2023

State-of-the-art efficient coreference. This repository contains the code for the CRAC-2023 paper "CAW-coref: Conjunction-Aware Word-level Coreference Resolution". Forked from the EMNLP-2021 paper …

Python 7 3 Updated Nov 2, 2023

Inspecting and Editing Knowledge Representations in Language Models

Python 104 5 Updated Jul 24, 2023

BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance.

Jupyter Notebook 39 4 Updated Jan 29, 2024

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 36,871 3,215 Updated May 7, 2024

High-speed download of LLaMA, Facebook's 65B parameter GPT model

Shell 4,167 423 Updated Jun 28, 2023

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 9,769 799 Updated Jun 10, 2024

Home of StarCoder: fine-tuning & inference!

Python 7,220 511 Updated Feb 27, 2024

Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"

Python 1,046 77 Updated Mar 7, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 15,303 1,469 Updated Aug 1, 2024

Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image …

Python 1,805 240 Updated Aug 1, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,213 4,020 Updated Jul 17, 2024
Next