dvsrepo

Organizations

@argilla-io @huggingface

Starred repositories


Agentic components of the Llama Stack APIs

Python 3,090 295 Updated Aug 29, 2024

InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning

185 6 Updated Aug 20, 2023
Jupyter Notebook 251 19 Updated Aug 29, 2024

Aana SDK is a powerful framework for building AI-enabled multimodal applications.

Python 21 2 Updated Aug 29, 2024

A highly efficient library for large scale distributed training

Python 40 22 Updated Aug 28, 2024

data load tool (dlt) is an open source Python library that makes data loading easy 🛠️

Python 2,230 145 Updated Aug 29, 2024

Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on tasks like multi-label classification, named entity recognition,…

Python 113 4 Updated Aug 25, 2024
Python 174 17 Updated Jul 25, 2024

A working repository for experimental pipelines in distilabel

Jupyter Notebook 6 1 Updated Jul 6, 2024

Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute, relative and much more. It contains a list of all the avai…

Jupyter Notebook 48 7 Updated Jul 10, 2024

Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI

Python 221 24 Updated Apr 29, 2024

A very simple news crawler with a funny name

Python 269 74 Updated Aug 28, 2024

awesome synthetic (text) datasets

Jupyter Notebook 200 10 Updated Jun 25, 2024

WorkBench: a Benchmark Dataset for Agents in a Realistic Workplace Setting.

Python 26 4 Updated Jul 23, 2024

The Argilla API python SDK

Python 8 1 Updated Aug 26, 2024
Python 483 44 Updated Aug 15, 2024

Official repository for the paper "ALERT: A Comprehensive Benchmark for Assessing Large Language Models’ Safety through Red Teaming"

Python 24 4 Updated Jun 22, 2024

A simple script for running a mixture of RAG and HF using Ollama and Argilla

Python 4 1 Updated Mar 20, 2024

A basic script for running an ensemble of TinyLlamas on MLX to annotate an Argilla Dataset

Python 7 Updated Aug 1, 2024

Efficient vector database for hundreds of millions of embeddings.

Python 190 9 Updated May 17, 2024

Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024

Python 1,191 98 Updated Aug 22, 2024

A framework for Claude Opus to intelligently orchestrate subagents.

Python 4,088 634 Updated Jul 1, 2024

Recipes to train reward model for RLHF.

Python 578 49 Updated Aug 28, 2024

Official repository for ORPO

Python 397 35 Updated May 31, 2024

real-time, multi-modal, vector embedding pipeline

Python 4 Updated Mar 19, 2024

Let's build better datasets, together!

Jupyter Notebook 192 29 Updated Jul 24, 2024

Repository containing the SPIN experiments on the DIBT 10k ranked prompts

Python 22 Updated Mar 12, 2024

A benchmark to evaluate language models on questions I've previously asked them to solve.

Python 851 62 Updated Aug 1, 2024

A Python native FastAPI server for the Argilla backend.

Python 9 9 Updated Jun 14, 2024