Skip to content

Assess the effectiveness of chunking strategies in RAG systems via a custom evaluation framework.

License

Notifications You must be signed in to change notification settings

Leo310/rag-chunking-evaluation

Repository files navigation

RAG Chunking Evaluation

This repository contains code and datasets for evaluating chunking strategies in Retrieval-Augmented Generation (RAG) systems. The project includes various benchmarks, data loaders, and utility functions to facilitate the evaluation process.

Setup

  1. Clone this repository

  2. Create a virtual environment:

    python -m venv venv
    source venv/bin/activate  # On Windows use `venv\Scripts\activate`
  3. Install dependencies:

    pip install -r requirements.txt
  4. Set up environment variables: Copy .env.example to .env and fill in the required values.

Usage

Follow the instructions in the my_benchmark notebook to run the proposed chunking evaluation framework. The specific chunking strategies under evaluation are detailed in the chunking_strategies notebook.

Each step in the evaluation pipeline generates intermediate results, which are saved in the data directory for later review and loading.

The experimental directory includes tests for other benchmarks and evaluation frameworks, such as Ragas, Trulens, and Multi-Hop-RAG.

About

Assess the effectiveness of chunking strategies in RAG systems via a custom evaluation framework.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published