LLMInspector is a comprehensive framework for evaluating the alignment and adversarial robustness of LLM-based enterprise applications.

LLMInspector: Comprehensive Evaluation and Testing for LLM Applications

When deploying Large Language Models (LLMs) in enterprise applications, it's crucial to understand, evaluate, and navigate their capabilities, limitations, and risks. Ensuring alignment with functional and non-functional requirements while maintaining robustness against adversarial queries is paramount.

LLMInspector is a Python package developed to address these challenges. It offers a comprehensive solution for evaluating and testing the alignment and security of LLM-based applications. Tailored for enterprise needs, LLMInspector supports the ethical and effective deployment of powerful language models.

[Flow chart]

Key features:

  • Generation of prompts from the golden dataset by expanding prompts with tag augmentation and paraphrasing.
  • Generation of prompts with various perturbations applied, to test the robustness of the LLM application (a minimal illustration follows this list).
  • Generation of questions and ground truth from documents, which can be used to test RAG-based applications.
  • Evaluation of RAG-based LLM applications using LLM-based evaluation metrics.
  • Evaluation of the LLM application through accuracy-based metrics, sentiment analysis, emotion analysis, PII detection, and readability scores.
  • Adversarial red-team testing using curated datasets to probe for risks and vulnerabilities in LLM applications.
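
To make the robustness item above concrete, here is a minimal, purely illustrative sketch of a typo-style perturbation. The function name and the choice of perturbation are assumptions made for this example; they are not LLMInspector's implementation, which applies its own set of perturbations.

import random

def perturb_with_typos(prompt: str, rate: float = 0.1, seed: int = 0) -> str:
    # Swap adjacent letters at random to simulate typo-style noise.
    # Illustrative only; not part of the LLMInspector API.
    rng = random.Random(seed)
    chars = list(prompt)
    for i in range(len(chars) - 1):
        if chars[i].isalpha() and chars[i + 1].isalpha() and rng.random() < rate:
            chars[i], chars[i + 1] = chars[i + 1], chars[i]
    return "".join(chars)

print(perturb_with_typos("What is the recommended tire pressure for wet roads?"))

The idea behind such perturbations is that a robust application should still return a correct, on-topic answer when the prompt is noisy.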

Installation:

The source code is currently hosted on GitHub at https://github.com/michelin/LLMInspector.

pip install git+https://github.com/michelin/LLMInspector.git
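
A quick sanity check that the installation worked is to import the package's entry point (the same import path used in the Getting Started section below):

from llm_inspector.llminspector import llminspector  # should import without errors
print(llminspector)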

The list of changes to LLMInspector between releases can be found here.

Documentation:

Detailed package and API documentation is available here.

Getting Started

Set up the config.ini and .env files by referring to the documentation here.

from llm_inspector.llminspector import llminspector
import pandas as pd

# Initialize LLMInspector with the config and environment files
obj = llminspector(config_path="<path-to-config.ini>", env_path="<path-to-.env>")

# Run the alignment, adversarial (red-team), RAG-evaluation, and conversation workflows
obj.alignment()
obj.adversarial()
obj.rag_evaluate()
obj.converse()

# Load responses from an Excel file and run the metric-based evaluation
df = pd.read_excel("<path-to-input-excel>")
obj.evaluate(df)
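
The evaluate() call takes a pandas DataFrame loaded from your input file; the exact columns it expects depend on the configured metrics and are described in the documentation. Purely as a hypothetical illustration (these column names are assumptions, not the required schema):

import pandas as pd

# Hypothetical input layout; consult the documentation for the column
# names your configured metrics actually require.
df = pd.DataFrame(
    {
        "question": ["How do I check tire wear?"],
        "answer": ["Use the tread wear indicator bars molded into the tire grooves."],
        "ground_truth": ["Check the tread wear indicators in the grooves of the tire."],
    }
)
obj.evaluate(df)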

License

This project is licensed under the Apache License 2.0. See the LICENSE file for more details.

Authors

  • Sourabh Potnis
  • Ankit Zade
  • Kiran Prasath
  • Arpit Kumar
