OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
Updated Jul 29, 2024 - Python
Python package for the evaluation of odometry and SLAM
End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow
A unified evaluation framework for large language models
UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform root cause analysis on failure cases and give insights on how to resolve them.
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
☁️ 🚀 📊 📈 Evaluating state of the art in AI
Avalanche: an End-to-End Library for Continual Learning based on PyTorch.
(IROS 2020, ECCVW 2020) Official Python Implementation for "3D Multi-Object Tracking: A Baseline and New Evaluation Metrics"
Multi-class confusion matrix library in Python
Evaluation code for various unsupervised automated metrics for Natural Language Generation.
XAI - An eXplainability toolbox for machine learning
FuzzBench - Fuzzer benchmarking as a service.
High-fidelity performance metrics for generative models in PyTorch
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting ~100 VLMs and 30+ benchmarks
SemanticKITTI API for visualizing dataset, processing data, and evaluating results.
Evaluate your LLM's response with Prometheus and GPT4 💯
A General Toolbox for Identifying Object Detection Errors
Chinese medical information processing benchmark CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark
Python implementation of the IOU Tracker