diff --git a/evaluations/README.md b/evaluations/README.md index 9b44c4a..7b959d5 100644 --- a/evaluations/README.md +++ b/evaluations/README.md @@ -1,9 +1,9 @@ # Evaluations -| Dataset and Evaluation | Evaluation Metrics |Colab | Python Script | -|------------------------------|--------------------|------|-----------------------------------------------------------| -| RAG over ARAGOG dataset | [ContextRelevance](https://docs.haystack.deepset.ai/docs/contextrelevanceevaluator) , [Faithfulness](https://docs.haystack.deepset.ai/docs/faithfulnessevaluator), [Semantic Answer Similarity](https://docs.haystack.deepset.ai/docs/sasevaluator) |Open In Colab | [evaluation_aragog.py](evaluation_aragog.py) | -| RAG over SQuAD dataset | [ContextRelevance](https://docs.haystack.deepset.ai/docs/contextrelevanceevaluator) , [Faithfulness](https://docs.haystack.deepset.ai/docs/faithfulnessevaluator), [Semantic Answer Similarity](https://docs.haystack.deepset.ai/docs/sasevaluator) |Open In Colab | [evaluation_squad_rag.py](evaluation_squad_rag.py) | -| Extractive QA over SQuAD dataset | [Answer Exact Match](https://docs.haystack.deepset.ai/docs/answerexactmatchevaluator), [DocumentMRR](https://docs.haystack.deepset.ai/docs/documentmrrevaluator), [DocumentMAP](https://docs.haystack.deepset.ai/docs/documentmapevaluator), [DocumentRecall](https://docs.haystack.deepset.ai/docs/documentrecallevaluator), [Semantic Answer Similarity](https://docs.haystack.deepset.ai/docs/sasevaluator) |Open In Colab | [evaluation_squad_extractive_qa.py](evaluation_squad_extractive_qa.py) | +| Dataset | Architecture | Evaluation Metrics | Colab | Python Script | +|---------------|--------------|--------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------| +| ARAGOG | RAG | [ContextRelevance](https://docs.haystack.deepset.ai/docs/contextrelevanceevaluator) , [Faithfulness](https://docs.haystack.deepset.ai/docs/faithfulnessevaluator), [Semantic Answer Similarity](https://docs.haystack.deepset.ai/docs/sasevaluator) | Open In Colab | [evaluation_aragog.py](evaluation_aragog.py) | +| SQuAD | RAG | [ContextRelevance](https://docs.haystack.deepset.ai/docs/contextrelevanceevaluator) , [Faithfulness](https://docs.haystack.deepset.ai/docs/faithfulnessevaluator), [Semantic Answer Similarity](https://docs.haystack.deepset.ai/docs/sasevaluator) | Open In Colab | [evaluation_squad_rag.py](evaluation_squad_rag.py) | +| Extractive QA | SQuAD | [Answer Exact Match](https://docs.haystack.deepset.ai/docs/answerexactmatchevaluator), [DocumentMRR](https://docs.haystack.deepset.ai/docs/documentmrrevaluator), [DocumentMAP](https://docs.haystack.deepset.ai/docs/documentmapevaluator), [DocumentRecall](https://docs.haystack.deepset.ai/docs/documentrecallevaluator), [Semantic Answer Similarity](https://docs.haystack.deepset.ai/docs/sasevaluator) | Open In Colab | [evaluation_squad_extractive_qa.py](evaluation_squad_extractive_qa.py) |