Training of FARMReader uses too many and potentially wrong no answer labels due to bug in SquadProcessor #2771

Closed
mathislucka opened this issue Jul 6, 2022 · 3 comments
Labels
P3 (Low priority, leave it in the backlog) · topic:reader · topic:train · type:documentation (Improvements on the docs) · type:feature (New feature or request) · wontfix (This will not be worked on)

Comments

@mathislucka
Member

Describe the bug

When training a QA model with the FARMReader.train() method, the SquadProcessor is used to convert the SQuAD-style JSON file into training samples.

The context for a question might be longer than the model's token limit, so the processor splits the full context into smaller passages. It then checks whether the original answer is present in a passage using its character positions. If the answer is not present in the passage, it automatically uses the sample as a no-answer sample.

The code is here:

if passage_len_t > answer_start_t >= 0 and passage_len_t >= answer_end_t >= 0:
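For illustration, here is a simplified sketch of how that condition ends up producing no-answer samples. The variable names follow the snippet above, but the surrounding logic is an assumption, not FARM's actual SquadProcessor implementation:

```python
# Simplified sketch of the behaviour described above; the surrounding logic is
# illustrative and does not mirror FARM's SquadProcessor internals exactly.
def label_passage(answer_start_t: int, answer_end_t: int, passage_len_t: int):
    """Return the answer's token span relative to the passage, or a no-answer
    span if the labelled answer falls outside this passage window."""
    if passage_len_t > answer_start_t >= 0 and passage_len_t >= answer_end_t >= 0:
        # The labelled answer lies inside this passage: keep it as a positive sample.
        return (answer_start_t, answer_end_t)
    # The labelled answer lies outside this passage: the sample silently becomes
    # a no-answer sample, even if the answer text happens to re-occur in it.
    return (0, 0)  # span pointing at the CLS token, conventionally "no answer"
```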

This creates multiple issues:

  1. the user is not aware of this behaviour
  2. for long documents, there are too many no-answer samples
  3. the answer might be present in the passage, but it was not labeled

Expected behavior

  1. Give the user a parameter max_no_answer_per_context so they can decide how many no-answer samples should be created.
  2. Check whether the actual answer has a string match in that passage, and never use a sample as a no-answer sample if there is a string match/overlap (see the sketch after this list).
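A rough sketch of how these two suggestions could fit together is below. Only the parameter name max_no_answer_per_context comes from this issue; the function, field names, and offsets are hypothetical:

```python
# Rough sketch combining the two suggestions above. Only max_no_answer_per_context
# is proposed in this issue; all other names and structures are hypothetical.
def build_samples(passages, answer_text, answer_start_c, max_no_answer_per_context=1):
    samples, no_answer_count = [], 0
    for passage in passages:
        start = answer_start_c - passage["offset_c"]  # answer start relative to this passage
        if 0 <= start and start + len(answer_text) <= len(passage["text"]):
            # Labelled answer span lies inside this passage: positive sample.
            samples.append({"text": passage["text"], "answer": (start, start + len(answer_text))})
        elif answer_text in passage["text"]:
            # Suggestion 2: the answer string occurs here but was not labelled at
            # this position, so never turn this passage into a no-answer sample.
            continue
        elif no_answer_count < max_no_answer_per_context:
            # Suggestion 1: cap how many no-answer samples a single context produces.
            samples.append({"text": passage["text"], "answer": None})
            no_answer_count += 1
    return samples
```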
@sjrl
Contributor

sjrl commented Aug 3, 2022

Hi @julian-risch, it sounds like we agree that we would like control over how many no-answer labels are created when training a FARMReader model, i.e. implementing something like max_no_answer_per_context. As for the second suggestion, it sounds like we thought it would be best to perhaps print a warning message instead of removing the no-answer sample.
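A minimal sketch of what such a warning could look like; this is not existing Haystack/FARM code, just an illustration of the idea:

```python
import logging

logger = logging.getLogger(__name__)

# Minimal sketch of the warning idea discussed above (not existing Haystack/FARM code):
# warn when the answer string occurs in a passage that is about to become a
# no-answer sample because the labelled character span lies outside it.
def warn_if_answer_unlabelled(passage_text: str, answer_text: str) -> None:
    if answer_text in passage_text:
        logger.warning(
            "Answer '%s' occurs in a passage that will be used as a no-answer sample "
            "because its labelled span lies outside this passage.",
            answer_text,
        )
```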

In regards to evaluation, the issue outlined here does not affect evaluation because it was determined that evaluation does aggregate results per file which is why Michel closed the issue #2622.

@sjrl sjrl added type:feature New feature or request and removed type:bug Something isn't working labels Aug 3, 2022
@sjrl
Contributor

sjrl commented Aug 3, 2022

Additionally, we agreed that adding documentation to the FARMReader training docs that explains how no-answer labels are automatically generated would be very helpful.

@julian-risch
Member

Alright, thank you. I'll tag @brandenchan and @agnieszka-m so that they also learn about the needed documentation update.

@sjrl sjrl added the type:documentation Improvements on the docs label Aug 3, 2022
@masci masci added the P2 Medium priority, add to the next sprint if no P1 available label Nov 24, 2022
@masci masci added P3 Low priority, leave it in the backlog and removed P2 Medium priority, add to the next sprint if no P1 available labels Jan 25, 2023
@masci masci added the wontfix This will not be worked on label Feb 26, 2024
@masci masci closed this as completed Feb 26, 2024