how-to-agreement

Prerequisites

The following package is used in the code:

nltk

You can install it by running:

pip install nltk

Quick Start

To calculate agreement scores, one can use nltk.metrics.agreement.

We provide a simple code snippet in agreement.py :

def fleiss_kappa(worker_tags: list[list[str]]) -> float:
    """
    Calculate the Fleiss' Kappa score of several(>=2) workers' annotations on the same sequence.
    Example of worker_tags:
    [
        ['B-POS', 'I-POS', 'O'    ], # worker 1
        ['B-POS', 'O',     'O'    ], # worker 2
        ['B-POS', 'I-POS', 'I-POS'], # worker 3
    ]
    """
    assert len(worker_tags) >= 2

    # Check if all workers give no annotation spans on the sequence.
    is_only_O = True
    for tags in worker_tags:
        if not is_only_O:
            break
        for tag in tags:
            if tag != 'O':
                is_only_O = False
                break
    if is_only_O:
        return 1.0

    # The NLTK implementation.
    data = []
    for worker_idx, tags in enumerate(worker_tags):
        for tag_idx, tag in enumerate(tags):
            data.append((worker_idx, tag_idx, tag))
    task = agreement.AnnotationTask(data=data)
    return task.multi_kappa()

The argument worker_tags: list[list[str]] is a list of annotations from different workers on the same sentence.

For example, 3 workers annotate the sentence I love you. We split the sentence into words like ['I', 'love', 'you']. Each worker assigns a BIO tag on each word in the sentence like ['B-POS', 'I-POS', 'O']. We collect annotations from all workers:

worker_tags = [
    ['B-POS', 'I-POS', 'O'    ], # worker 1
    ['B-POS', 'O',     'O'    ], # worker 2
    ['B-POS', 'I-POS', 'I-POS'], # worker 3
]

Now, just call the function and get the fleiss' kappa score calculated.

fleiss_kappa(worker_tags) # 0.3999999999999999

Different Agreements

Cohen's Kappa: This kappa is initially designed for two annotators. For multiple (>= 3) annotators, it is pairwise calculated and averaged.
```
AnnotationTask.kappa()
```
Fleiss' Kappa: By default, you should use this kappa for the multi-annotator situation.
```
AnnotationTask.multi_kappa()
```
Krippendorff's Alpha: This is another implementation of multi-annotator agreement. The value of this agreement is very close to Fleiss' Kappa.
```
AnnotationTask.alpha()
```

How to deal with nested spans (e.g. trees)?

We provide one possible solution:

Extract all subsequences from the original sequence.
Concatenate the subsequences into a long sequence.
Copy annotation spans to corresponding subsequences in the long sequence.
Calculate agreement with long annotation sequences.

Following is an example:

The original sequence is

['I', 'love', 'you']

We extract all subsequences and concatenate them into

[
  'I', 'love', 'you', # length = 3 
  'I', 'love',        # length = 2
  'love', 'you',      # length = 2
  'I',                # length = 1
  'love',             # length = 1
  'you'               # length = 1
]

An annotator gives a syntax tree annotation like

We can translate it into nested spans:

[(0, 3, 'S'), (0, 1, 'NP'), (1, 3, 'VP'), (1, 2, 'V'), (2, 3, 'NP')]

Then overwrite the concatenated sequence with these spans:

[
  'B-S', 'I-S', 'I-S', # ['I', 'love', 'you'] is 'S'
  'O', 'O',            #        ['I', 'love'] is not annotated 
  'B-VP', 'I-VP',      #      ['love', 'you'] is 'VP'
  'B-NP',              #                ['I'] is 'NP'
  'B-V',               #             ['love'] is 'V'
  'B-NP'               #              ['you'] is 'NP'
]
# We use BIO tags here.

Now, we can calculate Fleiss' Kappa of such concatenated annotation sequences from different annotators.

What if nested spans completely overlap?

Consider the following syntax tree:

We may translate this annotation of syntax tree into:

[
  ['B-S',  'I-S',  'I-S' ], # S
  ['B-NP', 'O',    'B-NP'], # NP
  ['O',    'B-VP', 'I-VP'], # VP
  ['B-N',  'O',    'O'   ], # N
  ['O',    'B-V',  'O'   ], # VP
# ['I',    'love', 'you' ]
]

Then Fleiss' Kappa can be calculated.

Not enough for your experiments?

You can refer to the official document of the nltk.mertics.agreement module.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
img		img
README.md		README.md
agreement.py		agreement.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

how-to-agreement

Prerequisites

Quick Start

Different Agreements

How to deal with nested spans (e.g. trees)?

What if nested spans completely overlap?

Not enough for your experiments?

About

Releases

Packages

Languages

Liskie/how-to-agreement

Folders and files

Latest commit

History

Repository files navigation

how-to-agreement

Prerequisites

Quick Start

Different Agreements

How to deal with nested spans (e.g. trees)?

What if nested spans completely overlap?

Not enough for your experiments?

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages