Skip to content

Latest commit

 

History

History
259 lines (167 loc) · 13.1 KB

pm.md

File metadata and controls

259 lines (167 loc) · 13.1 KB

KEOPS for project managers

KEOPS (Keen Evaluation Of Parallel Sentences) provides a complete tool for manual evaluation of parallel sentences.

Table of contents

Your role in KEOPS

The parallel sentences are organized in tasks. A task defines a group of sentences which should be reviewed by an evaluator in a certain way.

As a project manager, your mission is to create tasks, assign them to evaluators and track their progress.

Even if you are a project manager, you can be assigned tasks. For more information about how to evaluate tasks, please refer to the evaluators guide.

Getting started

In order to use KEOPS, you need an invitation. Only another project manager can provide you with one. In addition, the project manager who invited you has to grant project manager privileges to your account.

Sign in

If you already have your project manager account, sign in to use KEOPS. You can do this by clicking on Sign in in the homepage.

Home page

This will take you to the Sign in screen, where you just have to fill the fields with your email and password.

Sign in screen

Sign up

When you are invited to KEOPS, you will get an invitation link. If you follow that link, you will be able to sign up.

Fill the required fields and click on Sign up. This will create your evaluator account. From now on, you can sign in using your email and password.

Instead of an invitation link, you could get an invitation token. This token will look like this:

5d4404fb9f66a

On the Sign up screen, fill the Access Token field with this token.

Sign up screen

Managing tasks and projects

Projects group tasks which are somehow related. A task must always belong to a project, so projects may contain only one task. When creating a project, you will be asked for a name and a description. Use this information to correctly identify your projects.

Projects screen

Tasks group sentences which should be evaluated in the same way. You can create tasks once you create the project they belong to. When creating a task, you will be asked for:

  • Mode: the way this task should be evaluated. Refer to the modes section for more information.
  • Source language: the language of the source texts. For certain kinds of evaluation, this will not be required.
  • Target language: the language of the target texts.
  • Evaluator: the evaluator who should evaluate this set of sentences. Only evaluators who have knowledge of both source and target languages can be chosen.
  • Corpus: the set of sentences to evaluate. Refer to managing corpora for more information.

Tasks screen

If you click on Recap all on the Projects screen, you get an overview of all your current projects and their tasks.

Projects overview screen

Evaluation modes

A task has always an evaluation mode. This will change the way evaluators review the sentences.

Validation

Validation screen

This mode is used to classify pairs of sentences using the European Language Resource Coordination (ELRC) validation guidelines. These guidelines define a set of labels to classify sentences:

  1. Wrong language identification
  2. Incorrect alignment
  3. Wrong tokenization
  4. MT translation
  5. Translation error
  6. Free translation
  7. Valid translation

You may refer to the ELRC Validation Guidelines document for more information about these labels.

If a label is not selected, the sentence will be pending.

When selecting some of the labels, additional information may be asked:

  • Wrong language identification: whether the source, the target or both texts are wrong identified.
  • MT Translation: whether the source, the target or both texts are MT-translated.
  • Free translation: whether the translation should be kept, even though it is a free one.

Please note that this additional information is completely optional to provide.

Moreover, KEOPS can be also used to flag sentences which contain personal data or inappropriate language. This is done by using the checkboxes on the bottom right of the screen.

  • A pair of sentences contains personal data when the content includes proper names or other personal data that could be anonymised for data protection purposes.
  • A pair of sentences contains inappropriate language when it includes profane expressions.

Adequacy

Adequacy screen

When evaluating adequacy, evaluators must assess that the candidate translation expresses the meaning of the source text in an adequate way. This is, that the candidate translation accurately captures the meaning of the source text.

The source text is presented to evaluators on the left of the screen and, the candidate translation, on the right.

Detective Quality control

Adequacy mode adds quality control sentences to the corpus. These sentences can be previewed at any time, and their creation is based on the research of Cambridge Core.

Once the evaluator is done with the task, a score is computed assessing their performance on the control sentences. This score may be mediocre even for good evaluators, since the used metrics are strict. We do not recommend to ban an evaluator because of this score as the only reason. The score may be used as an illustrative measure of the performance of the user on that task, but project managers should always corroborate the score.

Fluency

Fluency screen

When evaluating fluency, evaluators must assess that the presented text is fluent in its language. The sentence to evaluate is shown on the left of the screen.

Ranking

Ranking screen

When performing this kind of evaluation, a source text and a reference translation is presented to the evaluator. Then, evaluators must rank a set of candidate translations. The first position belongs to the best translation and, the last position, to the worst one.

Paraphrasing

Paraphrasing screen

When evaluating paraphrases, a source text and a candidate paraphrase is presented to the evaluator. The evaluator must assess that the candidate is a valid paraphrase of the source text. A bunch of guidelines are available on the evaluating screen, in order to help the evaluator assess the paraphrases.

Managing corpora

A set of sentences is called a corpus. Corpora may have a different structure depending on the evaluation mode they are meant to be used in. KEOPS lets you upload, preview and assign corpora for evaluation.

Uploading corpora

Corpora upload screen

Corpora is always uploaded in TSV format. This format uses one line per record and a tab character to separate fields.

The specific format of the TSV file is explained below for each of the evaluation modes. This information is alsa available on KEOPS clicking on First time uploading corpora. You can also download a template and use it to upload your data:

Please note that all formats contain headers which are necessary in order to upload corpora to KEOPS. If you have a corpus without headers, you can add them using:

echo -e "Header\tAnother header" | cat - corpus-without-headers >
corpus-with-headers

Corpora for validation

Source text Tab Target text
You can contact our customer service department using the form below Tab Puedes ponerte en contacto con nuestro departamento de servicio al cliente mediante el siguiente formulario

You should only include one target text for each source text.

Corpora for adequacy

Source text Tab Candidate translation
You can contact our customer service department using the form below Tab Puedes ponerte en contacto con nuestro departamento de servicio al cliente mediante el siguiente formulario

You should only include one candidate translation for each source text.


⚠️ Due to quality control sentences, uploading corpora for adequacy will result in a corpus 30% bigger than its original size. Learn more about quality control here


Corpora for fluency

Candidate translation
Puedes ponerte en contacto con nuestro departamento de servicio al cliente mediante el siguiente formulario

Corpora for fluency evaluation consist only on one column because they are monolingual tasks.

Corpora for ranking

Source text Tab Reference text Tab Name of system 1 Tab Name of system 2 Tab ...
You can contact our customer service department using the form below Tab Puedes ponerte en contacto con nuestro departamento de servicio al cliente mediante el siguiente formulario Tab Manual de empleo y manutención Tab Manual de empleo y manutención Tab ...

Include as many systems as you want.

Corpora for paraphrasing

Source text Tab Candidate paraphrase
Have the right to choose Tab Have the right to elect

You should only include one candidate paraphrase for each source text.

Managing users

As a project manager, you have de capability of inviting new users to KEOPS. The new users may have evaluator or project manager privileges.

Inviting new users

Invitation screen

Use the Invitations tab on the Management page to invite a user. You only need to provide their email address. Make sure to provide a valid one, since they will be notified via mail about the invitation.

Modifying users

Edit user screen

Once the user accepts the invitation and signs up, you can modify their information:

  • Name: the name of the user.
  • Email: the email address of the user.
  • Languages: the languages the user has knowledge of.
  • Role: whether this user is an evaluator or a project manager.
  • Active: whether the user can sign in.

Managing languages

Languages screen

A comprehensive list of languages is already provided when KEOPS is installed. However, you may add additional languages using the Add language action on the Languages tab of the Managment screen.

When adding a new language, you are asked to type a language code and a language name.

  • The language code must have the form of an ISO 639-1 code.
  • The language name can be any string which describes the language (for example, "Spanish")

Feedback

Feedback screen

When an evaluator finishes a task, they can provide feedback about their experience. As a project manager, you can read this feedback on the Managment screen.

A list of the provided comments is displayed, along with the user who made them. If you click on Contact, you can get in touch with that user.