Skip to content

Releases: ETCBC/nestle1904

Clauses, phrases, frames, and subj refs

11 May 13:23
Compare
Choose a tag to compare

There are more edges now: when the xml refers to other parts of the xml by means of ids, we turn that into edges.

And we added phrases and clauses.

Tweaked metadata and some features

10 May 14:11
Compare
Choose a tag to compare
  • Attributes that only ever have the value true get value 1 (int).
  • Generated numbers for books, sentences, words are now all in feature num.
  • Features are not split between the w and wg elements, if they have values for both, they have slightly different meanings
  • The descriptions of the features are a little bit clearer

Good word order

10 May 09:43
Compare
Choose a tag to compare

This version of the data has the words in proper order.
Achieved by a new version of the TF walker converter, which can now reorder slots.

First working dataset

08 May 14:45
Compare
Choose a tag to compare

Data version 0.1.1