mikahama/SemFi

SemFi

SemFi is an online tool for browsing semantic data for Finnish. Try SemFi online now!

The Database

The SemFi data has been extracted from The Finnish Internet Parsebank. It has been used in tools such as Poem Machine, which generates Finnish poetry automatically.

⬇️ Download the database (DOI) to your own computer. It is released under CC BY-SA 4.0. © 2015-2017 Mika Hämäläinen

The Contents of the Database

The database contains a noun table and a verb table, each listing syntactically related words together with their frequencies. There is also a frequencies table that contains the frequencies of all word forms in the corpus.

Finally, a verse structure table contains the syntactic structures of Finnish poem verses. The roughly 5000 poems analyzed are the ones released on Wikisource.
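
As a rough illustration of how tables like these could be queried, here is a minimal Python sketch. This README does not describe the storage format or schema, so everything below is an assumption: the use of SQLite, the table names `nouns` and `frequencies`, all column names, the relation labels, and the toy rows (whose counts are invented purely for demonstration and are not taken from SemFi). Check the downloaded data for the real layout before adapting it.

```python
import sqlite3


def build_toy_db():
    """Create an in-memory database with a HYPOTHETICAL SemFi-like schema."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE nouns (
            word TEXT,          -- head noun (assumed column name)
            related_word TEXT,  -- syntactically related word (assumed)
            relation TEXT,      -- syntactic relation label (assumed)
            frequency INTEGER   -- co-occurrence count (assumed)
        );
        CREATE TABLE frequencies (
            word_form TEXT PRIMARY KEY,  -- surface form (assumed)
            frequency INTEGER            -- corpus count (assumed)
        );
        """
    )
    # Toy rows with made-up counts, for illustration only.
    conn.executemany(
        "INSERT INTO nouns VALUES (?, ?, ?, ?)",
        [
            ("kissa", "musta", "amod", 120),
            ("kissa", "nukkua", "nsubj", 80),
            ("koira", "haukkua", "nsubj", 95),
        ],
    )
    conn.executemany(
        "INSERT INTO frequencies VALUES (?, ?)",
        [("kissa", 1000), ("koira", 1500)],
    )
    conn.commit()
    return conn


def related_words(conn, word, limit=10):
    """Return (related_word, relation, frequency) rows, most frequent first."""
    cursor = conn.execute(
        "SELECT related_word, relation, frequency FROM nouns "
        "WHERE word = ? ORDER BY frequency DESC LIMIT ?",
        (word, limit),
    )
    return cursor.fetchall()


conn = build_toy_db()
top = related_words(conn, "kissa")
for related, relation, count in top:
    print(related, relation, count)
```

With the real data, only the connection line and the table and column names should need to change; the parameterized query pattern stays the same.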

✉️ In case of questions, contact me.

Cite

In case you use the data in a scientific project, please consider citing it as follows:

Hämäläinen, Mika (2018). Extracting a Semantic Database with Syntactic Relations for Finnish to Boost Resources for Endangered Uralic Languages. In The Proceedings of Logic and Engineering of Natural Language Semantics 15 (LENLS15).