Commit

Update README.md
Mika Hämäläinen committed Sep 30, 2020
1 parent c04a31d commit af5a285
Showing 1 changed file with 15 additions and 1 deletion.
16 changes: 15 additions & 1 deletion README.md
@@ -29,6 +29,15 @@ You can normalize multiple sentences at the same time by running
sents = ["kissa syö karkkii", "jok laulaa tuol puole", "en tiiä oikee et kuka se o", "kyl on hölömöö"]
normalize_sentences(sents)
>> ['kissa syö karkkia', 'joka laulaa tuolla puolen', 'en tiedä oikein että kuka se on', 'kyllä on hölmöä']

### Swedish

You can use the Swedish model by passing *language=swe*

from murre import normalize_sentence

normalize_sentence("int vet ja", language="swe")
>> inte vet jag

## Generate

@@ -61,10 +70,15 @@ Non-standard dialectal data can be a mess to deal with when you are processing y

## Cite

**Normalization**
**Normalization (Finnish)**

Niko Partanen, Mika Hämäläinen, and Khalid Alnajjar. 2019. [Dialect Text Normalization to Normative Standard Finnish](https://www.aclweb.org/anthology/D19-5519/). In *the Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT)*.


**Normalization (Swedish)**

Mika Hämäläinen, Niko Partanen and Khalid Alnajjar. Accepted. Normalization of Different Swedish Dialects Spoken in Finland. In *the Proceedings of the 4th ACM SIGSPATIAL International Workshop on Geospatial Humanities*.

**Dialect generation**

Hämäläinen, M., Partanen, N., Alnajjar, K., Rueter J. & Poibeau T. (2020). [Automatic Dialect Adaptation in Finnish and its Effect on Perceived Creativity](https://researchportal.helsinki.fi/files/144027877/ICCC20_finnish_dialect_normalization.pdf). In Proceedings of the 11th International Conference on Computational Creativity. p. 204-211