moby

Corrected versions of the Moby lexical database

The original Moby database is in the public domain and is available at https://icon.shef.ac.uk/Moby/

So far, this repository contains only the pronunciation database, with some corrections.

Summary of changes

Standardisation

Errors

Editorial decisions

Documentation

Remaining questions

Should 'Cipriano' begin with 's' or 'tS' (Italian pronunciation)?
What does /z/ mean? Is it the German or Italian sound? If so, it should be 'ts'.
When should /-/ be used instead of /@/?
A few other issues and inconsistencies concerning non-English words.
Inconsistent use of optional /@/ in some words, e.g. 'operative' and related words.
What about the several symbols used in non-English words that are (a) not described in the original documentation and (b) already encoded by existing symbols? E.g. "S" in some French words is just "s", and "/z/" in German is just "ts". The same applies to many uses of "e" in French words.
"V" is sometimes Spanish β (like "b") but sometimes Dutch ʋ (like "w").
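
Several of these questions come down to finding every entry that uses a given symbol. A minimal Python sketch for that kind of audit, assuming each line of the pronunciation file has the form `headword pronunciation` separated by a single space, and that symbols are either slash-delimited (such as `/z/`) or single characters (such as `V`). The function name and the sample lines are illustrative only, not taken from the database:

```python
import re

def entries_using(lines, symbol):
    """Return (headword, pronunciation) pairs whose pronunciation uses symbol."""
    hits = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        # Assumption: the first space separates headword from pronunciation.
        headword, _, pron = line.partition(" ")
        # Split into slash-delimited tokens and single characters, so that a
        # bare "z" between two slash-delimited tokens is not read as "/z/".
        tokens = re.findall(r"/[^/]+/|[^/]", pron)
        if symbol in tokens:
            hits.append((headword, pron))
    return hits

# Made-up sample lines, for illustration only.
sample = ["alpha '/&/lf/@/", "beta b'/z//@/", "gamma g'/&/m/@/"]
print(entries_using(sample, "/z/"))
```

Tokenising before matching matters here: a plain substring search for `/z/` would also match a pronunciation such as `/@/z/I/`, where the `z` is a bare symbol.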

Other to do

Reduce redundancy of compound words: find all cases where both parts of a compound word can be correctly encoded independently, ensure the single word entries exist, then delete the compound entry.
Swap "A" and "a", so that "a" means IPA /a/ and "A" means IPA upside-down a.
Use "0" instead of "y" and "y" instead of "Y".
Simplify encoding: use j instead of /j/, x instead of /x/, S instead of /S/, Z instead of /Z/, E instead of /E/, I instead of /I/, D instead of /D/, maybe B instead of Spanish V, and get rid of /z/.
Possibly use other sources to cross-check: CMU dictionary (included); Oxford Dictionaries API https://developer.oxforddictionaries.com; Lingorado https://lingorado.com/ipa/; Words API https://www.wordsapi.com.
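
Two of the items above (reducing compound-word redundancy and simplifying the encoding) could be sketched in Python as follows. This is only a sketch under stated assumptions, not the repository's tooling: it assumes entries are held as a `headword -> pronunciation` mapping, that `_` separates the words of a compound in both fields, and that a compound is redundant only when its pronunciation is exactly its parts' pronunciations joined by `_`. The function names and the sample pronunciations are hypothetical.

```python
import re

# Slash-delimited symbols that the to-do list proposes writing without slashes.
BARE = {"j", "x", "S", "Z", "E", "I", "D"}

def simplify(pron):
    """Drop the slashes around symbols in BARE; leave every other token alone."""
    def unwrap(match):
        inner = match.group(1)
        return inner if inner in BARE else match.group(0)
    # re.sub scans left to right, so a bare letter sitting between two
    # slash-delimited tokens (e.g. the S in /@/S/I/) is never mistaken for /S/.
    return re.sub(r"/([^/]+)/", unwrap, pron)

def redundant_compounds(entries):
    """Return compound headwords that are fully derivable from their parts.

    Assumption: "_" separates the words of a compound in both the headword
    and the pronunciation.
    """
    redundant = []
    for headword, pron in entries.items():
        parts = headword.split("_")
        if len(parts) < 2:
            continue  # not a compound
        if all(p in entries for p in parts):
            if "_".join(entries[p] for p in parts) == pron:
                redundant.append(headword)
    return redundant

# Made-up entries, for illustration only.
entries = {
    "ice": "'/aI/s",
    "cream": "kr'/i/m",
    "ice_cream": "'/aI/s_kr'/i/m",
    "hot_dog": "h'/A/t_d'/O/g",  # parts are missing, so it is kept
}
print(redundant_compounds(entries))
print(simplify("/@/S/I/"))
```

The exact-match test is deliberately conservative: a compound whose stress marks differ from those of its parts is not reported, and would need manual review. Likewise, `simplify` leaves `/z/` and "V" untouched, and the question of bare "S" in French words would need settling before `/S/` is unwrapped, since both would otherwise end up as the same symbol.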
