wiktionary-parser

Here are 23 public repositories matching this topic...

tatuylonen / wiktextract

Wiktionary dump file parser and multilingual data extractor

multilingual parser lua dictionary extractor templates wikitext scribunto wiktionary wiktionary-parser

Updated Nov 1, 2024
Python

suyashb95 / WiktionaryParser

A Python Wiktionary Parser

python parser mediawiki wiktionary-parser

Updated Jan 12, 2024
Python

gambolputty / wiktionary-de-parser

Extract data from German Wiktionary XML files.

nlp german data-extraction wiktionary german-language wiktionary-parser wiktionary-dump dewiktionary

Updated Jul 29, 2024
Jupyter Notebook

clefourrier / EtymDB

[LREC 2020] EtymDB, an Etymological DataBase (v2.1)

database extract etymology tei wiktionary cognates etymology-data wiktionary-parser lrec2020 borrowings

Updated Jan 4, 2022
Perl

wswu / yawipa

A comprehensive and extensible Wiktionary parsing framework.

multilingual parser dictionary wiktionary wiktionary-parser

Updated Sep 5, 2024
Julia

lenakmeth / Wikinflection

Code for the paper: Wikinflection: Massive semi-supervised generation of multilingual inflectional corpus from Wiktionary (Metheniti and Neumann, 2018)

morphology linguistics inflection computational-linguistics wiktionary wiktionary-parser

Updated Jun 23, 2020
Python

snizio / italian-wiktionary-parser

This repository contains a Python script for parsing an XML dump of the Italian Wiktionary (Wikizionario), the parsed dictionary as a JSON file, and a scraper for ONLI (an Italian database of neologisms) with the scraped data in a CSV file.

nlp italian corpus-linguistics italiano onli wiktionary-parser wiktionary-data neologisms

Updated May 4, 2024
Python

Surkal / WiktionnaireParser

A library for parsing the French Wiktionary

python python3 francais french wiktionary wiktionary-parser

Updated Feb 20, 2022
Python

slowwavesleep / RuWiktionaryParser

Extracts Russian word forms and their segmentation from the Russian Wiktionary

morphology segmentation wiktionary russian-language wiktionary-parser word-forms

Updated May 1, 2021
Python

javalc6 / wikiparser-java

Lightweight wiki parser and renderer written in Java and Lua that converts a Wiktionary XML dump to HTML

java lua mediawiki wikipedia wiktionary multi-language-support wiktionary-parser wiktionary-renderer

Updated Oct 2, 2024
Java

beviah / ezglot

Selected data processing scripts, including a language-agnostic multilingual Wiktionary parser

multilingual dictionary extractor templates pronunciation levenshtein-distance wikitext ipa similarity-measures language-resources wiktionary thesaurus-data wiktionary-parser wiktionary-data wiktionary-tool wiktionary-dataset word-distance

Updated Mar 31, 2024
Python

Vuizur / ruwiktionary-htmldump-parser

Parses the Russian Wiktionary HTML dumps into JSON and generates ereader dictionaries

parser language-learning russian wiktionary wiktionary-parser

Updated Aug 10, 2023
Python

dicc-io / wiktioparse

Wiktionary Parser written in Ruby

ruby parser json wiktionary wiktionary-parser wiktionary-data

Updated Dec 17, 2020
Lua

Vuizur / dewiktionary-htmldump-parser

A scraper which extracts data from the German Wiktionary HTML dump.

german czech wiktionary wiktionary-parser wiktionary-dump

Updated Jul 19, 2022
Python

yuzhoumo / latinator-3000

Web interface for parsing Wiktionary for results in specific languages

dictionary wiktionary-parser

Updated Jan 13, 2022
JavaScript

kurd-cc / kurdish-wiktionary

A parser for the Kurdish Wiktionary (NPM package)

npm-package kurdish wiktionary-parser kurdish-language-processing kurmanji

Updated Oct 17, 2021
JavaScript

cpina / kamus-dictionary

Prototype of an interface to use Wiktionary translations

language translation dictionary wiktionary wiktionary-parser wiktionary-tool

Updated Aug 20, 2023
Python

Erutuon / wiktionary-scripts

wiktionary wiktionary-parser

Updated Apr 11, 2023
Lua

AtilioA / frenchhomophones

🇫🇷 Source code for frenchhomophones website. [inactive]

flask herokuapp mongodb-atlas wiktionary-parser

Updated Feb 3, 2022
Python

vls9 / fetch-kaikki

A simple TypeScript client + types for parsed Wiktionary data from Kaikki.org, parsed with wiktextract

wiktionary wiktionary-parser wiktionary-data kaikki wiktextract

Updated May 29, 2023
TypeScript
