digdes-project

This project will recognize main named entities of the doc/docx texts with the help of natasha and pullenti.

usage

usage: main.py [-h] [-e {pullenti,pullenti-wrapper,natasha} | -oo] dir

Extract organizations and money from texts and compare it with xml

positional arguments:

  dir                   directory with directories which contain doc/docx
                        files with xml-files

optional arguments:

  -h, --help            show this help message and exit
  -e {pullenti,pullenti-wrapper,natasha}, --extractor {pullenti,pullenti-wrapper,natasha}
                        name of extractor, default "natasha"
  -oo, --only_organizations
                        extract only organizations

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
.idea		.idea
contracts_stats		contracts_stats
README.md		README.md
doc_info.py		doc_info.py
extract_text.py		extract_text.py
main.py		main.py
ner_natasha.py		ner_natasha.py
ner_pullenti.py		ner_pullenti.py
ner_pullenti_wrapper.py		ner_pullenti_wrapper.py
ner_stuff.py		ner_stuff.py
requirements.txt		requirements.txt
stats.py		stats.py
string_stuff.py		string_stuff.py
xml_parser.py		xml_parser.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

digdes-project

usage

About

Releases

Packages

Languages

Julia-Markelova/digdes-project

Folders and files

Latest commit

History

Repository files navigation

digdes-project

usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages