Skip to content

khnlp is a library for advanced Khmer Natural Language Processing in Python. It is developed by a research team of Cambodia Academy of Digital Technology (CADT) which is built on the very latest research, and was designed from day one to be used in real products.

Notifications You must be signed in to change notification settings

IDRI-LAB/Khmer-NLP-Tools

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 

Repository files navigation

Khmer-NLP-Tools

khnlp is a library for advanced Khmer Natural Language Processing in Python. It is developed by a research team of Cambodia Academy of Digital Technology (CADT) which is built on the very latest research, and was designed from day one to be used in real products.

Installation

You can install the khnlp package and its dependencies by following these steps:

1. Clone the Repository

First, clone the repository from GitHub: "git clone https://github.com/IDRI-LAB/Khmer-NLP-Tools.git"

2. Install the Package

To install the khnlp package, use "pip install khnlp"

Usage

Run the script in inference directory.

    1. Khmer Tokenization, execute "python inference_tokenizer.py"
    1. Khmer Romanization, execute "python inference_romanizer.py"

References

  • [2] Chenchen Ding, Vichet Chea, Masao Utiyama, Eiichiro Sumita, Sethsere Sam, Sopheap Seng,Statistical Khmer Name Romanization, 2018
  • [1] Vichet Chea, Ye Kyaw Thu, Chenchen Ding, Masao Utiyama, Andrew Finch, and Eiichiro Sumita. Khmer word segmentation using conditional random fields. Khmer Natural Language Processing, 2015

About

khnlp is a library for advanced Khmer Natural Language Processing in Python. It is developed by a research team of Cambodia Academy of Digital Technology (CADT) which is built on the very latest research, and was designed from day one to be used in real products.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages