Skip to content

Commit

Permalink
Browse files Browse the repository at this point in the history
  • Loading branch information
bedapudi6788 committed Sep 27, 2019
2 parents 56691ce + 7bd002f commit fbbe756
Showing 1 changed file with 11 additions and 2 deletions.
13 changes: 11 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# DeepSegment: A sentence segmenter that actually works!
# For the original implementation please use the "master" branch of this repo.
Note: For the original implementation please use the "master" branch of this repo.

This implementation is trained on data from various sources. (v1 or the model in master branch is trained only on Tatoeba data).

Expand All @@ -11,6 +11,14 @@ pip install --upgrade deepsegment
# please install tensorflow or tensorflow-gpu separately. Tested with tf and tf-gpu versions 1.8 to 2.0
```

# Supported languages:
en - english (Trained on data from various sources)

fr - french (Only Tatoeba data)

it - italian (Only Tatoeba data)


# Usage:

```
Expand All @@ -22,6 +30,7 @@ segmenter.segment('I am Batman i live in gotham')
```

Training deepsegment on custom data: https://colab.research.google.com/drive/1CjYbdbDHX1UmIyvn7nDW2ClQPnnNeA_m

# To Do:
1. Add a sliding window for processing very long texts. - DONE (segmenter.segment_long(input_sentence))
2. Publish docker tf-serving image and deepsegment-client.

0 comments on commit fbbe756

Please sign in to comment.