[WIP] text dictation #6

Open
wants to merge 2 commits into
base: master

@ir-regular (Owner) commented Feb 3, 2018

Resolves #1

Experimenting with a text dictation mode.

  • Find a sample text with a permissive license
  • Split text into sentences
  • ...but don't just split using a regexp on punctuation
  • Split sentences into phrases...
  • ...but don't just split by space
  • What are the licenses of models from nltk_data?
  • Package manager 101: install nltk and nltk_data
  • Package manager level up: install only the English tokenizer
  • Use click for command line arguments
  • Testing level up: set up unittest
  • Update the docs

…ntence.

Uses a pre-trained nltk model to split the text into sentences.
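
The commit's code is not shown here, so the following is only a sketch of what sentence splitting with a pre-trained nltk model might look like. It assumes nltk's `sent_tokenize` with the English Punkt model; the wrapper name `split_sentences` is hypothetical, and the tokenizer data has to be installed separately (for example with `nltk.download("punkt")`; the exact resource name depends on the nltk version).

```python
def split_sentences(text, language="english"):
    """Hypothetical wrapper around nltk's pre-trained sentence tokenizer.

    Assumes the Punkt tokenizer data for `language` is already installed;
    nltk raises LookupError if it is missing.
    """
    # Imported lazily so this module still loads when nltk is not installed.
    from nltk.tokenize import sent_tokenize

    return sent_tokenize(text, language=language)
```

A call such as `split_sentences("Dr. Watson arrived. He was not amused.")` returns a list of sentence strings. Unlike a regexp on punctuation, the Punkt model was trained to recognise common abbreviations, though it can still mis-split unusual text.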