Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement Guiraud’s Index #26

Open
dpalmasan opened this issue Apr 11, 2021 · 3 comments
Open

Implement Guiraud’s Index #26

dpalmasan opened this issue Apr 11, 2021 · 3 comments
Labels
enhancement New feature or request good first issue Good for newcomers

Comments

@dpalmasan
Copy link
Owner

dpalmasan commented Apr 11, 2021

This is a lexical diversity measurement that penalizes number of words. It is computed as:

Where is the number of distinct words in the text, and is the total number of words in the text.

Unit tests should be added as well.

Docs should be updated as well, adding the following reference:

@misc{herdan1961problemes,
  title={Probl{\`e}mes et m{\'e}thodes de la statistique linguistique},
  author={Herdan, Gustav},
  year={1961},
  publisher={JSTOR}
}
@dpalmasan dpalmasan added enhancement New feature or request good first issue Good for newcomers labels Apr 11, 2021
@supersonic1999
Copy link
Contributor

Looking into this now; would you suggest adding this as a function to a pre-existing file or creating a separate one?

@dpalmasan
Copy link
Owner Author

Hello! Sure, this should go into src/ttr.py file, as it is a TTR metric. Let me know if you have more questions!

@supersonic1999
Copy link
Contributor

Thanks! I just pushed the code to #49 .

Im going to have to do some more research on github, getting lost if ive pushed that right... cheers!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request good first issue Good for newcomers
Projects
None yet
Development

No branches or pull requests

2 participants