Add .html and respective .html.txt files #127

Merged
wlawt merged 2 commits into master from add-html-test-files
Jul 21, 2020
Conversation

mohammedsahl
Contributor

This PR adds test files for the .html extension.
The generate_html.ps1 script walks every test file in /test_files/docx and mirrors it into /test_files/html.
A .txt file is then generated for each mirrored .html file.

generate_html.ps1 must be run from the repository root as .\generate_html.ps1 .\test_files, in PowerShell 7.0 or higher.
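
For readers who want to see the mirroring layout spelled out, here is a minimal Python sketch of the path mapping described above. It is not the PowerShell script from this PR, and it does no conversion: it only computes where each mirrored .html file and its .html.txt companion would land. The function name `mirror_targets` is hypothetical, and treating only `*.docx` files as sources is an assumption.

```python
from pathlib import Path


def mirror_targets(docx_root: Path, html_root: Path):
    """Yield (source, html_target, txt_target) for each .docx under docx_root.

    Hypothetical helper: it only computes paths. The real generate_html.ps1
    also produces the files, which this sketch does not attempt.
    """
    # Assumption: only files whose name ends in .docx count as sources.
    for src in sorted(docx_root.rglob("*.docx")):
        # Keep the sub-directory layout relative to the docx root.
        rel = src.relative_to(docx_root)
        # foo/bar.docx -> <html_root>/foo/bar.html
        html_target = html_root / rel.with_suffix(".html")
        # The expected-text companion sits next to it: bar.html.txt
        txt_target = html_target.with_name(html_target.name + ".txt")
        yield src, html_target, txt_target
```

Calling `mirror_targets(Path("test_files/docx"), Path("test_files/html"))` would yield one triple per source document, which makes it easy to check that every .html file has a matching .html.txt.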



$AbsPath = @(Get-ChildItem -path $Directory -Recurse -Exclude *.txt, *.skip, *.docx, "*.rtf","*.odt","*.doc")
:main2 for ($i = 0; $i -lt $AbsPath.length; $i++) {
Contributor

Small change, would just change main2 to html_to_txt or something along those lines!

Contributor

wlawt left a comment

Thanks!

wlawt merged commit a71b7d6 into master on Jul 21, 2020
wlawt deleted the add-html-test-files branch on July 21, 2020, 21:45