Skip to content

Convert many PDF files to CSV for data mining purposes. Tables remain readable.

Notifications You must be signed in to change notification settings

Kiragroh/PDF-to-CSV_GUI-Converter

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 

Repository files navigation

PDF-to-CSV converter

Convert many PDF files to CSV for data mining purposes. Tables remain readable.

Test Image 6

You can process a whole folder with or without subfolders and can peak (extra Button) beforehand how many PDF-files will be processed.

Here is an example of my recent use-case for this kind of program:

Test Image 6

The conversion is not perfect, but therefore functional for nearly every PDF. You have to modify the code for even better results. PDF formats are too unpredictable for a one-fits-all solution.

My program can be immediately started with the .exe in the 'debug'-folder.

I used this nice repo for development (many thanks to Justabhi96): https://github.com/Justabhi96/Detect_And_Extract_Table_From_Pdf/tree/df6d3aace3012612d437779efd181daa6dd5ef06

About

Convert many PDF files to CSV for data mining purposes. Tables remain readable.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages