This tool provides a simple GUI for inspecting PDF files for potential external references, including embedded JavaScript, hyperlinks, and form submission actions. It can also exclude links that contain user-defined keywords.
- GUI-based PDF selection.
- Extract and display embedded JavaScript from PDF files.
- List hyperlinks in PDF files.
- Detect form submission actions in PDFs.
- Exclude links based on user-defined keywords.
- Save results to a text file.
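- Ensure you have Python installed.
- Install the required package using pip. tkinter is part of the Python standard library and cannot be installed with pip; on some Linux distributions it comes from a separate system package such as python3-tk:
    pip install PyMuPDF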
- Run the script:
    python pdf_checker.py
- Click on "Select PDFs" to open the file dialog and select one or multiple PDF files.
- If prompted, enter keywords to exclude from the link checks.
- Review findings in the results window.
- Optionally, save the results to a text file using the "Save Results" button.
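
The keyword exclusion step can be pictured with a small sketch. This is not the code from pdf_checker.py; the function name `filter_links` and the comma-separated keyword input are assumptions made for illustration, and the matching is assumed to be a case-insensitive substring test.

```python
def filter_links(links, exclude_keywords):
    """Return the links that contain none of the exclusion keywords.

    Hypothetical helper: the real script may match differently.
    """
    # Normalize keywords once: trim whitespace, lowercase, drop empty entries.
    keywords = [k.strip().lower() for k in exclude_keywords if k.strip()]
    kept = []
    for link in links:
        lowered = link.lower()
        # Keep the link only if no keyword occurs anywhere in it.
        if not any(k in lowered for k in keywords):
            kept.append(link)
    return kept


if __name__ == "__main__":
    found = [
        "https://example.com/report",
        "https://tracker.example.net/pixel",
        "mailto:someone@example.org",
    ]
    # Keywords as a user might type them (assumed comma-separated).
    user_input = "tracker, mailto"
    print(filter_links(found, user_input.split(",")))
```

With the sample input, only the first link is kept, since the other two contain an excluded keyword. An empty keyword list leaves every link in place.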
Embedded fonts will trigger a false positive if the string "JS" appears in the font object, for example in a subset font name:
Detected embedded JavaScript:
<<
/Type /FontDescriptor
/FontName /JSNCUQ+font351
/FontFamily (font351)
/Flags 32
/FontBBox [ 0 -206 2000 792 ]
/ItalicAngle 0
/Ascent 792
/Descent -206
/CapHeight 792
/StemV 80
/StemH 80
/FontFile2 5108 0 R
>>
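
One possible way to reduce this kind of false positive is to look for `/JS` or `/JavaScript` as a complete PDF name instead of searching for the substring "JS". The sketch below is an assumption about how such a check could be written, not the tool's current detection logic; `looks_like_javascript` is a hypothetical helper that takes the text of a PDF object.

```python
import re

# Match /JS or /JavaScript only when it is a complete PDF name, that is,
# followed by whitespace, a PDF delimiter, or the end of the text.
# A font name such as /JSNCUQ+font351 continues past "JS" and is skipped.
JS_KEY = re.compile(r"/(?:JS|JavaScript)(?=[\s/\[\]()<>{}%]|$)")


def looks_like_javascript(object_source):
    """Return True if the object text contains a /JS or /JavaScript name."""
    return JS_KEY.search(object_source) is not None


if __name__ == "__main__":
    font = "<< /Type /FontDescriptor /FontName /JSNCUQ+font351 /Flags 32 >>"
    action = "<< /S /JavaScript /JS (app.alert('hi');) >>"
    print(looks_like_javascript(font))
    print(looks_like_javascript(action))
```

This still inspects raw object text, so a `/JS` token inside an unrelated string or stream could match; it only removes matches where "JS" is the start of a longer name.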
Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.