.NET 6 API for document file format identification, text/metadata/attachment/embedded object/sensitive item (PII/PHI)/entity extraction.
-
Updated
Feb 7, 2024 - C#
.NET 6 API for document file format identification, text/metadata/attachment/embedded object/sensitive item (PII/PHI)/entity extraction.
Case study using dotfurther's Open Discover Platform with the RavenDB document store to rapidly create a full-text search/eDiscovery/information governance capable demonstration application.
Identify a file via MIME type and file signature detection.
A script that downloads the NSRL RDS Modern and feeds the SHA-1 as key to a redis server
PrintTracker is used to identify file formats on files that have lost their extension, it can learn to identify new file extensions by using the learn (-l) and print (-p) commands.
A simple CLI Tool scripted in Python to check for File types based on MIME types and then comparing them with the extensions.
Add a description, image, and links to the file-identification topic page so that developers can more easily learn about it.
To associate your repository with the file-identification topic, visit your repo's landing page and select "manage topics."