Skip to content
This repository has been archived by the owner on Jun 2, 2024. It is now read-only.

dotpack/php-boiler-pipe

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PhpBoilerPipe

Project Archived

This project is no longer maintained. Please refer to pforret/pf-article-extractor for further updates and continued development.

Thank you for your support!


Boilerplate Removal and Fulltext Extraction from HTML pages.

Partial implementation of https://github.com/kohlschutter/boilerpipe in PHP. Requires PHP >= 5.4.

Example

# html
$path = "http:https://example.com/some-article.html";
$data = file_get_contents($path);

# code
$ae = new DotPack\PhpBoilerPipe\ArticleExtractor();
echo $ae->getContent($data) . "\n";

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages