feedme

This is a content aggregator utility for rss and atom feeds over a proxy IT IS VERY MUCH IN AN ALPHA STATE Currently, it is capable of:

Crawling a list of RSS feeds from sources.txt through a proxy (TOR), as well as downloading images
Store posts in a sqlite3 database
Pop a webserver up on localhost:5000
Display posts in a paginated feed

Known Issues:

Deleting a post with images will delete all images

Getting set up:

pip3 install beautifulsoup4
pip3 install requests
pip3 install feedparser

Using the package manager of your choice, install TOR E.G.

apt-get install tor
brew install tor

You may need to run the 2to3 conversion tool for python on the feedparser library (easily googled)

Code Notes:

Basic flow of control:

crawler.py is responsible for all crawling

run tor outside of feedme, and it will connect via port 9050 (tor's default port)
run "python3 run.py" to run the flask server
go to localhost:5000 to see the server
import sources in the top left of the website will import sources from sources.txt
crawl will send an ajax request to crawl all the websites from the imported sources
prune will delete all posts that don't have a source, and all images that don't have a post

Database schema:

each table is a class inherited from database.py each table has a list of columns, and hashable columns instead of an ID, a hash of hashable columns is used (prevents duplicates) insert function is defined in database.py, and automatically sets all columns not pass in the dictionary to False, as well as adds a timestamp and hash init.py is where all the routes are stored, this is basically the controller

TODO:

segregate posts into categories
add live-suggestion for filter
add stop_crawler function
create semaphore for crawler
Detect iframes and handle them so non-tor requests aren't made
rename images when downloaded

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
app		app
flask		flask
README.md		README.md
feedme		feedme
feedme-journal		feedme-journal
font-awesome.min.css		font-awesome.min.css
run.py		run.py
sources.txt		sources.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

feedme

Known Issues:

Getting set up:

Code Notes:

Basic flow of control:

Database schema:

TODO:

About

Releases

Packages

Languages

m1lky/feedme

Folders and files

Latest commit

History

Repository files navigation

feedme

Known Issues:

Getting set up:

Code Notes:

Basic flow of control:

Database schema:

TODO:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages