News Emote Analyser

News source headline scraper and analyser for, viewable with news-emote

Development

First time on new computer/clone:

$ git clone https://github.com/ri/news-emote-analyser analyser
$ cd analyser
$ virtualenv .
$ source bin/activate  # enters the project's virtualenv
(analyser) $ pip install -r requirements.txt

Then every time after:

$ source bin/activate  # enters the project's virtualenv
(analyser) $ ...

After installing new dependencies with pip:

(analyser) $ pip install "new-dependency"
(analyser) $ pip freeze > requirements.txt
(analyser) $ git add requirements.txt

Running the scraper:

(analyser) $ python newsemote.py [au|us|all]

You may have to put S3 credentials in ~/.boto as described here.

Deployment

Analyser is deployed on Heroku using the Python and PhantomJS buildpacks.

S3 credentials for storing the resulting data files are in Heroku environment variables and can be viewed with heroku config and changed with heroku config:set.

The Scheduler add-on runs the scraper for each region daily. To run them manually on Heroku, run:

$ heroku run -a news-emote-analyser python newsemote.py [au|us|all]

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
data		data
doc		doc
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
newsemote.py		newsemote.py
nltk.txt		nltk.txt
requirements.txt		requirements.txt
runtime.txt		runtime.txt
scraper.py		scraper.py
textanalyser.py		textanalyser.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

News Emote Analyser

Development

Deployment

About

Releases

Packages

Languages

License

ri/news-emote-analyser

Folders and files

Latest commit

History

Repository files navigation

News Emote Analyser

Development

Deployment

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages