JSAKA

Highly Customizable Word Scrapper, with minimal foot print. Allows crawlers and spiders addition using whatever technique is prefered, be it scrapy, requests and beautiful soup, selnium webdriver, etc for crawling and has a configuration panel to configure some crawlers activities. The web interface allows you to configure keywords that one should be alerted via email once they are detected in the scrapped data.

Deploying using nginx and uwsgi (Centos)

Steps:

Install Nginx:

If you dont have the epel repo in your machine, run:

sudo yum install epel-release

Install nginx web and reverse proxy server:

sudo yum install nginx

Setup pip:

sudo yum install python-pip python-devel gcc

install and create python Virtual Environment

installation:

sudo pip install virtualenv

Creating virtual env with virtualenv tool:

mkdir ~/myproject
cd ~/myproject
virtualenv myprojectenv

The above will create a python virtual env in your project folder created (myprojectenv). The above copies files for your default python interpreter to the virtual environment. If you have multiple instances of python interpreter on your machine, you can specify the version to use as:

virtaulenv --python=/location/to/python/touse myprojectenv
eg: virtualenv --python=/usr/local/bin/python2.7 myprojectenv

Activate and install project dependencies:

Activate the environment as follows:

source myprojectenv/bin/activate

Now copy all sources from JSAKA into the project folder (myprojectenv). The install the projects dependencies usinf the requirements.txt file in JSAKA by running:

pip install -r requirements.txt

Installing uwsgi server

Run:

 pip install uwsgi

Configuring Nginx

add the following directive to nginx conf file:

server {
    listen 8080;
    server_name 0.0.0.0;


    location / {

    include uwsgi_params;
    uwsgi_pass unix:/tmp/jsaka.sock;
    }

    }

By default, our app uses unix socket to connect to nginx which is placed in the default location "/tmp/jsaka.sock" This can be cahnged by editing jsaka.ini configuration file. Also ensure the path to .sock file has enough priveleges for this to be successful.

Run JSAKA with uwsgi

uwsgi --ini jsaka.ini

Run Nginx

type 'nginx' on terminal as root.

Screenshots

Keyword manager

Subscription Manager

Spider setting

Name		Name	Last commit message	Last commit date
Latest commit History 148 Commits
Scrapper		Scrapper
model		model
screenshots		screenshots
services		services
static		static
templates		templates
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
jsaka.ini		jsaka.ini
requirements.txt		requirements.txt
waks.csv		waks.csv
wsgi.py		wsgi.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

JSAKA

Deploying using nginx and uwsgi (Centos)

Install Nginx:

Setup pip:

install and create python Virtual Environment

Activate and install project dependencies:

Installing uwsgi server

Configuring Nginx

Run JSAKA with uwsgi

Run Nginx

Screenshots

About

Releases 1

Packages

Contributors 2

Languages

License

Ndiithi/JSAKA

Folders and files

Latest commit

History

Repository files navigation

JSAKA

Deploying using nginx and uwsgi (Centos)

Install Nginx:

Setup pip:

install and create python Virtual Environment

Activate and install project dependencies:

Installing uwsgi server

Configuring Nginx

Run JSAKA with uwsgi

Run Nginx

Screenshots

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 2

Languages

Packages