Skip to content

zealot09/node-crawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

node-crawler

##1. nodejs + phantomjs

###1.1 Basically the crawler allows user to crawl static pure text, links, images...

U can config like this

var config = {
	type: 'static',
	element: 'jquery selector',
	attr: 'content|src|href...'
}

###1.2 The crawler allows user to set simple javascript code which can execute on the crawling page and get the content.

Supported events such as: 'click', 'input'...

U can config like this

var config = {
	type: 'event',
	element: 'jquery selector',
	action: 'click',
	attr: 'content|src|href...'
}

##2. Actions to cheat the target server like change ips, limit the crawler speed...

The requirement is being explored.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published