PyCraig

Written by Stephen Diehl [email protected].

PyCraig is a python library for scraping small amounts of data off of Craigslist.

PyCraig is for personal use only, too many requests to craigslist will get your ip address banned.

All code is released under a MIT license, see LICENSE for details.

Dependencies

PyCraig depends on BeautifulSoup, you can install it with

 pip install BeautifulSoup

It also uses GNU Curl for grabbing web pages. If you are running Linux, BSD, or OS X you probably have this installed.

jellyfish ( https://github.com/sunlightlabs/jellyfish ) is optionally included for doing approximate string matching. It is written in C and is very fast.

To use jellyfish as a local module use:

 cd pycraig/jellyfish
 make

Or install globally with:

 python pycraig/jellyfish/setup.py install

Example

 >>> from pycraig import *

 # Get 3 page of listings for "cars & trucks" for sale "by owner"
 # in the "San Franciso Bay" area
 >>> listings = get_listings(url='sfbay.craigslist.org',
                             cat='cars & trucks - by owner',
                             pages=3)
 
 # Create table with our car listings
 >>> cars = Table()
 >>> extract_rows(listings, cars)
     
 # Show all hondas under $15,000
 >>> for car in cars:
        if car.price < 15000 and 'honda' in car.desc:
            print car.link, car.desc

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
pycraig		pycraig
LICENSE.txt		LICENSE.txt
README.md		README.md
tests.py		tests.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PyCraig

Dependencies

Example

About

Releases

Packages

Languages

License

sdiehl/pycraig

Folders and files

Latest commit

History

Repository files navigation

PyCraig

Dependencies

Example

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages