This program can take a Capterra category result and extract all of the data from every platform reviewed within that result.
- python
3.6+
- Chrome browser (tested on
73.0.3683.75
) - Chromedriver (install instructions)
- make sure you have all the dependencies installed
- clone this repo
cd
into repo in your terminal- run
pip install -r requirements.txt
- use the Capterra site to select a category. Here is an example search result
- run
./scrape.py <category page path>
. Do not close or otherwise interact with the Chrome windows that automatically opened - once finished, the data will be saved in
scraped_data/data.json
- logs of each run are created in
logs/