
CLI-based application to parse jobs published by the Servei d'Ocupació de Catalunya through its website, Feina Activa.

RomeroRodriguezD/Feina-Activa-Job-Scraper

Feina Activa (Servei d'Ocupació de Catalunya) CLI Job Scraper

This is a CLI-based application that parses job postings from Feina Activa, the official job portal of the Generalitat de Catalunya.

Prerequisites

  • The requirements installed (Selenium and BeautifulSoup).
  • The webdriver for your browser downloaded and/or added to PATH. In my case I set it up for Chrome.

Arguments

The command line currently takes up to 4 arguments. Two of them are mandatory (output_path and results), and at least one of the remaining two (topic and province) must also be given.

  • output_path: Path (preferably absolute) where the JSON outputs are stored.
  • results: Integer number of results. Should be a multiple of 20 (each page update increases the number of jobs shown by 20).
  • topic: Keyword(s) to search for. If there is more than one, they should be space-separated and enclosed in quotes.
  • province: Can be omitted, or one of the four Catalan provinces (Barcelona, Girona, Tarragona, Lleida).
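The argument rules above can be sketched with `argparse`. This is a minimal illustration, not the repository's actual code. The flag spellings `--results`, `--topic` and `--province` and the positional `output_path` are inferred from the example command in this README. The helper names `build_parser` and `parse_args` are hypothetical. The check on `results` reads the page-size note as requiring a multiple of 20, which is an assumption.

```python
import argparse

# Province names taken from the README; lowercased for comparison (assumption).
PROVINCES = ("barcelona", "girona", "tarragona", "lleida")
PAGE_SIZE = 20  # the README says each page update adds 20 jobs


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the four documented arguments (hypothetical helper)."""
    parser = argparse.ArgumentParser(description="Feina Activa job scraper (sketch)")
    parser.add_argument("output_path", help="Directory where JSON outputs are stored")
    parser.add_argument("--results", type=int, required=True,
                        help="Number of results, in steps of 20")
    parser.add_argument("--topic", default=None,
                        help="Space-separated keywords, enclosed in quotes")
    # type=str.lower runs before the choices check, so 'Barcelona' is accepted too.
    parser.add_argument("--province", default=None, type=str.lower, choices=PROVINCES)
    return parser


def parse_args(argv=None) -> argparse.Namespace:
    """Parse argv and enforce the rules that argparse cannot express declaratively."""
    parser = build_parser()
    args = parser.parse_args(argv)
    if args.topic is None and args.province is None:
        parser.error("provide at least one of --topic or --province")
    if args.results <= 0 or args.results % PAGE_SIZE != 0:
        parser.error("--results should be a positive multiple of 20")
    return args
```

Calling `parse_args(["out", "--province", "barcelona", "--results", "80"])` returns a namespace with `results == 80`, while omitting both `--topic` and `--province` exits with a usage error.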

Example

python .\main.py C:\feina_activa_parser\searchs --province barcelona --results 80

Command line output:

img.png

JSON file:

img_1.png

Results are indexed starting from 0, as usual, so the 80 results in this example are numbered 0 to 79.
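As a rough illustration of zero-based indexing, the sketch below keys each job by its position and writes the result as JSON. The real layout of the output file is only shown in the screenshot, so the dictionary shape, the `title` field, the `jobs.json` file name and the helper names `index_jobs` and `save_jobs` are all assumptions.

```python
import json
from pathlib import Path


def index_jobs(jobs):
    """Key each job by its zero-based position, as a string (assumed layout)."""
    return {str(i): job for i, job in enumerate(jobs)}


def save_jobs(jobs, output_dir, filename="jobs.json"):
    """Write the indexed jobs to output_dir/filename and return the file path."""
    out = Path(output_dir)
    out.mkdir(parents=True, exist_ok=True)
    target = out / filename
    # ensure_ascii=False keeps Catalan characters such as 'ó' readable in the file.
    target.write_text(
        json.dumps(index_jobs(jobs), ensure_ascii=False, indent=2),
        encoding="utf-8",
    )
    return target


if __name__ == "__main__":
    # Hypothetical placeholder data standing in for 80 scraped jobs.
    sample = [{"title": f"job {n}"} for n in range(80)]
    indexed = index_jobs(sample)
    print(min(map(int, indexed)), max(map(int, indexed)))  # first and last index
```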
