Privacy and copyright.md

Is the data I want to scrape compliant with privacy laws or copyrighted?

This is not legal advice; please consult a lawyer if you're in doubt.

Another aspect to consider before starting a web scraping project is the kind of data we're retrieving.

Personal data or PII

Unless you have the person's explicit consent or another lawful basis, scraping an EU resident's personal data is generally unlawful under GDPR, and this should be enough to make you stop gathering personal data. It's very difficult to know, before scraping, where the person behind the data resides, and in any case other countries have similar rules, which makes scraping personal data too risky to be worthwhile.

This great article by Zyte explains how to stay compliant with GDPR. Keep in mind that GDPR covers only the personal data of people in the EU; other jurisdictions have their own rules.

Copyrighted Data

Unless you're OpenAI, you cannot scrape copyrighted material and hope to win a case in court. So limit your operations to data that is publicly available and factual, not created by someone who can claim it as their own. This also means you shouldn't scrape and store pictures taken by professional photographers: not only artistic pictures, but also pictures made for fashion websites.
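
One way to put both rules into practice is to keep an explicit allowlist of factual, non-personal fields and discard everything else before storing a record. The snippet below is only a minimal sketch of that idea: the field names, the `keep_allowed_fields` helper, and the sample record are all hypothetical, and deciding which fields are actually safe to keep remains a legal question, not a technical one.

```python
# Hypothetical allowlist: only fields we consider factual and non-personal.
# Adapt these names to your own data after checking what you may keep.
ALLOWED_FIELDS = {"product_name", "price", "currency", "availability"}


def keep_allowed_fields(record: dict) -> dict:
    """Return a copy of the record containing only allowlisted fields."""
    return {key: value for key, value in record.items() if key in ALLOWED_FIELDS}


# Hypothetical scraped record mixing factual, personal, and copyrighted data.
scraped = {
    "product_name": "Trail running shoe",
    "price": 89.90,
    "currency": "EUR",
    "reviewer_email": "someone@example.com",      # personal data: dropped
    "image_url": "https://example.com/shoe.jpg",  # possibly copyrighted photo: dropped
}

cleaned = keep_allowed_fields(scraped)
print(cleaned)
```

An allowlist is used here instead of a blocklist so that any new or unexpected field is dropped by default rather than stored by accident.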