Skip to content

funilrys/blackweb

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

84 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Blackweb es un proyecto que pretende recopilar la mayor cantidad de listas negras públicas de dominios (para bloquear porno, descargas, drogas, malware, spyware, trackers, bots, redes sociales, warez, venta de armas, etc), con el objeto de unificarlas y hacerlas compatibles con Squid (Tested in v3.5.x). Para lograrlo, realizamos una depuración de urls, para evitar duplicados, dominios inválidos (validación de ccTLD, ccSLD, sTLD, uTLD, gSLD, gTLD, etc), y un filtrado con listas blancas de dominios (falsos positivos, como google, hotmail, yahoo, etc), para obtener una mega ACL, optimizada para Squid, libre de "overlapping domains" (e.g: "ERROR: '.sub.example.com' is a subdomain of '.example.com'").

Blackweb is a project that aims to collect as many public domain blacklists (to block porn, downloads, drugs, malware, spyware, trackers, Bots, social networks, warez, arms sales, etc.), in order to unify them and make them compatible with Squid (Tested in v3.5.x ). To do this, we perform a debugging of urls, to avoid duplicates, invalid domains (validation, ccTLD, ccSLD, sTLD, uTLD, gSLD, gTLD, etc), and filter with white lists of domains (false positives such as google , hotmail, yahoo, etc.), to get a mega ACL, optimized for Squid, free of overlapping domains (eg: "ERROR: '.sub.example.com' is a subdomain of '.example.com'").

FICHA TECNICA / DATA SHEET


ACL Black Domains txt size tar.gz size
blackweb.txt 3.867.256 87.7 MB 19.8 MB

DEPENDENCIAS / DEPENDENCIES


git squid bash tar zip wget subversion python

GIT CLONE


git clone --depth=1 https://github.com/maravento/blackweb.git

MODO DE USO / HOW TO USE


La ACL blackweb.txt ya viene optimizada para Squid. Descárguela y descomprimala en la ruta de su preferencia / The ACL blackweb.txt is already optimized for Squid. Download it and unzip it in the path of your preference

Download and Checksum
wget -q -N https://raw.githubusercontent.com/maravento/blackweb/master/blackweb.tar.gz && cat blackweb.tar.gz* | tar xzf -
wget -q -N https://raw.githubusercontent.com/maravento/blackweb/master/checksum.md5
md5sum blackweb.txt | awk '{print $1}' && cat checksum.md5 | awk '{print $1}'

ACTUALIZACIÓN / UPDATE


El script bwupdate.sh descarga las FUENTES, las depura, unifica y elimina los dominios superpuestos. Una vez terminado verifique los logs de Squid y si aún existe algún error deberá depurarlo manualmente de blackweb / The bwupdate.sh script downloads the SOURCES, purifies, unifies and removes overlapping domains. Once finished check the Squid logs and if there is still an error you must manually debug it from blackweb

wget -q -N https://raw.githubusercontent.com/maravento/blackweb/master/bwupdate/bwupdate.sh && sudo chmod +x bwupdate.sh && sudo ./bwupdate.sh
Advertencia / Warning
  • Antes de utilizar bwupdate.sh debe activar la regla en Squid-Cache / You must activate the rule in Squid-Cache before using bwupdate.sh
  • Para reducir el tamaño de blackweb (eliminando dominios inactivos/muertos/inválidos) puede utilizar la herramienta httpstatus / To reduce the size of blackweb (eliminating inactive/dead/invalid domains) can use httpstatus
  • El proceso de actualización y depuración de blackweb puede tardar mucho tiempo y consumir muchos recursos de hardware y ancho de banda / The process of updating and debugging of blackweb can take a long time and consume many hardware resources and bandwidth
  • El path por default de blackweb es /etc/acl. Puede cambiarlo por el directorio de su preferencia / The default path of blackweb is /etc/acl. You can change it by the directory of your preference
  • bwupdate.sh no incluye cloud/sync (Mega, Dropbox, Pcloud, iCloud, etc), soporte remoto (Teamviewer, Anydesk, logmein, etc) o dominios relacionados con telemetría, excepto si ya vienen bloqueados desde las FUENTES. Para bloquearlos o excluirlos debe activar las líneas en el script (# JOIN LIST), aunque se recomenda agregarlas manualmente a blackweb para evitar conflictos o falsos positivos / bwupdate.sh does not include cloud/sync (Mega, Dropbox, Pcloud, iCloud, etc) or remote support (Teamviewer, Anydesk, logmein, etc), or domains related to telemetry, except if they are already blocked from the SOURCES. To block or exclude them you must activate the lines in the script (# JOIN LIST), although it is recommended to add them manually to blackweb to avoid conflicts or false positives

Edite / Edit:

/etc/squid/squid.conf

Y agregue las siguientes líneas: / And add the following lines:

# INSERT YOUR OWN RULE(S) HERE TO ALLOW ACCESS FROM YOUR CLIENTS
acl blackweb dstdomain -i "/path_to_acl/blackweb.txt"
http_access deny blackweb
MUY IMPORTANTE (Edición Blackweb) / VERY IMPORTANT (Blackweb Edition)

Blackweb contiene millones de dominios bloqueados, por tanto: / Blackweb contains millions of blocked domains, so:

acl whitedomains dstdomain -i "/path_to_acl/whitedomains.txt"
acl blackdomains dstdomain -i "/path_to_acl/blackdomains.txt"
acl blackweb dstdomain -i "/path_to_acl/blackweb.txt"
http_access allow whitedomains
http_access deny blackdomains
http_access deny blackweb

FUENTES / SOURCES


Blacklists

Adaway

adblockplus malwaredomains_full

ABPindo indonesianadblockrules

Anti-WebMiner

anudeepND Blacklist (included: coinminer, adservers)

BambenekConsulting

betterwebleon dga-feed

Capitole - Direction du Système d'Information (DSI)

Carl Spam

cedia.org.ec (included: domains, immortal_domains)

chadmayfield (included: porn_all, porn top)

CHEF-KOCH BarbBlock-filter-list

Cibercrime-Tracker

crazy-max WindowsSpyBlocker

Dawsey21 List

Disconnect.me (included: simple_ad, simple_malvertising, simple_tracking)

dshield.org (included: Low, Medium, High)

ethanr dns-blacklists

firebog.net (included: AdguardDNS, Airelle-hrsk, Airelle-trc, BillStearns, Easylist, Easyprivacy, Kowabit, Prigent-Ads, Prigent-Malware, Prigent-Phishing, Shalla-mal, WaLLy3K)

gfmaster adblock-korea

Halt-and-Block-Mining

hexxium

hostsfile.mine.nu

hosts-file.net (included: ad_servers, emd, grm, hosts, psh)

Joelotz URL Blacklist

Joewein Blacklist

KADhosts

malc0de

Malwaredomains

Malwaredomainlist Hosts

Malware-domains (or Lehigh Malwaredomains)

margevicius easylistlithuania

Matomo-org referrer-spam-blacklist

MESD blacklists

mitchellkrogza (included: Badd-Boyz-Hosts, Hacked Malware Web Sites, Nginx Ultimate Bad Bot Blocker, The Big List of Hacked Malware Web Sites, Ultimate Hosts Blacklist)

notabug latvian-list

Oleksiig Blacklist

openphish

Perflyst (included: android-tracking, SmartTV)

Quedlin blacklist

quidsup (included: notrack-blocklists, notrack-malware, trackers, qmalware)

Ransomware Abuse (included: CryptoWall, Locky, Domain Blocklist, Ransomware Abuse ,URL Blocklist ,TorrentLocker)

reddestdream

securemecca.net and hostsfile.org

Shallalist.de

Someonewhocares

squidblacklist.org (included: dg-ads, dg-malicious.acl)

StevenBlack (included: add.2o7Net, add.Risk, fakenews-gambling-porn-social, hosts, spam, uncheckyAds)

Stopforumspam Toxic Domains

tankmohit UnifiedHosts

Taz SpamDomains

vokins yhosts

Winhelp2002

YousList

Yoyo

zerodot1 CoinBlockerLists (included: Host, host_browser, host_optional, list, list_browser, list_browser_UBO)

Zeustracker

Discontinued

UrlBlacklist (Server Down since July 24, 2017)

Passwall SpamAssassin (Server Down since Dec 2016)

Central Repo Dead Domains (Last Update included in Inactive Domains)

Debug Lists
Internals

BlackTLDs

BlackURLs

WhiteURLs

Inactive

Invalid

Optionals

CloudSync

Remote

Telemetry

Externals

ipv6-hosts (Partial)

O365IPAddresses (Partial)

Ransomware Database

University Domains and Names Data List

TLDs

Tools
Internals

Debugging Squid Error

Parse Domains (modified by Maravento)

httpstatus

Externals

CTFR

CONTRIBUCIONES / CONTRIBUTIONS


Agradecemos a todos aquellos que han contribuido a este proyecto. Los interesados pueden contribuir, enviándonos enlaces de nuevas listas, para ser incluidas en este proyecto / We thank all those who have contributed to this project. Those interested can contribute, sending us links of new lists, to be included in this project

Special thanks to: Jhonatan Sneider

DONACION / DONATE


BTC: 3M84UKpz8AwwPADiYGQjT9spPKCvbqm4Bc

LICENCIAS / LICENCES


GPL-3.0

CreativeCommons maravento.com is licensed under a Creative Commons Reconocimiento-CompartirIgual 4.0 Internacional License.

© 2019 Maravento Studio

EXENCION DE RESPONSABILIDAD / DISCLAIMER


THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

Releases

No releases published

Packages

No packages published

Languages

  • Shell 91.8%
  • Python 8.2%