Skip to content

This Project is a website detector, Detect website from text to image both way. Detect content that's CCP may don't like.

Notifications You must be signed in to change notification settings

z23788677/CCP_GFW_detector

Repository files navigation

Statement

This project doesn't mean to political propaganda, this is aiming for fun and learning.

Wish you get offended by it.

CCP_GFW_Detector

This is a program with a GUI, allowing users to input a URL link to the program. The program will access any single Chinese characters from the website HTML content. Since some websites use images more than text, once the program accesses the HTML content, it'll also process images for Chinese characters. It doesn't matter if the website uses simplified or traditional Chinese, the program won't miss any of them.

What is GFW

Great Fire Wall

The Chinese government has a GFW policy. Generally speaking, this means that Chinese people can't access websites banned by the government, such as YouTube and Facebook. Since some website content is something that the CCP doesn't want Chinese people to see, this is the aim of the project.

The "word_list" Folder

I listed a lot of words that the CCP may not like, and rated them by "Red", "Orange", and "Yellow".

  • Red: Words that MUST BE BANNED
  • Orange: Words that are OKAY to use, but as less as possible
  • Yellow: Words that are between safe and Orange words

These DAT files can be edited.

Some of the Python files have this, please ignore them as they are for testing during the developing process.

if __name__ == "__main__": something...

The Project not done yet. please wait

  • images text classification
  • detecting process state display to gui
  • Chinese version of md file
  • list detected word to gui