A Computer Science Undergraduate Student from HK PolyU
- HK
-
19:52
(UTC +08:00)
Highlights
- Pro
Popular repositories Loading
-
-
-
-
-
-
Crawler-Parallel
Crawler-Parallel PublicForked from ChenyuGao/Crawler-Parallel
C语言并行爬虫(epoll),爬取服务器的16W个有效网页,通过爬取页面源代码进行确定性自动机匹配和布隆过滤器去重,对链接编号并写入url.txt文件,并通过中间文件和三叉树去除掉状态码非200的链接关系,将正确的链接关系继续写入url.txt
C
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.