- 使用Jsoup抓取,但是使用的不多,后续需改进
- 程序代码有点混乱,没有处理和使用好Java的函数和类关系,有待学习和掌握
- 程序思路:
- 爬取了网站所有页面,存储在content.txt文件,同时存储各个页面title到map中
- 复制无内容网页框架rawGet.html内容到Get_gank.html并写入map中内容形成汇总列表
- 将content.txt内容写入到Get_gank.html
-
Notifications
You must be signed in to change notification settings - Fork 4
lxxself/getGank
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
抓取gank.io网页
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published