- sohu-dataset: 抓取自sohu网站的1000个网页,附带标题、关键词、带格式的HTML正文内容,无格式的纯文本内容等信息,以XML格式保存。可用于关键词抽取测试。
forked from iamxiatian/data
-
Notifications
You must be signed in to change notification settings - Fork 0
taiyangdixia/data
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Experimental Data
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published