File tree Expand file tree Collapse file tree 1 file changed +2
-2
lines changed Expand file tree Collapse file tree 1 file changed +2
-2
lines changed Original file line number Diff line number Diff line change @@ -604,9 +604,9 @@ Python实现的数据库。
604
604
* html2text:将 HTML 转换为 Markdown 格式文本[ 官网] ( https://github.com/Alir3z4/html2text )
605
605
* lassie:人性化的网页内容检索库。[ 官网] ( https://github.com/michaelhelmick/lassie )
606
606
* micawber:一个小型网页内容提取库,用来从 URLs 提取富内容。[ 官网] ( https://github.com/coleifer/micawber )
607
- * newspaper:使用 Python 进行新闻提取,文章提取以及内容策展。[ 官网] ( https://github.com/codelucas/newspaper )
607
+ * [ newspaper] ( http://hao.jobbole.com/python-newspaper/ ) :使用 Python 进行新闻提取,文章提取以及内容策展。[ 官网] ( https://github.com/codelucas/newspaper )
608
608
* opengraph:一个用来解析开放内容协议(Open Graph Protocol)的 Python模块。[ 官网] ( https://github.com/erikriver/opengraph )
609
- * python-goose:HTML内容/文章提取器。[ 官网] ( https://github.com/grangier/python-goose )
609
+ * [ python-goose] ( http://hao.jobbole.com/python-goose/ ) :HTML内容/文章提取器。[ 官网] ( https://github.com/grangier/python-goose )
610
610
* python-readability:arc90 公司 readability 工具的 Python 高速端口。[ 官网] ( https://github.com/buriy/python-readability )
611
611
* sanitize:为杂乱的数据世界带来调理性。[ 官网] ( https://github.com/Alir3z4/python-sanitize )
612
612
* sumy:一个为文本文件和 HTML 页面进行自动摘要的模块。[ 官网] ( https://github.com/miso-belica/sumy )
You can’t perform that action at this time.
0 commit comments