Skip to content

Commit 737089f

Browse files
committed
update newspaper, python-goose detail link
1 parent f4d6edd commit 737089f

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -604,9 +604,9 @@ Python实现的数据库。
604604
* html2text:将 HTML 转换为 Markdown 格式文本[官网](https://github.com/Alir3z4/html2text)
605605
* lassie:人性化的网页内容检索库。[官网](https://github.com/michaelhelmick/lassie)
606606
* micawber:一个小型网页内容提取库,用来从 URLs 提取富内容。[官网](https://github.com/coleifer/micawber)
607-
* newspaper:使用 Python 进行新闻提取,文章提取以及内容策展。[官网](https://github.com/codelucas/newspaper)
607+
* [newspaper](http://hao.jobbole.com/python-newspaper/):使用 Python 进行新闻提取,文章提取以及内容策展。[官网](https://github.com/codelucas/newspaper)
608608
* opengraph:一个用来解析开放内容协议(Open Graph Protocol)的 Python模块。[官网](https://github.com/erikriver/opengraph)
609-
* python-goose:HTML内容/文章提取器。[官网](https://github.com/grangier/python-goose)
609+
* [python-goose](http://hao.jobbole.com/python-goose/):HTML内容/文章提取器。[官网](https://github.com/grangier/python-goose)
610610
* python-readability:arc90 公司 readability 工具的 Python 高速端口。[官网](https://github.com/buriy/python-readability)
611611
* sanitize:为杂乱的数据世界带来调理性。[官网](https://github.com/Alir3z4/python-sanitize)
612612
* sumy:一个为文本文件和 HTML 页面进行自动摘要的模块。[官网](https://github.com/miso-belica/sumy)

0 commit comments

Comments
 (0)