add textract to Web Content Extracting section

Vinta 2014-08-05 22:10:47 +08:00
parent a7942efbbf
commit d52ea10139
1 changed file with 1 addition and 0 deletions

@@ -565,6 +565,7 @@ A curated list of awesome Python frameworks, libraries and software. Inspired by
* [Haul](https://github.com/vinta/Haul) - An Extensible Image Crawler.
* [python-readability](https://github.com/buriy/python-readability) - Fast Python port of arc90's readability tool.
* [opengraph](https://github.com/erikriver/opengraph) - A Python module to parse the Open Graph Protocol
* [textract](https://github.com/deanmalmgren/textract) - Extract text from any document, Word documents, PowerPoint presentations, PDFs, etc.
## Forms