GitHub / deanmalmgren / textract
extract text from any document. no muss. no fuss.
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/deanmalmgren%2Ftextract
PURL: pkg:github/deanmalmgren/textract
Stars: 4,240
Forks: 631
Open issues: 149
License: MIT
Language: HTML
Size: 4.31 MB
Dependencies parsed at: Pending
Created at: about 11 years ago
Updated at: about 20 hours ago
Pushed at: 8 months ago
Last synced at: about 6 hours ago
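
The JSON API and PURL fields above both derive from the repository's owner/name pair. As a minimal sketch, assuming the URL pattern visible in the JSON API line holds for other repositories (only this one URL is confirmed by the listing), the two identifiers could be rebuilt as follows. The helper names `repo_api_url` and `repo_purl` are hypothetical and not part of any ecosyste.ms client.

```python
from urllib.parse import quote

# Base taken from the JSON API line in this listing.
API_BASE = "http://repos.ecosyste.ms/api/v1"


def repo_api_url(host: str, full_name: str) -> str:
    """Build the repository API URL (hypothetical helper).

    safe="" makes quote() percent-encode "/" as %2F, matching the
    encoded owner/name segment shown in the listing.
    """
    return (
        f"{API_BASE}/hosts/{quote(host, safe='')}"
        f"/repositories/{quote(full_name, safe='')}"
    )


def repo_purl(full_name: str) -> str:
    """Build a GitHub package URL (hypothetical helper)."""
    return f"pkg:github/{full_name}"


url = repo_api_url("GitHub", "deanmalmgren/textract")
purl = repo_purl("deanmalmgren/textract")
print(url)
print(purl)
```

The slash between owner and repository name has to be encoded because it would otherwise be read as a path separator.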
Commit Stats
Commits: 502
Authors: 41
Mean commits per author: 12.24
Development Distribution Score: 0.45
More commit stats: https://commits.ecosyste.ms/hosts/GitHub/repositories/deanmalmgren/textract
Topics: data-mining, natural-language-processing, python, text-mining
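
The mean commits per author figure in the commit stats is the commit count divided by the author count. A quick check, using only the numbers listed above (the variable names are illustrative):

```python
# Values copied from the Commit Stats fields in this listing.
commits = 502
authors = 41

# Mean commits per author, rounded to two decimals as in the listing.
mean_commits = round(commits / authors, 2)
print(mean_commits)
```

This reproduces the listed value of 12.24.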