Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub / vaibhavhaswani / GoText
GoText is a universal text extraction and preprocessing tool for python which supportss wide variety of document formats.
JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vaibhavhaswani%2FGoText
Stars: 0
Forks: 1
Open Issues: 0
License: gpl-3.0
Language: Python
Repo Size: 66.4 KB
Dependencies:
4
Created: over 2 years ago
Updated: over 1 year ago
Last pushed: over 1 year ago
Last synced: about 1 month ago
Topics: data-preprocessing, datacleaning, python, similarity-score, text-preprocessing
Files
Dependencies
- get *
- textract-plus *