Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub / opensemanticsearch / open-semantic-etl
Python-based open source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (entity extraction and named entity recognition), and data enrichment (annotation) pipelines, plus an ingestor to a Solr or Elasticsearch index and a linked data graph database
JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/opensemanticsearch%2Fopen-semantic-etl
Stars: 241
Forks: 68
Open Issues: 41
License: gpl-3.0
Language: Python
Repo Size: 615 KB
Dependencies: 17
Created: almost 9 years ago
Updated: 7 months ago
Last pushed: over 1 year ago
Last synced: 6 months ago
Topics: annotation, documents, elasticsearch, enrichment, etl, extract, extract-information, extract-text, extractor, ingest, ingestion-pipeline, ingests-documents, named-entity-recognition, nlp, ocr, pdf, python, rdf, solr, solr-dataimporter
Funding links: https://www.paypal.me/MMandalka
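
The JSON API URL listed above percent-encodes the repository's full name, so the `/` between owner and repository appears as `%2F`. A minimal sketch of building that URL and fetching the record with the Python standard library follows. The helper names `repo_api_url` and `fetch_repo_metadata` are hypothetical, and the sketch assumes only that the endpoint returns a JSON object; it makes no assumption about which fields the response contains.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base path taken from the JSON API URL shown on this page.
API_ROOT = "https://repos.ecosyste.ms/api/v1"


def repo_api_url(host: str, full_name: str) -> str:
    """Build the repository endpoint URL (hypothetical helper).

    safe="" makes quote() encode "/" as %2F, matching the URL format
    used on this page.
    """
    return (
        f"{API_ROOT}/hosts/{quote(host, safe='')}"
        f"/repositories/{quote(full_name, safe='')}"
    )


def fetch_repo_metadata(host: str, full_name: str, timeout: float = 10.0) -> dict:
    """Fetch and decode the JSON record (hypothetical helper).

    Assumes the endpoint responds with a JSON object; the field names
    in the response are not documented on this page.
    """
    with urlopen(repo_api_url(host, full_name), timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    # Only builds and prints the URL; no network request is made here.
    print(repo_api_url("GitHub", "opensemanticsearch/open-semantic-etl"))
```

Calling `fetch_repo_metadata("GitHub", "opensemanticsearch/open-semantic-etl")` would perform the actual request; inspect the returned dictionary's keys before relying on any particular field.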
Dependencies
- ${FROM} latest build