GitHub / opensemanticsearch / open-semantic-etl
Python-based open source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (entity extraction and named entity recognition), and data enrichment (annotation) pipelines, plus an ingestor to a Solr or Elasticsearch index and a linked data graph database
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/opensemanticsearch%2Fopen-semantic-etl
PURL: pkg:github/opensemanticsearch/open-semantic-etl
Stars: 268
Forks: 72
Open issues: 42
License: gpl-3.0
Language: Python
Size: 615 KB
Dependencies parsed at: Pending
Created at: about 10 years ago
Updated at: 3 months ago
Pushed at: almost 3 years ago
Last synced at: 2 months ago
Topics: annotation, documents, elasticsearch, enrichment, etl, extract, extract-information, extract-text, extractor, ingest, ingestion-pipeline, ingests-documents, named-entity-recognition, nlp, ocr, pdf, python, rdf, solr, solr-dataimporter
Funding links: https://www.paypal.me/MMandalka
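
The JSON API URL and PURL listed above appear to follow a regular pattern: the repository's `owner/name` is percent-encoded into a single path segment for the API, and appended as-is after `pkg:github/` for the PURL. The following is a minimal sketch of that pattern, inferred only from the two lines above. The helper names `repo_api_url` and `repo_purl` are hypothetical, and the code makes no network request.

```python
from urllib.parse import quote

# Base path copied from the "JSON API" line above.
API_BASE = "http://repos.ecosyste.ms/api/v1"


def repo_api_url(host: str, full_name: str) -> str:
    """Build the repository endpoint URL (hypothetical helper).

    safe="" makes quote() encode the slash in owner/name as %2F,
    so the full name occupies exactly one path segment.
    """
    return (
        f"{API_BASE}/hosts/{quote(host, safe='')}"
        f"/repositories/{quote(full_name, safe='')}"
    )


def repo_purl(full_name: str) -> str:
    """Build a GitHub PURL in the form shown on the "PURL" line (hypothetical helper)."""
    return f"pkg:github/{full_name}"


full_name = "opensemanticsearch/open-semantic-etl"
url = repo_api_url("GitHub", full_name)
purl = repo_purl(full_name)
print(url)
print(purl)
```

Passing the resulting URL to any HTTP client should return the same metadata shown in this listing, assuming the endpoint is still served at that address.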