Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub / opensemanticsearch / open-semantic-etl

Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database

JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/opensemanticsearch%2Fopen-semantic-etl

Stars: 241
Forks: 68
Open Issues: 41

License: gpl-3.0
Language: Python
Repo Size: 615 KB
Dependencies: 17

Created: almost 9 years ago
Updated: 7 months ago
Last pushed: over 1 year ago
Last synced: 6 months ago

Topics: annotation, documents, elasticsearch, enrichment, etl, extract, extract-information, extract-text, extractor, ingest, ingestion-pipeline, ingests-documents, named-entity-recognition, nlp, ocr, pdf, python, rdf, solr, solr-dataimporter

Funding links: https://www.paypal.me/MMandalka

Files
    Loading...
    Readme
    Loading...
    Dependencies
    Dockerfile docker