Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: ingests-documents

opensemanticsearch/open-semantic-etl

Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database

Language: Python - Size: 615 KB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 241 - Forks: 68

open-search/elastic-ingestion

Language: JavaScript - Size: 41 KB - Last synced: 3 days ago - Pushed: about 6 years ago - Stars: 2 - Forks: 0