GitHub / anyparser / anyparserjs
Anyparser TypeScript SDK for RAG/ETL Pipelines - File Content Extraction. Supports extraction from various file formats including PDF, Microsoft Office documents, OCR/Image to Text, Audio to Text, and Website to Text.
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/anyparser%2Fanyparserjs
Stars: 1
Forks: 0
Open issues: 0
License: apache-2.0
Language: TypeScript
Size: 408 KB
Dependencies parsed at: Pending
Created at: 2 months ago
Updated at: about 2 months ago
Pushed at: about 2 months ago
Last synced at: 23 days ago
Topics: anyparser, artificial-intelligence, cache-augmented-generation, crawler, etl-pipeline, graph-rag, knowledgebase, langchain, microsoft-office, microsoft-word, ms-office, n8n-nodes, ocr, pdf-extraction, rag, retrieval-augmented-generation, text-extraction, web-crawler