An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: structured-data-extraction

xingbow/SciDaEx

Data Extraction and Structuring Demo

Language: Python - Size: 1.14 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 12 - Forks: 1

AI-Data-Space/happymatrix-eco-assistant

AI-powered assistant for analyzing Engineering Change Orders (ECOs) using Google Gemini and RAG

Language: Jupyter Notebook - Size: 255 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

msoedov/validex

Simplifies the retrieval, extraction, and training of structured data from various unstructured sources.

Language: Python - Size: 487 KB - Last synced at: about 12 hours ago - Pushed at: 11 days ago - Stars: 143 - Forks: 12

serpapi/google-local-results-ai-parser

A Ruby gem to extract structured data from Google Local Search Results using the serpapi/bert-base-local-results model, enabling parsing, classification, and information extraction from English HTML content.

Language: Ruby - Size: 51.8 KB - Last synced at: 1 day ago - Pushed at: almost 2 years ago - Stars: 18 - Forks: 1

milahu/reverse-template-engine

Find the common template of many similar HTML files

Language: JavaScript - Size: 159 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 0

towfique-elahe/pdf-to-structured-csv

A Python-based tool for extracting structured data from PDFs using OCR and regex, and exporting it to CSV. Ideal for processing invoices, logs, or scanned documents into organized, usable datasets.

Language: Jupyter Notebook - Size: 27.3 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

ShomaSpirks12/Web-Parser

Size: 17.6 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0