An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: ai-web-scraper

oxylabs/web-scraping-machine-learning

Web Scraping for Machine Learning

Language: Jupyter Notebook - Size: 129 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 1 - Forks: 0

teamreboott/Tapestry

🌐 Open-Source Web Search Backend Framework via Plug-and-Play Knowledge Reconstruction

Language: Python - Size: 9.07 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 3 - Forks: 0

m92vyas/llm-reader

Turns webpages into LLM-friendly input text. Similar to Firecrawl and Jina Reader API. Makes RAG, AI web scraping, and extraction of image and webpage links easy.

Language: Python - Size: 92.8 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 201 - Forks: 15

Weekend-Dev-Labs/crawlio-js

Language: TypeScript - Size: 64.5 KB - Last synced at: 15 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

JustM3Sunny/AI_WEB_INFO_RETRIVAL

AI-powered web information retriever that performs real-time search, intelligent content extraction, and Gemini-based summarization, with CLI and Python support.

Language: Python - Size: 102 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

javapuppteernodejs/ai-web-unlocker

Size: 3.91 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

nnitiwe-dev/Nimble_Codelabs

This repository is dedicated to hosting a variety of Nimble's web scraping experiments.

Language: Python - Size: 35.2 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0