An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: crawl4ai

cyberagiinc/DevDocs

Completely free, private, UI based Tech Documentation MCP server. Designed for coders and software developers in mind. Easily integrate into Cursor, Windsurf, Cline, Roo Code, Claude Desktop App

Language: TypeScript - Size: 3.33 MB - Last synced at: about 10 hours ago - Pushed at: 8 days ago - Stars: 1,576 - Forks: 150

Gint367/webscraping_marketing

scraping fixed asset from Jahresabschluss and machine keywords from the company websites

Language: Python - Size: 42.9 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 1

MalikMalcolm1/PuppetMaster

Puppeteer & Crawl4AI microservice for web automation, scraping, and AI processing with Bull queues

Language: Python - Size: 41 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

bigsk1/supa-crawl-chat

Integrates Supabase with Crawl4AI and AI Chat to create a powerful web crawling and semantic search solution. Streamlit supabase data visualization. Run all in Docker. API and more!

Language: Python - Size: 1.69 MB - Last synced at: 8 days ago - Pushed at: 29 days ago - Stars: 14 - Forks: 1

PangPangGod/crawl4ai-sample-codes

crawl4ai sample codes

Language: Python - Size: 0 Bytes - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

NhanPhamThanh-IT/Deepnews-Summarizer

A feature-rich web application for automated news scraping and summarization. It allows users to enter article URLs, fetches the full content, and generates concise summaries. The system supports both local inference with custom models and remote deployment via FastAPI or Streamlit interfaces.

Language: Jupyter Notebook - Size: 309 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 1 - Forks: 0

mzazakeith/PuppetMaster

Puppeteer & Crawl4AI microservice for web automation, scraping, and AI processing with Bull queues

Language: Python - Size: 47.9 KB - Last synced at: 16 days ago - Pushed at: 23 days ago - Stars: 3 - Forks: 0

lymagics/generic-ai-scraper

Just provide schema and let your AI scrape

Language: Python - Size: 4.88 KB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 0 - Forks: 0

Jeanetted3v/Web-Crawler-Playground

A playground to testing out website crawling tools

Language: Python - Size: 16.6 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

wenkil/langgraph_study_web_agent

LangGraph Web Agent: A LangGraph based intelligent AI assistant that can perform network searches, crawl web content, and intelligently summarize, providing users with a deeper and more comprehensive network information acquisition experience than regular searches.

Language: Python - Size: 43.9 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

tecno-consultores/llm-lab

LLM laboratory

Language: Shell - Size: 203 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 3 - Forks: 2

Svantevith/talk-to-your-website

RAG agent enhanced with LLM-optimized website crawler built using Crawl4AI, Langchain, ChromaDB and Ollama.

Language: Python - Size: 189 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

TrueMan777/upwork_scraper

Web-scraper for Upwork jobs using Selenium-driverless

Language: Python - Size: 50.8 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

JamesN-dev/Scroll-Scribe

ScrollScribe is a Python tool designed to scrape internal URLs from targeted website, read those URLs from a text file, scrape the content of each URL using `crawl4ai`, and generate cleaned Markdown output suitable for ingestion into RAG (Retrieval-Augmented Generation) systems or other text-processing pipelines.

Language: Python - Size: 646 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Karthick-840/Crawl4ai-RAG-with-Local-LLM

A tool for scraping web documentation using Crawl4AI, converting it to Markdown, and preparing it for integration with local LLMs (like Ollama) to enhance their knowledge for learning and "vibe coding" workflows.

Language: Python - Size: 33.1 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

e-d-i-n-i/ai-data-extraction

AI-driven system for structured data extraction, storage, and vector search, leveraging Crawl4AI, PydanticAI, and Supabase to enable efficient retrieval and RAG-based AI applications.

Size: 2.93 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

thevladdo/rag-backend

Retrieval-Augmented Generation server with Pinecone and OpenAI

Language: HTML - Size: 46.9 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

oussemabenhassena5/Crawl4DeepSeek

Crawl4DeepSeek = Crawl4AI + DeepSeek 🚀 Smart, efficient, and built for deep web exploration! 🌐🤖

Language: Python - Size: 23.4 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 7 - Forks: 1

Swish78/linkedin-insider-

Linkedin Info scrapper using crawl4ai

Language: Python - Size: 13.7 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

sidhyaashu/scrapy-deepseek

Scrap any website using deepseek

Language: Python - Size: 12.7 KB - Last synced at: about 10 hours ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

yashpinjarkar10/Pinescript-Agent

PineScript Agentic RAG system that provides an interactive chat interface to answer questions about PineScript. It combines a Streamlit-based UI with an asynchronous AI agent to retrieve and summarize documentation stored in a Supabase database.

Language: Python - Size: 25.4 KB - Last synced at: 9 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

kaymen99/ai-web-scraper

AI web scraper built with Crawl4AI for extracting structured leads data from websites.

Language: Python - Size: 19.5 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 14 - Forks: 1

balaji1233/WEB_MASTER

AI tool to transforms any URL into a structured knowledge source by: extracting content using Crawl4AI ,vectorizing and summarizing data , running Retrieval-Augmented Generation (RAG) for deep information discovery, enabling a smart chatbot for interactive Q&A.

Language: Python - Size: 8.79 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

laurentvv/crawl4ai-mcp

Web crawling tool that integrates with AI assistants via the MCP

Language: Python - Size: 0 Bytes - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

shubhampandit/ai-web-scraper

Web Scraper using Gen-AI

Language: Python - Size: 7.81 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

GreatHayat/yt-shorts-generator

Generate YouTube Shorts Video From an Article Link using Pydantic AI

Language: Python - Size: 21.5 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 4 - Forks: 0

ebrown-32/webcrawleb

URL to Markdown for AI Processing. Provide better quality input to LLMs for better output.

Language: CSS - Size: 42 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Saurabh7636/crawl2md

A streamlined Python tool that crawls websites and converts them into organized Markdown files.

Language: Python - Size: 3.91 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

sitamgithub-MSIT/crawl4ai-txtfy

Easily retrieve the full-text content from web pages using Crawl4AI.

Language: Python - Size: 389 KB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

muzzlol/review-radar

Language: TypeScript - Size: 14.6 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

varunsaagar/crawlwithagents

The Web Metadata Extraction Toolkit is designed to streamline the process of extracting, cleaning, and analyzing metadata from websites. Utilizing advanced AI models and custom extraction strategies, this toolkit helps users efficiently gather data like titles, descriptions, and keywords, which are crucial for SEO and content strategy.

Language: Python - Size: 9.77 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0