GitHub / arunkumar201 / aws-lambda-puppeteer-scraper
A scalable, serverless web scraper built on AWS Lambda and Puppeteer. Scraping jobs are submitted through an API Gateway endpoint and processed asynchronously via SQS, with screenshots stored in S3. It is designed for high-volume data extraction.
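The queue-driven flow in the description can be illustrated with a minimal sketch of an SQS-triggered handler. This is not the repository's actual code: the `ScrapeJob` payload, the `Deps` interface, the `handleBatch` and `parseJob` names, and the `screenshots/` key prefix are all assumptions made for illustration. The browser and storage calls are injected so the sketch runs without Puppeteer or the AWS SDK; in a real deployment they would presumably wrap a headless Chromium page and an S3 upload.

```typescript
// Hypothetical job payload; the repository's real message schema is not shown here.
interface ScrapeJob {
  url: string;
}

// Minimal subset of the event Lambda passes to a handler for an SQS source.
interface SqsRecord {
  messageId: string;
  body: string;
}
interface SqsEvent {
  Records: SqsRecord[];
}

// Partial batch response: only the listed messages are returned to the queue.
interface BatchResponse {
  batchItemFailures: { itemIdentifier: string }[];
}

// Injected dependencies (assumed names) standing in for Puppeteer and S3.
interface Deps {
  screenshot(url: string): Promise<Uint8Array>;
  store(key: string, data: Uint8Array): Promise<void>;
}

// Validate the message body before any browser work is started.
function parseJob(body: string): ScrapeJob {
  const parsed: unknown = JSON.parse(body);
  if (typeof parsed !== "object" || parsed === null) {
    throw new Error("job must be a JSON object");
  }
  const url = (parsed as { url?: unknown }).url;
  if (typeof url !== "string" || !/^https?:\/\//.test(url)) {
    throw new Error("job.url must be an http(s) URL");
  }
  return { url };
}

// Process each record independently so one bad job does not fail the batch.
async function handleBatch(event: SqsEvent, deps: Deps): Promise<BatchResponse> {
  const batchItemFailures: { itemIdentifier: string }[] = [];
  for (const record of event.Records) {
    try {
      const job = parseJob(record.body);
      const image = await deps.screenshot(job.url);
      await deps.store(`screenshots/${record.messageId}.png`, image);
    } catch (_err) {
      // Report the failed message so it can be retried or dead-lettered.
      batchItemFailures.push({ itemIdentifier: record.messageId });
    }
  }
  return { batchItemFailures };
}
```

Returning `batchItemFailures` relies on partial batch responses being enabled on the SQS event source mapping; without that setting, a handler would typically throw to have the whole batch retried.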
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/arunkumar201%2Faws-lambda-puppeteer-scraper
PURL: pkg:github/arunkumar201/aws-lambda-puppeteer-scraper
Stars: 1
Forks: 0
Open issues: 0
License: MIT
Language: TypeScript
Size: 21.4 MB
Dependencies parsed at: Pending
Created at: 14 days ago
Updated at: 9 days ago
Pushed at: 9 days ago
Last synced at: 9 days ago
Topics: api-gateway, aws, aws-lambda, chromium, puppeteer, puppeteer-core, s3-bucket, serverless, sqs, typescript, web-scraping