GitHub topics: dataextraction | Ecosyste.ms: Repos

avhiraj/SentimentScope-E-Commerce-Review-Analyzer

📊 Analyze e-commerce reviews to uncover sentiment and drive insights through SQL ETL and Python visualizations for better business decisions.

Size: 1.29 MB - Last synced at: about 23 hours ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

wyattowalsh/proxywhirl

rotating proxy system

Language: Python - Size: 13.9 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

puleeno/meerkat-cloud-browser

Control Server Browser via Web API: Web Crawler, Data Extraction and Stream Proxy Server

Language: Python - Size: 57.6 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1 - Forks: 0

Instagram-Automations/scraping-instagram

scraping instagram and data automation

Size: 1.7 MB - Last synced at: 20 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

Cnair02/IMDB-DataProcessing

This repository focusses on Data Cleaning

Language: Jupyter Notebook - Size: 35.2 KB - Last synced at: 13 days ago - Pushed at: 26 days ago - Stars: 0 - Forks: 0

feddelegrand7/ralger

ralger makes it easy to scrape a website. Built on the shoulders of titans: rvest, xml2.

Language: R - Size: 1.27 MB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 165 - Forks: 14

scraper-bots/bul_az

Language: Jupyter Notebook - Size: 3.26 MB - Last synced at: 30 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

scraper-bots/birja-in_az

Language: Python - Size: 241 KB - Last synced at: 30 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

scraper-bots/ucuztap_az

Language: Python - Size: 9.35 MB - Last synced at: 30 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

scraper-bots/rahatemlak_az

Web Scraping with Scrapy

Language: HTML - Size: 62.2 MB - Last synced at: 30 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

scraper-bots/unvan_az

Language: Jupyter Notebook - Size: 82.6 MB - Last synced at: 30 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

scraper-bots/yeniemlak_az

Data Scraping

Language: Jupyter Notebook - Size: 42.5 MB - Last synced at: 30 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

scraper-bots/vipemlak_az

data extraction from vipemlak.az

Language: HTML - Size: 122 MB - Last synced at: 30 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

scraper-bots/ipoteka_az

scraping property info from ipoteka.az

Language: Jupyter Notebook - Size: 1.93 MB - Last synced at: 30 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

scraper-bots/tap_az

Language: Python - Size: 46.9 KB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

scraper-bots/turbo_az

Data Scraping

Language: Jupyter Notebook - Size: 67 MB - Last synced at: 30 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

scraper-bots/birja_com

Web Scraping

Language: HTML - Size: 58.1 MB - Last synced at: 30 days ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

scraper-bots/h2h_az

Data Scraping

Language: Python - Size: 685 KB - Last synced at: 30 days ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

scraper-bots/qarabazar_az

Web Scraping

Language: HTML - Size: 25.4 MB - Last synced at: 30 days ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

scraper-bots/lalafo_az

Language: Jupyter Notebook - Size: 14.7 MB - Last synced at: 30 days ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

scraper-bots/emlak_az

Language: Jupyter Notebook - Size: 17.9 MB - Last synced at: 30 days ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

scraper-bots/bina_az

data extraction from bina.az

Language: Jupyter Notebook - Size: 10.6 MB - Last synced at: 30 days ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

scraper-bots/arenda_az

Data Extraction From arenda_az

Language: Python - Size: 1.42 MB - Last synced at: 30 days ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

scraper-bots/scholenopdekaart_nl

Data Scraping

Language: Python - Size: 205 KB - Last synced at: 30 days ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

scraper-bots/linkedin_com

Web Scraping

Language: Python - Size: 98.6 KB - Last synced at: 30 days ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

scraper-bots/fullhdfilmizlesene_pw

Web Scraping

Language: Jupyter Notebook - Size: 6.6 MB - Last synced at: 30 days ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

scraper-bots/aratap_az

Language: Jupyter Notebook - Size: 6.13 MB - Last synced at: 30 days ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

scraper-bots/bestoftelegram_com

scraping whith scrapy data from https://bestoftelegram.com/

Language: Python - Size: 48.8 KB - Last synced at: 30 days ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

🛰 Ce tutoriel aide les utilisateurs à mieux comprendre, extraire et visualiser les données du télescope NEOSSAT. | 🛰 This tutorial helps users better understand, extract and visualize NEOSSAT telescope data.

Language: Jupyter Notebook - Size: 8.69 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 3

Varun-khorgade/SentimentScope-E-Commerce-Review-Analyzer

Analyzed customer reviews and purchase data to extract sentiment and behavioral insights. Built SQL-based ETL for data preparation and visualized results using Python and Power BI dashboards for actionable business decisions.

Size: 1000 Bytes - Last synced at: 26 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

pythonicshariful/phone-number-extractor

A Python script that extracts phone numbers from images using Tesseract OCR and Regex. Automatically organizes processed images into success and failed folders, and saves results to a CSV file.

Language: Python - Size: 84 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

oxylabs/Web-Scraping-With-Selenium

In this guide on how to web scrape with Selenium, we will be using Python 3. The code should work with any version of Python above 3.6

Language: Python - Size: 74.2 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 31 - Forks: 12

its-arnavtech/Parser_Build-Arnav

This Project is currently working on extracting key data from a resume in order to enhance a candidate's profile

Language: Python - Size: 196 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

Docutain/Docutain-SDK-Example-Xamarin-iOS

Sample project showing how to integrate the Docutain Document Scanner SDK into a Xamarin.iOS application.

Language: C# - Size: 64.5 KB - Last synced at: 28 days ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

Docutain/Docutain-SDK-Example-.NET-MAUI

Sample project showing how to integrate the Docutain Document Scanner SDK into a .NET MAUI application.

Language: C# - Size: 488 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 5 - Forks: 1

Docutain/Docutain-SDK-Example-iOS-Swift

Sample project showing how to integrate the Docutain Document Scanner SDK into an iOS application.

Language: Swift - Size: 11.6 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 4 - Forks: 1

J-TECH-bot/Blackcoffer_Data_Extraction_NLP

This repository showcases data-driven text analytics using NLP techniques. It combines text preprocessing, sentiment scoring, and structured data extraction to convert unstructured text into business-ready datasets.

Language: Jupyter Notebook - Size: 491 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

wordbricks/next-eval

NEXT-EVAL: From Web URLs to Structured Tables – Extraction and Evaluation

Language: TypeScript - Size: 1.64 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 37 - Forks: 3

264Gaurav/airflow-with-astro

Airflow with astronomer - to automate and scheduling the workflow

Language: Python - Size: 8.79 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

devnamdev2003/result_automation_system

The "RGPV Result Scraper" is a Python script that automates the extraction of student results from the Rajiv Gandhi Proudyogiki Vishwavidyalaya (RGPV) website. It handles captchas and saves data in CSV files, making it a valuable tool for academic record retrieval.

Language: Python - Size: 22.2 MB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 1

ashishkumar30/ML-AI-Python-Codes

Python various Important codes, Machine learning, NLP using Spacy and NLTK with Neural Network in ML

Language: Jupyter Notebook - Size: 14.1 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 35 - Forks: 6

PakistanAiFrontierCorps/WebScraping-YaotaishCNC

I was assigned to webscrape a website and build a wordpress store. I used this code to extract the products and their hierarchy categories and made this wordpress website.

Language: Jupyter Notebook - Size: 970 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Mukeshthenraj/date-extraction-project

Extract and normalize dates from unstructured medical notes using Python and regular expressions.

Language: Python - Size: 40 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

royalgaetan/Vscrape

⚡Take automation to the next level: create workflows, scrape the web while you sleep, extract data with AI, and export it in any format.

Language: TypeScript - Size: 2.58 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

mahendraplus/Red-byte

Red-byte is an educational pentesting tool that extracts sensitive data from web forms using hidden payloads embedded within images. It helps users understand phishing vulnerabilities and simulate attacks to enhance security awareness and practices.

Language: JavaScript - Size: 29.3 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

medmahdimaarouf/cropsflow-automobile.tn

Flow-based web scraper for automobile.tn built with CropsFlow.

Language: Python - Size: 21.5 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

eneiromatos/the-home-depot-web-scraper

This web scraper is intended to extract data from The Home Depot Website, it could be run locally or in the Apify platform, the latter is the preferred way. It was made using Apify SDK V3 (Crawlee) with Typescript.

Language: TypeScript - Size: 196 KB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 9 - Forks: 9

mydevground/DataAnalyticsEngineeringScienceVizBILab

Language: Jupyter Notebook - Size: 12.2 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

yumeangelica/jirai_sweeties

A friendly Discord bot with store monitoring capabilities - Tracks online stores for new items and price changes while providing chat commands and real-time notifications. Built with Python.

Language: Python - Size: 120 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Martial2023/Linkedin-Performance-Analytics-Pipeline

Scrapping et analyses des performances des publications linkedin pour comprendre les caractéristiques des posts qui ont les meilleurs performances

Language: Jupyter Notebook - Size: 6.11 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

roslove44/web-scraping-toolkit

Un ensemble d'outils de web scraping pour extraire et analyser des données à partir de sites web spécifiques.

Language: Python - Size: 13.2 MB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

Docutain/Docutain-SDK-Example-Android-Kotlin

Sample project showing how to integrate the Docutain Document Scanner SDK into an Android application.

Language: Kotlin - Size: 132 KB - Last synced at: 7 months ago - Pushed at: 11 months ago - Stars: 11 - Forks: 1

gabrielianfr/web-scraping-project

A Python-based web scraping tool that extracts and stores data in JSON format using BeautifulSoup and Requests.

Language: Python - Size: 2.93 KB - Last synced at: 7 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

kleindasash/Content-Grabber

Content Grabber is a powerful software for automatic data extraction from websites.

Size: 2.93 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

dimitryzub/py-google-scholar-organic-cite-to-csv-sqlite

Scrape historic Google Scholar Organic and Cite results to CSV, MySQL Lite using Python and SerpApi.

Language: Python - Size: 11.7 KB - Last synced at: 3 months ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 4

DhaaraniPushpam/twitter-x-data-extraction

Twitter data extraction using Selenium and twitter api key

Language: Jupyter Notebook - Size: 9.77 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

damnitjoshua/um-timeedit-timetable-toolkit

UM TimeEdit data toolkit for timetable software development.

Language: JavaScript - Size: 13.7 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

sravanigodavarthi/Gmail_to_Excel

This Python script allows you to extract specific email messages from your Gmail inbox, retrieve their subject and content, and save the data into an Excel file

Language: Python - Size: 22.5 KB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 0

Bessouat40/prefect-github-indexer

A Prefect pipeline that periodically scrapes one or more GitHub repositories, generates embeddings, and indexes them in ChromaDB.

Language: Python - Size: 8.79 KB - Last synced at: 7 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

siddartha48/Portfolio-Covid-Data-Analysis-SQL-Server-

Data Exploration using SQL on MS SQL SERVER MGT on Covid data downloaded from https://ourworldindata.org/covid-deaths in CSV file

Size: 1000 Bytes - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

Thunderbit-HQ/ai-web-scraper

Thunderbit is an AI Web Scraper that replaces tedious copy-paste tasks for GTM teams.

Size: 677 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

Nel-zi/Francine_store

Built a scalable data pipeline for Francine Stores, enabling them to extract, clean, and load data from Aliexpress for real-time market trend analysis and smarter business decisions.

Language: Jupyter Notebook - Size: 8.74 MB - Last synced at: 6 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

saivardhan08/My-Projects

In this repository you'll find projects that I have worked on for my own practice and upskilling

Size: 0 Bytes - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

VigilantiaFR/SIRETExtractor

Vigilantia SIRET Extractor | automatic scraper | input :{domain names} | output :{SIRET number}

Language: Python - Size: 45.9 KB - Last synced at: 25 days ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

chathumiamarasinghe/web-scraping

A versatile Python script for scraping data from websites. This script automates data extraction, processes the information, and saves it in a structured format like CSV. Ideal for data collection, research, and analysis tasks.

Language: Python - Size: 9.77 KB - Last synced at: 7 months ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

chouaib-629/WebScraping

A collection of web scraping projects using Beautiful Soup, Selenium, and mixed approaches. Each project includes Python scripts and CSV files of the scraped data. Perfect for learning and experimenting with static and dynamic web scraping techniques.

Language: Jupyter Notebook - Size: 106 KB - Last synced at: 5 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

Docutain/Docutain-SDK-Example-Flutter

Sample project showing how to integrate the Docutain Document Scanner SDK into a Flutter application.

Language: Dart - Size: 102 KB - Last synced at: 7 months ago - Pushed at: 12 months ago - Stars: 3 - Forks: 0

Docutain/Docutain-SDK-Example-Android-Java

Sample project showing how to integrate the Docutain Document Scanner SDK into an Android application (Java).

Language: Java - Size: 131 KB - Last synced at: 5 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 1

Dhruv-0001/Shoe-Hype

A shoe👟 recommendation website.

Language: Python - Size: 288 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 6 - Forks: 0

SamRB-dev/AutoSeekOut

A simple web scraping bot for scraping information from seekout.com written in Python and Selenium

Language: Python - Size: 11.7 MB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 0

amybui0910/remax_web_scraper

Python project that extracts data from remax.ca using BeautifulSoup and and requests libraries to scrape the website and stores the data into a csv file

Language: Python - Size: 15.6 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

domingosdeeulariadumba/thekidsrightsindex_statisticalmodeling

A Kids Rights Index Statistical Modeling

Language: Jupyter Notebook - Size: 5.86 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Docutain/docutain-sdk-example-react-native

Sample project showing how to integrate the Docutain Document Scanner SDK into a React Native application.

Language: TypeScript - Size: 585 KB - Last synced at: 7 months ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

divithraju/divith-raju-Customer-Sales-ETL-Pipeline

This ETL project was designed to demonstrate the development of a scalable data pipeline for customer sales analysis. It covers all essential steps, from data extraction to transformation and loading into a database, with Apache Airflow used.

Language: Python - Size: 7.81 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Reddi-Srija-R/Data-wrangling

Comprehensive Data Wrangling Techniques

Language: HTML - Size: 200 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

praneethsattavaram/BlackCoffer

Data Extraction from links in a Excel file

Language: Python - Size: 356 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Farooq710/Hospital-Mortality-Rate-Prediction

Predicts the rate of death in hospitals .

Size: 1.2 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

BilalAhmadKhanKhattak/LinkxDoctor

LinkxDoctor is a Python tool that scans a webpage to identify both broken and valid links. It provides a report on the link status, helping ensure all links on the page are functional. This Project Is Indeed The Successor Of One Of My Previous Projects, LinkxDoxer

Language: Python - Size: 13.7 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Docutain/Docutain-SDK-Example-Xamarin-Android

Sample project showing how to integrate the Docutain Document Scanner SDK into a Xamarin.Android application.

Language: C# - Size: 369 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

SwethaJoseph/Automate-API-Extraction

Automating the process of extracting data from APIs, appending new data to existing datasets and generating insightful visualizations

Language: Jupyter Notebook - Size: 156 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Docutain/Docutain-SDK-Example-Windows-WPF-.NET-Framework

Sample project showing how to integrate the Docutain SDK into a WPF application.

Language: C# - Size: 20.5 KB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

Docutain/Docutain-SDK-Example-Windows-Forms-.NET-Framework

Sample project showing how to integrate the Docutain SDK into a Windows Forms application.

Language: C# - Size: 17.6 KB - Last synced at: 4 months ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

Docutain/Docutain-SDK-Example-Xamarin-Forms

Sample project showing how to integrate the Docutain Document Scanner SDK into a Xamarin.Forms application.

Language: C# - Size: 1.41 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Aldosee/SQL-Covid-2024

Size: 313 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ogundele1/EXCEL-PROJECT

This project presents a SWOT analysis for Ailead Technology Online Hotel Booking business as she embarks on here journey toward global expansion.

Size: 14 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

vishmaria/gerencia-de-dados

Códigos e conteúdos importantes para extração e manipulação de dados na Web. Esses conteúdos foram desenvolvidos com auxílio das aulas da disciplina "Tópicos Especiais em Gerência de Dados", da UFSC.

Language: Python - Size: 8.79 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

weizhonzhen/FastEtl

简单的etl 支持跨数据库抽取数据库

Language: C# - Size: 18 MB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 22 - Forks: 9

MaemoonFarooq/Amazon-Dataset-Mining

The Frequent Dataset Mining project offers a comprehensive solution for mining frequent itemsets from the extensive Amazon dataset using Apache Kafka. Leveraging the power of distributed computing, this project employs two powerful algorithms, Apriori and PCY, to efficiently process and analyze large volumes of data.

Language: Python - Size: 19.5 KB - Last synced at: 29 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Pirimid/financial-documents-ocr-deep-learning

Language: Python - Size: 6.23 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 38 - Forks: 14

kingtroga/extract-expo-companies

Using Python scripts, I scraped 19 Expo websites, collecting 26,453 company names. I crafted tailored Google searches to find elusive company websites. My async Python script then captured essential details: Source, Company Name, Website, Contact Name(s), Email(s), Phone Number(s), and Social Media Accounts.

Size: 84 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

kingtroga/brian

This repository contains three web scrapers designed to extract specific data from various sources. These scrapers are tailored for different websites and are intended to be used for data collection and analysis.

Language: Python - Size: 44.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

shuddha2021/nodejs-crawler

A lightweight and efficient web crawler built with Node.js

Language: JavaScript - Size: 851 KB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

SubbulakshmiSN/Phonepe-Pulse-Data-Visualization-and-Exploration

Phonepe Pulse Data Visualization and Exploration: A User-Friendly Tool Using Streamlit and Plotly

Language: Jupyter Notebook - Size: 1.48 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

yc-wang00/verra-scaper

This project facilitates the extraction of document data from the Verra Verified Carbon Standard (VCS) Registry, an open database widely utilized by carbon credit traders.

Language: Python - Size: 1.44 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Op27/UNJobs-Selective-Extractor

UNJobs-Selective-Extractor automates the processes to find relevant UN job opportunities by collecting and filtering listings from the UNJobs website based on user criteria.

Language: Python - Size: 23.4 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

FaizanMohd5/Web-scraping-iPhone-11-Reviews

This is a web scraping project that extracts customer reviews for the iPhone 11 from Flipkart.com using Python and BeautifulSoup. The extracted data is saved in a CSV file for further analysis. Use it as a starting point for your own web scraping projects or for analyzing customer reviews of the iPhone 11.

Language: Jupyter Notebook - Size: 362 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

iambitttu/GitHub-User-Analytics-and-Recommendation-System

This project aims to collect data from GitHub users, store it in MongoDB, and create an analytics dashboard to understand users' technical strengths and weaknesses.

Language: Jupyter Notebook - Size: 8.79 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

itratjassani/Data-Extraction-from-Invoice-Images

A program has been developed to automate the process of extracting text and data from handwritten invoices, thereby improving efficiency and reducing errors associated with manual data entry, thereby benefiting many businesses.

Size: 4.88 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

swapnanildutta/instagram-search

I have used a python code to extract the details of a given username.

Language: Python - Size: 36.1 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 3

Tiwarijishiv/BizCardX-Extracting-Business-Card-Data-with-OCR

This project will require skills in image processing, OCR, GUI development, and database management. It will also require you to carefully design and plan the application architecture to ensure that it is scalable, maintainable, and extensible. Good documentation and code organization will also be important for this project.

Size: 0 Bytes - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0