An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: dataextraction

feddelegrand7/ralger

ralger makes it easy to scrape a website. Built on the shoulders of titans: rvest, xml2.

Language: R - Size: 1.06 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 156 - Forks: 14

devnamdev2003/result_automation_system

The "RGPV Result Scraper" is a Python script that automates the extraction of student results from the Rajiv Gandhi Proudyogiki Vishwavidyalaya (RGPV) website. It handles captchas and saves data in CSV files, making it a valuable tool for academic record retrieval.

Language: Python - Size: 22.2 MB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

Docutain/Docutain-SDK-Example-Android-Kotlin

Sample project showing how to integrate the Docutain Document Scanner SDK into an Android application.

Language: Kotlin - Size: 132 KB - Last synced at: 14 days ago - Pushed at: 5 months ago - Stars: 11 - Forks: 1

gabrielianfr/web-scraping-project

A Python-based web scraping tool that extracts and stores data in JSON format using BeautifulSoup and Requests.

Language: Python - Size: 2.93 KB - Last synced at: 24 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

ashishkumar30/ML-AI-Python-Codes

Python various Important codes, Machine learning, NLP using Spacy and NLTK with Neural Network in ML

Language: Jupyter Notebook - Size: 14.1 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 28 - Forks: 7

kleindasash/Content-Grabber

Content Grabber is a powerful software for automatic data extraction from websites.

Size: 2.93 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

dimitryzub/py-google-scholar-organic-cite-to-csv-sqlite

Scrape historic Google Scholar Organic and Cite results to CSV, MySQL Lite using Python and SerpApi.

Language: Python - Size: 11.7 KB - Last synced at: 6 days ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 4

DhaaraniPushpam/twitter-x-data-extraction

Twitter data extraction using Selenium and twitter api key

Language: Jupyter Notebook - Size: 9.77 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

damnitjoshua/um-timeedit-timetable-toolkit

UM TimeEdit data toolkit for timetable software development.

Language: JavaScript - Size: 13.7 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

royalgaetan/Vscrape

⚡Take automation to the next level: create workflows, scrape the web while you sleep, extract data with AI, and export it in any format.

Size: 1.8 MB - Last synced at: about 20 hours ago - Pushed at: about 22 hours ago - Stars: 1 - Forks: 0

yumeangelica/jirai_sweeties

A friendly Discord bot with store monitoring capabilities - Tracks online stores for new items and price changes while providing chat commands and real-time notifications. Built with Python.

Language: Python - Size: 99.6 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

sravanigodavarthi/Gmail_to_Excel

This Python script allows you to extract specific email messages from your Gmail inbox, retrieve their subject and content, and save the data into an Excel file

Language: Python - Size: 22.5 KB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 0

oxylabs/Web-Scraping-With-Selenium

In this guide on how to web scrape with Selenium, we will be using Python 3. The code should work with any version of Python above 3.6

Language: Python - Size: 64.5 KB - Last synced at: 23 days ago - Pushed at: 2 months ago - Stars: 27 - Forks: 12

roslove44/web-scraping-toolkit

Un ensemble d'outils de web scraping pour extraire et analyser des données à partir de sites web spécifiques.

Language: Python - Size: 621 KB - Last synced at: 13 days ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

Bessouat40/prefect-github-indexer

A Prefect pipeline that periodically scrapes one or more GitHub repositories, generates embeddings, and indexes them in ChromaDB.

Language: Python - Size: 8.79 KB - Last synced at: 24 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

siddartha48/Portfolio-Covid-Data-Analysis-SQL-Server-

Data Exploration using SQL on MS SQL SERVER MGT on Covid data downloaded from https://ourworldindata.org/covid-deaths in CSV file

Size: 1000 Bytes - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

puleeno/meerkat-cloud-browser

Control Server Browser via Web API: Web Crawler, Data Extraction and Stream Proxy Server

Size: 37.1 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Thunderbit-HQ/ai-web-scraper

Thunderbit is an AI Web Scraper that replaces tedious copy-paste tasks for GTM teams.

Size: 677 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Nel-zi/Francine_store

Built a scalable data pipeline for Francine Stores, enabling them to extract, clean, and load data from Aliexpress for real-time market trend analysis and smarter business decisions.

Language: Jupyter Notebook - Size: 8.74 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

saivardhan08/My-Projects

In this repository you'll find projects that I have worked on for my own practice and upskilling

Size: 0 Bytes - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

VigilantiaFR/SIRETExtractor

Vigilantia SIRET Extractor | automatic scraper | input :{domain names} | output :{SIRET number}

Language: Python - Size: 45.9 KB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

chathumiamarasinghe/web-scraping

A versatile Python script for scraping data from websites. This script automates data extraction, processes the information, and saves it in a structured format like CSV. Ideal for data collection, research, and analysis tasks.

Language: Python - Size: 9.77 KB - Last synced at: 2 days ago - Pushed at: 7 months ago - Stars: 3 - Forks: 0

chouaib-629/WebScraping

A collection of web scraping projects using Beautiful Soup, Selenium, and mixed approaches. Each project includes Python scripts and CSV files of the scraped data. Perfect for learning and experimenting with static and dynamic web scraping techniques.

Language: Jupyter Notebook - Size: 106 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Docutain/Docutain-SDK-Example-.NET-MAUI

Sample project showing how to integrate the Docutain Document Scanner SDK into a .NET MAUI application.

Language: C# - Size: 488 KB - Last synced at: 14 days ago - Pushed at: 4 months ago - Stars: 5 - Forks: 0

Docutain/Docutain-SDK-Example-Flutter

Sample project showing how to integrate the Docutain Document Scanner SDK into a Flutter application.

Language: Dart - Size: 102 KB - Last synced at: 14 days ago - Pushed at: 5 months ago - Stars: 3 - Forks: 0

Docutain/Docutain-SDK-Example-Android-Java

Sample project showing how to integrate the Docutain Document Scanner SDK into an Android application (Java).

Language: Java - Size: 131 KB - Last synced at: 7 days ago - Pushed at: 5 months ago - Stars: 2 - Forks: 1

Dhruv-0001/Shoe-Hype

A shoe👟 recommendation website.

Language: Python - Size: 288 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 6 - Forks: 0

mydevground/DataAnalyticsEngineeringScienceVizBILab

A lab for DataAnalytics | DataEngineering | AnalyticsEngineering | DataScience | DataVisualization | BusinessIntelligence

Language: Jupyter Notebook - Size: 12.9 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

SamRB-dev/AutoSeekOut

A simple web scraping bot for scraping information from seekout.com written in Python and Selenium

Language: Python - Size: 11.7 MB - Last synced at: 20 days ago - Pushed at: 10 months ago - Stars: 7 - Forks: 0

Ismat-Samadov/data_edu_az

Web Scraping

Language: Jupyter Notebook - Size: 1.07 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

amybui0910/remax_web_scraper

Python project that extracts data from remax.ca using BeautifulSoup and and requests libraries to scrape the website and stores the data into a csv file

Language: Python - Size: 15.6 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

asc-csa/NEOSSAT_Tutorial

🛰 Ce tutoriel aide les utilisateurs à mieux comprendre, extraire et visualiser les données du télescope NEOSSAT. | 🛰 This tutorial helps users better understand, extract and visualize NEOSSAT telescope data.

Language: Jupyter Notebook - Size: 8.56 MB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 1

domingosdeeulariadumba/thekidsrightsindex_statisticalmodeling

A Kids Rights Index Statistical Modeling

Language: Jupyter Notebook - Size: 5.86 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

eneiromatos/the-home-depot-web-scraper

This web scraper is intended to extract data from The Home Depot Website, it could be run locally or in the Apify platform, the latter is the preferred way. It was made using Apify SDK V3 (Crawlee) with Typescript.

Language: TypeScript - Size: 196 KB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 9

Docutain/docutain-sdk-example-react-native

Sample project showing how to integrate the Docutain Document Scanner SDK into a React Native application.

Language: TypeScript - Size: 585 KB - Last synced at: 14 days ago - Pushed at: 7 months ago - Stars: 4 - Forks: 0

Ismat-Samadov/location_analytics

find best place for payment terminal

Language: HTML - Size: 9.61 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

divithraju/divith-raju-Customer-Sales-ETL-Pipeline

This ETL project was designed to demonstrate the development of a scalable data pipeline for customer sales analysis. It covers all essential steps, from data extraction to transformation and loading into a database, with Apache Airflow used.

Language: Python - Size: 7.81 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

Reddi-Srija-R/Data-wrangling

Comprehensive Data Wrangling Techniques

Language: HTML - Size: 200 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

praneethsattavaram/BlackCoffer

Data Extraction from links in a Excel file

Language: Python - Size: 356 KB - Last synced at: 8 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

Farooq710/Hospital-Mortality-Rate-Prediction

Predicts the rate of death in hospitals .

Size: 1.2 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

BilalAhmadKhanKhattak/LinkxDoctor

LinkxDoctor is a Python tool that scans a webpage to identify both broken and valid links. It provides a report on the link status, helping ensure all links on the page are functional. This Project Is Indeed The Successor Of One Of My Previous Projects, LinkxDoxer

Language: Python - Size: 13.7 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

Docutain/Docutain-SDK-Example-Xamarin-Android

Sample project showing how to integrate the Docutain Document Scanner SDK into a Xamarin.Android application.

Language: C# - Size: 369 KB - Last synced at: 27 days ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

SwethaJoseph/Automate-API-Extraction

Automating the process of extracting data from APIs, appending new data to existing datasets and generating insightful visualizations

Language: Jupyter Notebook - Size: 156 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

Docutain/Docutain-SDK-Example-Xamarin-iOS

Sample project showing how to integrate the Docutain Document Scanner SDK into a Xamarin.iOS application.

Language: C# - Size: 62.5 KB - Last synced at: 15 days ago - Pushed at: 11 months ago - Stars: 3 - Forks: 0

Docutain/Docutain-SDK-Example-Windows-WPF-.NET-Framework

Sample project showing how to integrate the Docutain SDK into a WPF application.

Language: C# - Size: 20.5 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

Docutain/Docutain-SDK-Example-iOS-Swift

Sample project showing how to integrate the Docutain Document Scanner SDK into an iOS application.

Language: Swift - Size: 11.6 MB - Last synced at: 7 days ago - Pushed at: 12 months ago - Stars: 3 - Forks: 1

Docutain/Docutain-SDK-Example-Windows-Forms-.NET-Framework

Sample project showing how to integrate the Docutain SDK into a Windows Forms application.

Language: C# - Size: 17.6 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Docutain/Docutain-SDK-Example-Xamarin-Forms

Sample project showing how to integrate the Docutain Document Scanner SDK into a Xamarin.Forms application.

Language: C# - Size: 1.41 MB - Last synced at: 26 days ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

Aldosee/SQL-Covid-2024

Size: 313 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

ogundele1/EXCEL-PROJECT

This project presents a SWOT analysis for Ailead Technology Online Hotel Booking business as she embarks on here journey toward global expansion.

Size: 14 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

vishmaria/gerencia-de-dados

Códigos e conteúdos importantes para extração e manipulação de dados na Web. Esses conteúdos foram desenvolvidos com auxílio das aulas da disciplina "Tópicos Especiais em Gerência de Dados", da UFSC.

Language: Python - Size: 8.79 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

weizhonzhen/FastEtl

简单的etl 支持跨数据库抽取数据库

Language: C# - Size: 18 MB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 22 - Forks: 9

MaemoonFarooq/Amazon-Dataset-Mining

The Frequent Dataset Mining project offers a comprehensive solution for mining frequent itemsets from the extensive Amazon dataset using Apache Kafka. Leveraging the power of distributed computing, this project employs two powerful algorithms, Apriori and PCY, to efficiently process and analyze large volumes of data.

Language: Python - Size: 19.5 KB - Last synced at: 30 days ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

Pirimid/financial-documents-ocr-deep-learning

Language: Python - Size: 6.23 MB - Last synced at: 10 months ago - Pushed at: about 4 years ago - Stars: 38 - Forks: 14

kingtroga/extract-expo-companies

Using Python scripts, I scraped 19 Expo websites, collecting 26,453 company names. I crafted tailored Google searches to find elusive company websites. My async Python script then captured essential details: Source, Company Name, Website, Contact Name(s), Email(s), Phone Number(s), and Social Media Accounts.

Size: 84 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

kingtroga/brian

This repository contains three web scrapers designed to extract specific data from various sources. These scrapers are tailored for different websites and are intended to be used for data collection and analysis.

Language: Python - Size: 44.2 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

shuddha2021/nodejs-crawler

A lightweight and efficient web crawler built with Node.js

Language: JavaScript - Size: 851 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

SubbulakshmiSN/Phonepe-Pulse-Data-Visualization-and-Exploration

Phonepe Pulse Data Visualization and Exploration: A User-Friendly Tool Using Streamlit and Plotly

Language: Jupyter Notebook - Size: 1.48 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

yc-wang00/verra-scaper

This project facilitates the extraction of document data from the Verra Verified Carbon Standard (VCS) Registry, an open database widely utilized by carbon credit traders.

Language: Python - Size: 1.44 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Op27/UNJobs-Selective-Extractor

UNJobs-Selective-Extractor automates the processes to find relevant UN job opportunities by collecting and filtering listings from the UNJobs website based on user criteria.

Language: Python - Size: 23.4 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

FaizanMohd5/Web-scraping-iPhone-11-Reviews

This is a web scraping project that extracts customer reviews for the iPhone 11 from Flipkart.com using Python and BeautifulSoup. The extracted data is saved in a CSV file for further analysis. Use it as a starting point for your own web scraping projects or for analyzing customer reviews of the iPhone 11.

Language: Jupyter Notebook - Size: 362 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

iambitttu/GitHub-User-Analytics-and-Recommendation-System

This project aims to collect data from GitHub users, store it in MongoDB, and create an analytics dashboard to understand users' technical strengths and weaknesses.

Language: Jupyter Notebook - Size: 8.79 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

itratjassani/Data-Extraction-from-Invoice-Images

A program has been developed to automate the process of extracting text and data from handwritten invoices, thereby improving efficiency and reducing errors associated with manual data entry, thereby benefiting many businesses.

Size: 4.88 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

swapnanildutta/instagram-search

I have used a python code to extract the details of a given username.

Language: Python - Size: 36.1 KB - Last synced at: 18 days ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 3

Tiwarijishiv/BizCardX-Extracting-Business-Card-Data-with-OCR

This project will require skills in image processing, OCR, GUI development, and database management. It will also require you to carefully design and plan the application architecture to ensure that it is scalable, maintainable, and extensible. Good documentation and code organization will also be important for this project.

Size: 0 Bytes - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Mariyajoseph24/SugarFit_Google_Play_Store_Review_Analysis_and_Power_BI_Reporting

"Utilized Python with Pandas, NumPy, and TensorFlow for data scraping and sentiment analysis in Microsoft Azure Data Studio. Employed MS Excel for data cleaning and exploration, with analysis done in PostgreSQL. Utilized Microsoft Power BI for visualization, deriving actionable insights."

Size: 111 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Saim-Akhtar/Stalker-Insta

An Instagram crawler for fetching a profile.

Language: Python - Size: 13.7 KB - Last synced at: 11 months ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 1

Rishi-Solanki07/SQL_Data_Extraction_Project

We Have Huge Data-Base In S.Q.L, Our Task is to Extract Some Data For Company(Advantureworks) and Present that data With our Presentation Skills, For more info Please Visit Read.me File

Size: 4.39 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

shubhamatkal/instagram-user-data-extraction

Used to extract data from instagram profile for analytics purpose

Language: Python - Size: 5.86 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 1

oussafik/Web-Scraping-RealEstate-Beautifulsoup

This is a Python project that uses BeautifulSoup and requests libraries to scrape real estate data from a website and store it in a database and a text file or a CSV file.

Language: Python - Size: 8.79 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

Ismat-Samadov/turbo_az

Data Scraping

Language: Jupyter Notebook - Size: 67 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Ismat-Samadov/bul_az

Language: Jupyter Notebook - Size: 3.26 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Ismat-Samadov/birja-in_az

Language: Python - Size: 241 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Ismat-Samadov/vipemlak_az

data extraction from vipemlak.az

Language: HTML - Size: 122 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Ismat-Samadov/yeniemlak_az

Data Scraping

Language: Jupyter Notebook - Size: 42.5 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

regain001/Data-Extraction-From-PDF

Language: Jupyter Notebook - Size: 548 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

80396-B2/Credit_Score_Prediction

Given a person’s credit-related information, I am building a Machine/Deep learning model that can classify the credit score.

Language: Jupyter Notebook - Size: 30 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

Ismat-Samadov/bina_az

data extraction from bina.az

Language: Jupyter Notebook - Size: 10.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

CJHydraGenZ/komik

this is web comic from data komikcash

Language: TypeScript - Size: 3.76 MB - Last synced at: 7 days ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 3

ankita-gondkar/Skin-Cancer-Diagnosis

This repository showcases a Convolutional Neural Network (CNN) module developed on Jupyter and an AutoML module implemented on Google Cloud Platform (GCP) through VertexAI

Language: Jupyter Notebook - Size: 4.88 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Ismat-Samadov/car_price_prediction

car data sacraping and price predicition

Language: Jupyter Notebook - Size: 27.7 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

firec0de/caffeine

Caffeine is a computer malware. Created it as a uni project and by the time it developed as my final diploma thesis

Language: Python - Size: 18.6 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 3

aiwithqasim/text_analysis

Data Extraction and text analysis

Language: Python - Size: 48 MB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 2

chatterjeeabhi/blackcoffer_datascience_test_task

Language: Jupyter Notebook - Size: 60.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

azambd/yellowpages-review

Scrape Reviews From yellowpages.com Profiles

Language: Python - Size: 8.79 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

NechbaMohammed/TechTerminologyAssist

TechTerminologyAssist is a web application that simplifies the understanding of technical terms in financial reports. It uses machine learning to extract technical terms, providing users with comprehensive definitions. This tool is invaluable for public finance controllers and auditors, making complex financial data more accessible

Language: HTML - Size: 64.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Manojpatil123/Data-extraction-and-text-analysis

The objective of this assignment is to extract textual data articles from the URL and perform text analysis to compute variables.

Language: Jupyter Notebook - Size: 95.7 KB - Last synced at: 12 months ago - Pushed at: about 2 years ago - Stars: 11 - Forks: 26

TheOwaisShaikh/woocommercescraper

Woocommerce Websites Scraper Without open ai and langchain

Language: Python - Size: 6.84 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

BryanMorfe/jpgextract

Extract All JPG images from a file.

Language: C - Size: 7.81 KB - Last synced at: 5 months ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

Yousseflayechi/python-web-scraping-project

Size: 2.93 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

sadhilchhabra/BizCardX_Extracting_Business_Card_Data_with_OCR

A Streamlit Application that uses optical character recognition (OCR) to read the information on business cards and classifying the data before extracting it into an SQL database.

Language: Python - Size: 1020 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Ismat-Samadov/emlak_az

Data Scraping

Language: Jupyter Notebook - Size: 17.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Ismat-Samadov/lalafo_az

Language: Jupyter Notebook - Size: 14.7 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Ismat-Samadov/unvan_az

Language: Jupyter Notebook - Size: 82.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Ismat-Samadov/rahatemlak_az

Web Scraping with Scrapy

Language: HTML - Size: 62.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Ismat-Samadov/arenda_az

Data Extraction From arenda_az

Language: Python - Size: 1.42 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Ismat-Samadov/ucuztap_az

Language: Python - Size: 9.35 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Ismat-Samadov/birja_com

Web Scraping

Language: HTML - Size: 58.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Ismat-Samadov/h2h_az

Data Scraping

Language: Python - Size: 685 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Ismat-Samadov/linkedin_com

Web Scraping

Language: Python - Size: 16.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0