Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: data-extraction

Rakib-Hasan-Rahad/Web-Scraping-with-python

Language: Jupyter Notebook - Size: 242 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

AnalystHub-Hub/IBM-Data-Science-Professional-Certificate

I learnt data science through hands-on practice in the IBM Cloud using real data science tools and real-world data sets.

Language: Jupyter Notebook - Size: 14.3 MB - Last synced: 2 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

JanhaviGadge/Data-Analysis-Projects

This repository contains notebooks on various datasets as a practice on data analysis, all notebooks include: Data Cleaning. Data Visualization. Exploratory Data Analysis.

Language: Jupyter Notebook - Size: 7.35 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 0 - Forks: 0

ADVAIT135/Forage_BCGX_Gen_AI_Virtual_Job_Simulation-

This Repository consists of all the Jupyter Notebook (.ipynb) files, python files, excel sheets which are a part of the BCGX's Gen AI Virtual Job Simulation that is hosted on Forage.

Language: Jupyter Notebook - Size: 38.1 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

The-Nebula-Developers/Complex-Parser

Complex Parser is a powerful Python package designed to streamline the process of data extraction from JSON-like structures while also enriching the extracted data with synonym retrieval capabilities.

Language: Python - Size: 37.1 KB - Last synced: 18 days ago - Pushed: 3 months ago - Stars: 4 - Forks: 0

Revanthkasinathan/Data-Migration-and-Transformation

This project entails migrating and transforming data, with a key focus on extracting data, storing it in AWS, and then pushing it to a database table.

Language: Python - Size: 8.79 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

matthewmuccio/SimpleDataExtraction

A simple example of implementating data extraction, built with Python, an SQLite back-end, and the MVC design pattern.

Language: Python - Size: 4.88 KB - Last synced: 3 months ago - Pushed: almost 6 years ago - Stars: 0 - Forks: 0

lykmapipo/Python-Spark-Log-Analysis

Python scripts to process, and analyze log files using PySpark.

Language: Python - Size: 109 KB - Last synced: about 2 months ago - Pushed: 5 months ago - Stars: 3 - Forks: 0

Yashmenaria1/Data-Extraction-and-Text-Analysis

This project involves extracting information from various links and analyzing text to derive insights, patterns, and trends.

Language: Jupyter Notebook - Size: 112 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

sypht-team/sypht-elixir-client

An Elixir client for the Sypht API https://sypht.com

Language: Elixir - Size: 47.9 KB - Last synced: 3 months ago - Pushed: over 4 years ago - Stars: 6 - Forks: 0

im-dpaul/Wikipedia-Scraper

Scrape Wikipedia data based on user-input topic using Python. Extracts content, saves to text file. Utilizes requests, BeautifulSoup.

Language: Jupyter Notebook - Size: 338 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

nopperl/corporate_emission_reports

Language: TeX - Size: 937 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 0 - Forks: 0

alhankeser/tcx-extract

Speed-optimized data extractor for .tcx (Garmin) files.

Language: Zig - Size: 1.74 MB - Last synced: 23 days ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

SergeTouvoli/simple_email_extractor

Language: PHP - Size: 9.77 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

Rishi-gupta-data/Data-extraction

# Stock-market Data Extraction using Python

Language: Jupyter Notebook - Size: 491 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

guykiper/FootballInsights

FootballInsights uses web scraping and data science to extract football stats from fbref.com and analyze data to reveal insights for predicting key events, identifying player similarities, recognizing value metrics, and showcasing advanced football analytics for teams and players through interactive visualizations.

Language: Python - Size: 50.3 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

DishaK06/Online-Store-Sales-Analysis

Analysis of an online store indicating its KPIs such as Sales, Profit, Total Customers, Orders, Units Sold, etc

Size: 639 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

molybdenum-99/infoboxer

Wikipedia information extraction library

Language: Ruby - Size: 8.17 MB - Last synced: 23 days ago - Pushed: 3 months ago - Stars: 173 - Forks: 16

spmohara/P4-Search

An intuitive GUI-based Python application allowing a user to effortlessly search files on Perforce Helix Core utilizing specific search patterns.

Language: Python - Size: 87.9 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

Diggernaut/diggernaut-meta-lang-docs

Diggernaut meta language documentation

Language: HTML - Size: 156 KB - Last synced: about 2 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

spmohara/Filter-Trace

An intuitive GUI-based Python application allowing a user to easily extract data from a file based on specific keywords to generate a focused output file.

Language: Python - Size: 9.31 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

sorrychoe/covid19today

Today's World covid-19 Data Gathering Tool

Language: R - Size: 5.73 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 1 - Forks: 0

ppatrzyk/filmweb-export

Eksport danych z serwisu filmweb

Language: Python - Size: 363 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 12 - Forks: 2

ksm26/Functions-Tools-and-Agents-with-LangChain

Explore Functions, Tools and Agents with LangChain along with LangChain Expression Language

Language: Jupyter Notebook - Size: 1.65 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 1

JonathanLink/PDFLayoutTextStripper

Converts a pdf file into a text file while keeping the layout of the original pdf. Useful to extract the content from a table in a pdf file for instance. This is a subclass of PDFTextStripper class (from the Apache PDFBox library).

Language: Java - Size: 21.1 MB - Last synced: 3 months ago - Pushed: 6 months ago - Stars: 1,524 - Forks: 204

steno-aarhus/ukbAid

Aid Steno Researchers Who Work on the UKB RAP.

Language: R - Size: 10.7 MB - Last synced: 5 months ago - Pushed: 6 months ago - Stars: 6 - Forks: 2

manjuq/LinkedIn-Job-Applicant-Scraper

LinkedIn Job Applicant Scraper: A Python-based web scraper using Selenium to extract applicant information from LinkedIn profiles, facilitating automated data gathering for recruitment processes

Language: Jupyter Notebook - Size: 6.84 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

fcoagz/xtweet

xtweet es una biblioteca de Python para interactuar con la API de Twitter.

Language: Python - Size: 6.84 KB - Last synced: 29 days ago - Pushed: 10 months ago - Stars: 2 - Forks: 0

Bisaloo/xlcutter

Parse Batches of 'xlsx' Files Based on a Template

Language: R - Size: 4.5 MB - Last synced: 24 days ago - Pushed: 4 months ago - Stars: 6 - Forks: 0

berntpopp/table-harvester

Effortlessly extract data from HTML tables and convert them into structured CSV files.

Language: JavaScript - Size: 458 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

101rakibulhasan/polyprolang

Explore, create, and share algorithm implementations in multiple programming languages within a unified '.prolang' file format. Enhance your coding skills, collaborate with diverse teams, and compare language performance in a single, versatile platform.

Language: C++ - Size: 71.3 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

AryanVBW/Exif

ExifTool is a powerful command-line tool that can be used to extract and edit metadata in a wide range of media files, including images, audio, and video. Metadata is information that is stored within a file that describes the file’s content or other attributes.

Language: Python - Size: 43.9 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 10 - Forks: 3

ranayalcink/selenium_web_scraper

This project employed Selenium for web scraping on Glassdoor, extracting key job insights. The Python script, utilizing Pandas, dynamically navigated pages, extracting details. XPath usage, dynamic scraping, and Chrome options configuration streamlined the process.

Language: Python - Size: 8.42 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

kurai-sx/Data-Extraction-and-Sentiment-Analysis-using-NLP

In this repository, you will be able to get how to extract text from the title and content from any article. Also using this extractede data to define the sentiment of the sentence.

Language: Jupyter Notebook - Size: 163 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

shdev/phpflashtext

Extract Keywords from sentence or Replace keywords in sentences. @ https://github.com/vi3k6i5/flashtext

Language: PHP - Size: 1.21 MB - Last synced: 17 days ago - Pushed: almost 5 years ago - Stars: 19 - Forks: 5

prashver/end-to-end-image-scraper

This project is a streamlined Streamlit web app for easy image scraping from Google Images. Enter your search query, fetch, and download images locally in a zip file. Simple setup and customization for tailored results.

Language: Python - Size: 10.2 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

ahmedmujtaba1/Python-Projects

There are numerous sources of source code available for scraping various types of social media platforms and websites.

Language: Jupyter Notebook - Size: 28.4 MB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 4 - Forks: 1

rabbittx/Digikala-Crawler

Digikala Crawlerیک خزنده وب قدرتمند برای جمع‌آوری و تحلیل داده‌های دیجی‌کالا است. این ابزار به تجار و تحلیلگران بازار کمک می‌کند تا به بینش‌های دقیقی از رفتار بازار دست یابند، شامل استخراج داده‌های فروشندگان، محصولات و تحلیل قیمت. مناسب برای تقویت استراتژی‌های بازاریابی و فروش

Language: Python - Size: 6.27 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

vansh-py04/BlackCoffer-Data-Extraction-and-Text-Analysis-

The objective of this assignment is to extract textual data articles from the given URL and perform text analysis to compute variables that are explained

Language: Jupyter Notebook - Size: 105 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

kiranndeep/Python_Web_Scrapping

This is a flipkart web scrapping project using python, that allows users to retrieve datasets of the products listed on flipkart for the given product.

Language: Python - Size: 20.5 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

InovaFiscaliza/rfdatahub Fork of ronaldokun/anateldb

This repository agregate and serve Telecommunication and Radio Difusion data from ANATEL and various Aeronautics APIs

Language: Jupyter Notebook - Size: 89.5 MB - Last synced: 19 days ago - Pushed: 20 days ago - Stars: 0 - Forks: 0

shraddhaROCKS/Data_extraction_and_Text_analysis_for_blackcoffer

The objective of this assignment is to extract textual data articles from the given URL and perform text analysis to compute variables that are explained

Language: Jupyter Notebook - Size: 91.8 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 0

MrHacker-X/OsintifyX

OsintifyX: Powerful Open-source OSINT tool for extracting valuable information from Instagram profiles.

Language: Python - Size: 3.83 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 32 - Forks: 2

JamesClarke7283/github_scraper

Efficient Rust library for dynamic GitHub data extraction without API: Scrape repository stats, user contributions, issues, and more with Selenium WebDriver support. Ideal for data analysis, GitHub research, and Rust-based web scraping projects.

Language: Rust - Size: 8.79 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

brunodifranco/project-star-jeans-data-engineering

ETL building for an e-commerce Jeans company. Feel free to access the Streamlit App in the link below.

Language: Jupyter Notebook - Size: 178 KB - Last synced: about 1 month ago - Pushed: 9 months ago - Stars: 1 - Forks: 1

Aries-Amit/Projects

SQL

Size: 21.7 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

Kaleeswari-S/BizCardX-Extracting-Business-Card-Data-with-OCR

BizCardX is a powerful Python-based tool designed for seamlessly extracting valuable information from business cards through the application of Optical Character Recognition (OCR).

Language: Python - Size: 16.6 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

arkutils/arkutils-website

The source for the arkutils website, home of a few Ark: Survival Evolved tools.

Language: Svelte - Size: 4.64 MB - Last synced: about 1 month ago - Pushed: 6 months ago - Stars: 12 - Forks: 1

trangdangberlin/PDF_to_CSV_converter

Convert a PDf file using Python

Language: Jupyter Notebook - Size: 4.88 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

yukiyuqichen/His-Geo

A library to extract historical toponyms from texts, geocode and visualize the results on maps.

Language: Jupyter Notebook - Size: 25 MB - Last synced: 5 days ago - Pushed: 5 months ago - Stars: 2 - Forks: 0

attogram/justrefs

Just Refs - extract just the references and related topics from any page on the English Wikipedia

Language: PHP - Size: 244 KB - Last synced: about 1 month ago - Pushed: about 4 years ago - Stars: 15 - Forks: 0

Kaleeswari-S/Phonepe-Pulse-Data-Visualization-and-Exploration

A live geo visualization streamlit dashboard that displays information and insights from the Phonepe pulse Github repository in an interactive and visually appealing manner.

Language: Python - Size: 73.2 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

rz0012/Data-Extraction-from-Fitbit

Extraction of Fitbit data, analysis, and prediction.

Language: Jupyter Notebook - Size: 521 KB - Last synced: 4 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

teocci/py-pdf-ingester-public

Discover the essentials of PDF file processing with py-pdf-ingester. This Python application is designed to handle PDF files, ensuring security and legal compliance. Participants will learn installation procedures, Python coding, and usage of the py-pdf-ingester tool.

Language: Python - Size: 7.81 KB - Last synced: about 1 month ago - Pushed: 5 months ago - Stars: 1 - Forks: 0

ROBROICH/SAP_AND_COMMON_DATA_MODEL_DEMO

This demo describes the basic integration between S/4HANA and the Microsoft Common Data Model (Model)

Size: 4.24 MB - Last synced: about 1 month ago - Pushed: about 4 years ago - Stars: 16 - Forks: 2

asc-csa/LEAD-Rover-Data-Tutorial

🌔 Ce tutoriel montre comment extraire et visualiser les données d'un rover (le rover JUNO de l'ASC) de la mission de Déploiement d'analogues pour l'exploration lunaire. | 🌔 This tutorial demonstrates how to extract and visualize rover data (CSA's JUNO Rover) from the Lunar Exploration Analogue Deployment Mission.

Language: Jupyter Notebook - Size: 12.4 MB - Last synced: 23 days ago - Pushed: 4 months ago - Stars: 1 - Forks: 2

lfhohmann/wordle-ETL

ETL for Wordle game

Language: Jupyter Notebook - Size: 157 KB - Last synced: 5 months ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

lfhohmann/quordle-ETL

ETL for quordle game (4 words version of Wordle)

Language: Jupyter Notebook - Size: 93.8 KB - Last synced: 5 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 1

zyang1611/wiki_summarizer

Web app that provides a 10 sentence summary of any Wikipedia page

Language: Python - Size: 6.84 KB - Last synced: 5 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

zyang1611/stock_comments

Analysis of comment frequency of users in an online forum in relation to stock price movement

Language: Python - Size: 13.5 MB - Last synced: 5 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

1Jaffry1/Proprofs-Format-to-Aiken-Format

Extract data from a Proprofs style excel sheet and create a text file in Aiken Format for MCQ style quizzes

Language: Python - Size: 5.86 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

vikrantpagare/Web-Article-Sentiment-Analysis

Sentiment Analysis of given list of Web-Articles using Python | BeautifulSoup

Language: Jupyter Notebook - Size: 108 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0

gautam132002/yellow-page-scraper-it

Extract business data from Pagine Gialle (https://www.paginegialle.it/) effortlessly. Get contact info, addresses, and more with our user-friendly interface. Empower your business with valuable insights.

Language: Python - Size: 102 MB - Last synced: 4 months ago - Pushed: 11 months ago - Stars: 1 - Forks: 0

4tocall/File-Collector-Script

The File Collector script is a command-line utility designed to collect and extract content from specified files within a directory tree.

Language: Python - Size: 31.3 KB - Last synced: 5 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

johnbumgarner/newspaper3_usage_overview

This repository provides usage examples for the Python module Newspaper3k.

Language: Python - Size: 121 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 120 - Forks: 17

lykmapipo/NYC-TLC-Trip-Data

Python scripts to download, process, and analyze NYC TLC trip data

Language: Jupyter Notebook - Size: 89.3 MB - Last synced: about 2 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0

Eakta08/Text-Analysis

Extracting textual data articles from the given URL (webscraping using goose3) and performing text analysis

Language: Jupyter Notebook - Size: 146 KB - Last synced: 4 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

johnbumgarner/newshound

This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around the world in over 50 languages.

Size: 28.3 KB - Last synced: 3 days ago - Pushed: about 1 year ago - Stars: 29 - Forks: 3

akifislam/Complex-PDF-MCQ-Scraper

A Script to Analyze thousands of complex PDFs with text, tables, graphs and input them in a xls file within seconds.

Language: Python - Size: 917 KB - Last synced: about 2 months ago - Pushed: over 1 year ago - Stars: 1 - Forks: 2

vemonet/csvw-ontomap

🗺️ ️Generate CSVW metadata for tabular data files, and map columns to terms in a given OWL ontology using semantic search

Language: Python - Size: 119 KB - Last synced: about 2 months ago - Pushed: 3 months ago - Stars: 1 - Forks: 1

imubit/pi-pbook-data-extractor 📦

ProcessBook applet for extracting historical data from PI Server

Size: 395 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 6 - Forks: 4

SimoMrdjen/PSF_RestApp

App for fetching excel files, extracting and checking extracted data. After validation app is responsible for persistent data in DB.

Language: Java - Size: 96.2 MB - Last synced: 5 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

youssef-attai/extract-emails-from-pdfs

CLI utility for my job's BS because I can't take it anymore

Language: Python - Size: 6.84 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

JoaoHenriqueRX7/ETL--Data-Scrapping-Python-MySQL-

A Python-based, automates data extraction, transformation, and loading. It focuses ETL pipelines, web scrapping and MySQL database, leveraging Python libraries for processing and MySQL for storage.

Language: Jupyter Notebook - Size: 21.5 KB - Last synced: 4 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 0

Revanthkasinathan/Youtube-Data-Harvesting-And-Warehousing

This project is a Streamlit application which is focussing on ETL concept. It is particularly designed to provide users with seamless access and analysis of data from multiple YouTube channels.

Language: Python - Size: 13.7 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

DatapaloozaCO/search-engines-scraper Fork of Juanchobanano/Search-Engines-Scraper

This library empowers developers to effortlessly query popular search engines, including Google, Bing, Yahoo, DuckDuckGo, and more. With features like multi-engine support, output flexibility, search filters, and proxy compatibility, it's a versatile solution for diverse search applications.

Language: Python - Size: 210 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

mhucka/taupe

Taupe takes a downloaded Twitter archive ZIP file, extracts the URLs corresponding to tweets, retweets, replies, quote tweets, and liked tweets, and outputs the results in a comma-separated values (CSV) format that you can use with other software tools.

Language: Python - Size: 176 KB - Last synced: 25 days ago - Pushed: about 1 year ago - Stars: 27 - Forks: 1

WaizKhan7/SmartMuv

An EVM-compatible Solidity Smart Contract Storage/Slot Analyzer and Data Extractor.

Language: Python - Size: 213 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 12 - Forks: 2

P-stha12/Form-Data-Extraction-And-Visualization

Extraction of Data from Form 1040, Form 990 and Insurance Certificate, and their Visualizations

Language: Jupyter Notebook - Size: 3.97 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 1

Chek0rrdn/DataEngineer_ETL

A project structure for doing and sharing data engineer work.

Language: Python - Size: 3.91 KB - Last synced: 6 months ago - Pushed: over 2 years ago - Stars: 1 - Forks: 0

Moha-cm/BizCardX

BizCardX: Extracting Business Card Data with OCR

Language: Python - Size: 805 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

JayUnitTest/multinational-retail-data-centralisation10

Centralized data handling for multinational retail operations. Python-based project with PostgreSQL, AWS, and data analysis tools.

Language: Python - Size: 20.5 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

matevz-spacapan/IEPS

Projects done for the subject Web information extraction and retrieval where the tasks were related to web crawlers, data extraction and web search.

Language: CSS - Size: 65.1 MB - Last synced: 6 months ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

MOUHASSINE-badreddine/MoroccanHousing-ETL

Moroccan housing data pipeline using scrapy, mongodb , zyte and digitalocean cloud

Language: Python - Size: 30.3 KB - Last synced: 6 months ago - Pushed: over 1 year ago - Stars: 12 - Forks: 0

gayathri-pan/rateGain

Welcome to my solution for the web scraping hackathon! In this challenge, I developed a program using Python and the Scrapy library to extract specific information from the "https://rategain.com/blog" webpage.

Language: Python - Size: 21 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

uhh-lt/newsleak

Information extraction and interactive visualization of textual datasets for investigative data-driven journalism and eDiscovery

Language: Java - Size: 116 MB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 52 - Forks: 15

Proggleb/youtube_data_engineering_project

Data Engineering Project: Extracting music video metrics of Twice using YouTube API, AWS, and Tableau

Language: Python - Size: 10.8 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 1 - Forks: 0

elmezianech/Food-Delivery-Time-Prediction

This machine learning project focused on predicting food delivery times. The code emphasizes essential tasks such as data cleaning, feature engineering, categorical feature encoding, data splitting, and standardization to establish a solid foundation for building a robust predictive model.

Language: Jupyter Notebook - Size: 138 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

sjuanati/demonic-tutor

EVM blockchain data extraction tool, enabling automated event querying, contract call data retrieval, and conversion of dates to block numbers, with CSV export functionality

Language: Python - Size: 316 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 1 - Forks: 0

deadbits/trs

🔭 Threat report analysis via LLM and Vector DB

Language: Python - Size: 1.29 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 2 - Forks: 0

mansi-k/Social-Champ

Performed data retrieval from Facebook and Twitter feeds. Filtered and ranked social influencers, considering various aspects, who could help in promoting NGOs.

Language: PHP - Size: 1.9 MB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

ricardolsmendes/gcp-documentai-custom-extractors

Custom data extractors that use Google Cloud's Document AI

Size: 28.9 MB - Last synced: 4 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

blalop/bbva2pandas

Extract the data from your BBVA's monthly statements

Language: Python - Size: 120 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 5 - Forks: 2

webmiddle/webmiddle

Node.js framework for modular web scraping and data extraction

Language: JavaScript - Size: 2.53 MB - Last synced: 12 days ago - Pushed: over 1 year ago - Stars: 14 - Forks: 2

sypht-team/sypht-python-client

A python client for the Sypht API

Language: Python - Size: 164 KB - Last synced: 7 months ago - Pushed: 8 months ago - Stars: 163 - Forks: 4

adil6572/houzz-scraper

Python web scraper built on Scrapy and BeautifulSoup for extracting business information from websites on Houzz.com and storing it in a CSV file.

Language: Python - Size: 744 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

bilalhameed248/Delay-Reason-Extraction-Model

Efficient Delay Reason Extraction in Patient Appointments/Treatments Using BERT and Tensorflow. - Feb 2022 - Jun 2023

Language: Python - Size: 2.93 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

bilalhameed248/Patient-Most-Recent-Treatments-Similarity-Measuring

BioBert Enhanced Patient Treatment Similarity Analysis with Sentence Transformers

Language: Jupyter Notebook - Size: 19.5 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

Onurkekec0/Open-Ports-visualization

With this project, you see the visualization of open ports around the world on a map.

Language: HTML - Size: 74.2 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

desininja/voice-disorder

Data Science project. ML algorithms to detect voice disorders.

Language: Jupyter Notebook - Size: 3.61 MB - Last synced: 7 months ago - Pushed: almost 4 years ago - Stars: 4 - Forks: 4