Topic: "data-gathering"
fhamborg/news-please
news-please - an integrated web crawler and information extractor for news that just works
Language: Python - Size: 2.99 MB - Last synced at: 11 days ago - Pushed at: 2 months ago - Stars: 2,236 - Forks: 436

Cacti/cacti
Cacti ™
Language: PHP - Size: 265 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,725 - Forks: 418

Decodo/Decodo
HTTP(S)/SOCKS5 rotating residential proxies - code examples & general information.
Language: Java - Size: 320 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1,125 - Forks: 43

vil/H4X-Tools
Open source toolkit for scraping, OSINT and more.
Language: Python - Size: 3.43 MB - Last synced at: 8 days ago - Pushed at: 30 days ago - Stars: 432 - Forks: 47

lamthuyvo/social-media-data-scripts
Language: Python - Size: 2.08 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 242 - Forks: 87

OSINT-TECHNOLOGIES/dpulse
DPULSE - Tool for complex approach to domain OSINT
Language: Python - Size: 1.56 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 124 - Forks: 6

shadawck/glit
Retrieve all mails of users related to a git repository, a git user or a git organization
Language: Rust - Size: 266 MB - Last synced at: 1 day ago - Pushed at: about 1 year ago - Stars: 50 - Forks: 7

viralvaghela/Jwiki
Java tool to get wikipedia data
Language: Java - Size: 601 KB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 36 - Forks: 1

chrislicodes/Udacity-Data-Analyst-Nanodegree
Repository for the projects needed to complete the Data Analyst Nanodegree.
Language: Jupyter Notebook - Size: 93.1 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 34 - Forks: 22

lucaromagnoli/dataservice
Python async data gathering
Language: Python - Size: 630 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 11 - Forks: 0

DialRC/PortalAPI
Portal Tutorial
Language: Python - Size: 143 KB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 11 - Forks: 4

amirhossein-bayati/Scientist-Clustering-Index
Using kmeans algorithm for clustering Google scholar professors into three distinct clusters.
Language: Jupyter Notebook - Size: 153 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 8 - Forks: 0

darsan-in/Job-Crawler
The Job Crawler is an integral component of the Job RAID project, designed to automatically scrape and collect data from various job listing websites. This crawler enables Job RAID to aggregate comprehensive job listings, ensuring that users have access to up-to-date and relevant job opportunities.
Language: Python - Size: 6.83 MB - Last synced at: 11 days ago - Pushed at: 7 months ago - Stars: 5 - Forks: 0

sondosaabed/Oil-vs-BigTech-stock-investigation
💹📈Investigating the oils market prices in addition to the stock market prices between the start of 2001 to the end of 2023. 💰📉
Language: Jupyter Notebook - Size: 10.2 MB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 5 - Forks: 0

dbrennand/twitter-stream-bot-data-gatherer
An application to watch the Twitter stream and send accounts to the Botometer API for analysis. The results are stored in a SQLite database.
Language: Python - Size: 35.2 KB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 0

subahanii/COVID19-tracker
This is for Indian cases and data gathering from Indian governments site.
Language: HTML - Size: 23.7 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 5 - Forks: 0

BahramJannesar/CafebazzarWebsiteScraper
Data gathering from https://cafebazaar.ir
Language: Python - Size: 578 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 5 - Forks: 1

rdempsey/data-gathering-and-wrangling
Code and slides for my class: Data Gathering & Wrangling
Language: Python - Size: 24.6 MB - Last synced at: about 1 year ago - Pushed at: about 10 years ago - Stars: 5 - Forks: 11

speckly/sucorn
ML/DL dataset collection utilities
Language: Python - Size: 663 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 4 - Forks: 2

Corruptex/booru-dataset-gatherer
A .NET Core 3.1 Console application to gather tags and relevant information from Booru websites for Machine Learning.
Language: C# - Size: 59.6 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 1

calista-ai/crowdsourcing-app
A Web Application to collect data from pairwise image comparisons via crowdsourcing
Language: JavaScript - Size: 3.17 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 2

Yara-Aldajjani/Wrangle-and-Analyze-WeRateDogs-Data
This is Udacity's Data Analyst Nanodegree's 5th project; which is Wrangling and Analyzing WeRateDogs Twitter account.
Language: Jupyter Notebook - Size: 3.57 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 1

openpilot-community/opc-web
Seeking Maintainers. The official codebase for the openpilot community info portal.
Language: JavaScript - Size: 29.8 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 4 - Forks: 1

joedarby/AlcoSensing
An Android app for mobile sensing research
Language: Java - Size: 244 KB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 3 - Forks: 0

petermeissner/diffrprojects
Language: R - Size: 4.52 MB - Last synced at: 7 months ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

ZawszeBaka/face_recognition
Face Detection => Data Gathering => Training => Face Recognition
Language: Python - Size: 636 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

ctroller/rlfantasy
Custom Rocket League Fantasy League Stats Aggregator
Language: Java - Size: 27.3 KB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 0

kleinpa/keyboardtime
Foreground application logger for Windows
Language: Python - Size: 209 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 0

wwengm/findar
Financial Datareader
Language: Python - Size: 49.8 KB - Last synced at: about 1 month ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 1

iwansal64/instaf1nder-py
An open source Instagram profile lookup.
Language: Python - Size: 17.6 KB - Last synced at: 10 days ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

rafa-br34/MCSF
Minecraft Server Finder is a small toolkit which helps in finding Minecraft servers and tracking players using the "sample" parameter.
Language: Python - Size: 5.37 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

sorrychoe/covid19today
Today's World covid-19 Data Gathering Tool
Language: R - Size: 5.73 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

marcoshsq/Python_DSA
Minhas anotações e projetos do curso Fundamentos de Linguagem Python Para Análise de Dados e Data Science oferecido pela Data Science Academy de forma gratuita.
Size: 48.8 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

rhart-rup/Foursquare-Venue-Scraper-2023
Find all venues in a geographic area using the Foursquare API and collect extended Foursquare data on each venue.
Language: Jupyter Notebook - Size: 37.5 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

Coool/Cacti Fork of Cacti/cacti
Cacti ™ (Latvian and Russian translator contribution)
Language: PHP - Size: 165 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

TelRich/Web_Scrapping_with_BeautifulSoup-and-Wptool
Web scraping Webometrics and Wlkilpedia using Python (Beautiful soup and Wptools) to make a list of top 100 Universities in nigeria
Language: HTML - Size: 318 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

tarzahedi/imdb-scraper
Scraper for IMDBA top 1000 movies!
Language: Jupyter Notebook - Size: 13.7 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

alx3dev/ctfc 📦
Crypto to Fiat Currency data gathering
Language: Ruby - Size: 143 KB - Last synced at: 7 days ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

Factobotics/FlexHex-Rose-AP
ROSE-AP of the FlexHex project tries to make data gathering from Orion Context Broker entities to Influx-db easier.
Language: Python - Size: 23.4 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 2

SamAndPel/grab-stick
A set of python utilities to automatically exfiltrate system data.
Language: Python - Size: 39.1 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

nagen1/teamportal
Ideas, Collaborate and Data Analytics
Language: HTML - Size: 3.09 MB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

ranjithkumarravikumar52/master-thesis
A process to parse survey data to elastic search to perform data visualization on Kibana
Language: JavaScript - Size: 105 MB - Last synced at: 10 months ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 1

gondsm/bum
BUM (Bayesian User Model): A User Modelling Technique for Learning from Distributed Devices.
Language: Python - Size: 131 KB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 0

Cerberus-ik/rr-live
This is our entry for the 2017 Riot api challange.
Language: Java - Size: 2.63 MB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

letatthinh/gaming-profiles Fork of taramjacobsen/dssa5102finalproject
Final project in Data Gathering and Warehousing class
Language: Jupyter Notebook - Size: 1.32 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

ibrahimceyisakar/hotel-finder
Hotel finder system with Python includes data gathering, analyzing, and visualization.
Language: Python - Size: 30.3 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

DanielOladipupo/Earth-To-Home-Agricultural-Research-Analysis
An Insightful Analysis given to an Agricultural Start-Up Company in South Africa for Decision Making
Size: 10.7 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

iabdullah215/ReconPro
This tool performs a comprehensive security reconnaissance on a given domain, gathering information such as subdomains, SSL certificate details, open ports, HTTP headers, WHOIS data, and more. It generates a detailed JSON report of the findings for further analysis.
Language: Python - Size: 16.6 KB - Last synced at: 13 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

tampucci/nautilos-CS-public
Application to send data acquired through Citizen Science activities to Nautilos erddap data server
Language: JavaScript - Size: 602 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

garvitjain-02/Data-Gathering_DataScience
All Data Gathering Techniques are implemented here from scratch! Welcome to this Data Gathering techniques repository which is a comprehensive guide for anyone looking to understand how to gather data by various methods.
Language: Jupyter Notebook - Size: 3.23 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

grip-on-software/data-gathering
Modules used to gather data from different data sources in software development processes
Language: Python - Size: 2.15 MB - Last synced at: 11 days ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

AqueeqAzam/web-scraping-for-data-gathering-and-mining
Web scraping is used by data mining experts and hackers to imitate conventional browsers and visit websites by following their hypertext structure. They then extract HTML content and data according to predetermined settings and store the data in local databases.
Language: Jupyter Notebook - Size: 4.88 KB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

AyaFergany/Netflix-Project
Language: Jupyter Notebook - Size: 3.15 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

aadityasikder/Object-Detection-with-raspberry-pi-implementing-TinyML-models
Repository for Raspberry Pi-based object detection with TinyML models like TensorFlow Lite, PyTorch Nano, including data gathering, mAP evaluation, and image data preparation in Jupyter notebooks.
Language: Jupyter Notebook - Size: 35.7 MB - Last synced at: 10 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

nicolaswehmeyer/corpcrawler
A sales assistant for automatic information gathering on enterprise accounts
Language: Python - Size: 2.53 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

m-nanda/ilt-1
Language: Jupyter Notebook - Size: 1.19 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

dannyskillzz/TonyElumelu
This project presents an analysis of Mr. Tony Elumelu’s engagements on LinkedIn for the first half of 2023 (January to June, 2023).
Size: 3.77 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Osman-28/Coffee-Sales-EXCEL-ANALYSIS-
Data Analysis Poroject
Size: 391 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ankitgmishra/MachineLearning
Continuously deep diving in understanding & advancing my expertise in Machine Learning through ongoing education and hands on experience with practical learning.
Language: Jupyter Notebook - Size: 103 MB - Last synced at: 13 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mariannapalaia/london_bike_sharing
An end to end project from data gathering to data visualization using pandas library in Python and Tableau.
Language: Python - Size: 1.15 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mariannapalaia/coffee_sales_dashboard_excel
Dynamic Dashboard reporting Coffee Sales over Time, Coffee Sales by Country and Top 5 Customers leveraging dynamic Charts, Timeline and Slicers.
Size: 558 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Sadiq-marcelo/investigate-TMDB-movies-dataset
Investigate TMDB Movies Dataset Project
Language: Jupyter Notebook - Size: 3.29 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

kdvalin/ipmi-exporter
Exports IPMI sensor information to a CSV
Language: Python - Size: 16.6 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Dhrumi-Kansara-1/data-wrangling-audible-dataset
Data gathering, Assessing and cleaning on audible dataset
Language: Jupyter Notebook - Size: 38.1 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Sadiq-marcelo/twitter-WeRateDogs-data-analysis-project
Investigate Twitter WeRateDogs Data Analysis Project
Language: HTML - Size: 1.66 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

MohammadAnas5/Scrape-ambition-box
In this website Job seekers can write and share company reviews and interview stories
Language: Jupyter Notebook - Size: 8.79 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Dhrumi-Kansara-1/data-gathering-movies-dataset
Data gathering using api and web scrapping
Language: Jupyter Notebook - Size: 104 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Dhrumi-Kansara-1/data-wrangling-diabetes-patient-dataset
Applying data wrangling steps such as gathering data, assessing data and cleaning data on diabetes patient dataset
Language: Jupyter Notebook - Size: 92.8 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Am-I-Going-On-Holiday/CovidDashboardScraper
🧹 Python web scraper for the Coronavirus.data.gov.uk dashboard
Language: Python - Size: 38.1 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

manbau10/Data_wrangling
This report details the steps employed for wrangling the data used in the "WeRateDogs" project. The three steps in the wrangling phase of data analysis – gathering, assessing, and cleaning – were strictly followed.
Language: HTML - Size: 1.84 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

TomiJames/Wrangle-and-Analyze-Data
This repo contains files that show the different steps in data wrangling - gathering, assessment and cleaning. The use case is to wrangle data for WeRateDogs twitter account.
Language: HTML - Size: 1.72 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 1

panchis7u7/EMG_Data_Parser
GUI Client for recolecting EGM sensor data from a raspberry PI data emitter through serial comunication and parsing it to an available for format.
Language: C++ - Size: 39.1 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Shamoo100/Web-Automation-For-survey-responses
A project on Product Market Research
Language: Jupyter Notebook - Size: 8.71 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

yehia55/We-rate-dogs-data-wrangling
It was the project of the udacity data analysis nano degree, my job was to gather data from Twitter API programmatically with python then assess it and discover its quality and tidiness issues then clean these issues with pandas and finally, I did some analysis and visualization to this data to make insights of it
Language: Jupyter Notebook - Size: 1.62 MB - Last synced at: 11 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

Hazem0h/Twitter-Data-Wrangling-Udacity
This is a project in the Udacity Data professional Nanodegree, sponsored by fwd. In this project, I gathered data from different sources, assessed the data, and cleaned it. In the end, I created extracted some insights from that clean data, with visualizations.
Language: Jupyter Notebook - Size: 1.7 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

guptaharshnavin/WeRateDogs-Dog-Rating-Tweets-Analysis
Detailed Data Wrangling and Analysis of tweets of Dog Ratings from WeRateDogs.
Language: Jupyter Notebook - Size: 870 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

Yara-Aldajjani/Analyze_Online_Job_Postings
This repository has the Udacity data wrangling exercise, which helps us practice the wrangling process.
Language: Jupyter Notebook - Size: 13.7 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

abhi18av/acm-icpc-problems-aggregator
Language: Clojure - Size: 20.5 KB - Last synced at: 9 days ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

ParthThakur/Wrangle-WeRateDogs
Identifying and Cleaning issues found in the @WeRateDogs twitter archive.
Language: HTML - Size: 1.9 MB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

grctest/BOINC_Scripts
Language: Shell - Size: 18.6 KB - Last synced at: 2 months ago - Pushed at: over 8 years ago - Stars: 0 - Forks: 3
