An open API service providing repository metadata for many open source software ecosystems.

GitHub / data-han 1 Repository

Non-tech background but passionate about learning new skills & technologies: Spark, Big Data, Cloud, Airflow...

data-han/great_expectations Fork of great-expectations/great_expectations

Always know what to expect from your data.

Language: Python - Size: 195 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

data-han/twitter_kafka_sentiment

Ingesting real-time Twitter API using tweepy into Kafka and process using Apache Spark Structured Streaming with Sentiment Analysis TextBlob before loading into time-series database, InfluxDB and monitoring dashboard, Grafana

Language: Python - Size: 1.89 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 2

data-han/datahub Fork of datahub-project/datahub

The Metadata Platform for the Modern Data Stack

Size: 1.06 GB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

data-han/dataeng_test Fork of jaabberwocky/dataeng_test

Data Engineering Test

Size: 268 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0