GitHub / RyanQuey / java-podcast-processor
A tool to find podcast metadata over an external api, store them, get their rss feeds and run ETL using Airflow, Kafka, Spark, and Cassandra. The particular Cassandra distribution used is Elassandra, which allows seamless integration with Elasticsearch. Displayed using a Gatsby app, served using Flask
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/RyanQuey%2Fjava-podcast-processor
Stars: 4
Forks: 0
Open issues: 4
License: None
Language: Java
Size: 1.94 MB
Dependencies parsed at: Pending
Created at: about 5 years ago
Updated at: over 3 years ago
Pushed at: over 2 years ago
Last synced at: about 2 years ago
Topics: airflow, cassandra, docker, elassandra, elasticsearch, gatsby, kafka, react, scala, searchkit, spark, zeppelin