An open API service providing repository metadata for many open source software ecosystems.

GitHub / RyanQuey / java-podcast-processor

A tool to find podcast metadata over an external api, store them, get their rss feeds and run ETL using Airflow, Kafka, Spark, and Cassandra. The particular Cassandra distribution used is Elassandra, which allows seamless integration with Elasticsearch. Displayed using a Gatsby app, served using Flask

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/RyanQuey%2Fjava-podcast-processor

Stars: 4
Forks: 0
Open issues: 4

License: None
Language: Java
Size: 1.94 MB
Dependencies parsed at: Pending

Created at: about 5 years ago
Updated at: over 3 years ago
Pushed at: over 2 years ago
Last synced at: about 2 years ago

Topics: airflow, cassandra, docker, elassandra, elasticsearch, gatsby, kafka, react, scala, searchkit, spark, zeppelin

    Loading...