GitHub topics: spark-datasource
StabRise/spark-pdf
PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr it
Language: Scala - Size: 5.72 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 49 - Forks: 3

rejeb/netcdf-spark-parser
Scala/Spark Netcdf for reading Netcdf files
Language: Scala - Size: 88.9 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

spark-root/laurelin
Allows reading ROOT TTrees into Apache Spark as DataFrames
Language: Java - Size: 934 KB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 10 - Forks: 4

miraisolutions/spark-bigquery
Google BigQuery data source for Apache Spark
Language: Scala - Size: 188 KB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 18 - Forks: 6
