Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

Package Usage: maven: org.apache.spark:spark-sql_2.12

The Apache Software Foundation provides support for the Apache community of open-source software projects. The Apache projects are characterized by a collaborative, consensus based development process, an open and pragmatic software license, and a desire to create high quality software that leads the way in its field. We consider ourselves not simply a group of projects sharing a server, but rather a community of developers and users.
33 versions
Latest release: over 54 years ago
895 dependent packages

View more package details: https://packages.ecosyste.ms/registries/repo1.maven.org/packages/org.apache.spark:spark-sql_2.12

View more repository details: https://repos.ecosyste.ms/hosts/GitHub/repositories/apache%2Fspark

Dependent Repos 2,673

RolandMa1986/bigdata 📦
  • 2.4.6 pom.xml

Size: 46.9 KB - Last synced: about 1 year ago - Pushed: about 1 year ago

ywcb00/systemds Fork of apache/systemds
Mirror of Apache SystemML
  • 3.2.0 pom.xml

Size: 296 MB - Last synced: 10 months ago - Pushed: 10 months ago

anirudhachal-db/mlflow Fork of mlflow/mlflow
Open source platform for the machine learning lifecycle
  • 3.0.0-preview mlflow/java/spark/pom.xml

Size: 72.7 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago

stage-tech/basestar Fork of basestar/basestar
Basestar
  • 3.0.1 basestar-bom/pom.xml
  • basestar-spark/pom.xml
  • basestar-spark-aws/pom.xml
  • basestar-spark-elasticsearch/pom.xml

Size: 8.98 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

logicalclocks/hudi Fork of apache/hudi
Upserts And Incremental Processing on Big Data
  • ${spark3.version} hudi-spark-datasource/hudi-spark3/pom.xml

Size: 59.4 MB - Last synced: 27 days ago - Pushed: 27 days ago

yahoojapan/yosegi-spark
  • 3.2.1 pom.xml

Size: 144 KB - Last synced: 6 days ago - Pushed: about 1 year ago

corepointer/systemds Fork of apache/systemds
Mirror of Apache SystemDS
  • 3.2.0 pom.xml

Size: 296 MB - Last synced: 3 months ago - Pushed: 3 months ago

tdermendjiev/dirigible Fork of eclipse/dirigible
Eclipse Dirigible™ Project
  • 3.1.1 ext/ext-api/api-spark/pom.xml

Size: 140 MB - Last synced: about 1 year ago - Pushed: about 1 year ago

andygrove/spark-examples
spark-examples
  • 3.0.1-SNAPSHOT casting-datatypes/pom.xml
  • 3.2.1 parquet-to-json/pom.xml

Size: 30.3 KB - Last synced: 10 months ago - Pushed: 10 months ago

fathollahzadeh/systemds Fork of apache/systemds
Apache SystemDS - A versatile system for the end-to-end data science lifecycle
  • 3.2.0 scripts/staging/SIMD-double-vectors/pom.xml
  • 3.2.0 pom.xml

Size: 311 MB - Last synced: 2 months ago - Pushed: 2 months ago

oap-project/raydp
RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.
  • 3.2.0 core/pom.xml

Size: 1.14 MB - Last synced: 21 days ago - Pushed: 21 days ago

LiliyaLazarova/dirigible Fork of eclipse/dirigible
Eclipse Dirigible™ Project
  • 3.1.1 ext/ext-api/api-spark/pom.xml

Size: 133 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

andreoss/etoile
ETL on Apache Spark
  • 3.0.1 pom.xml

Size: 528 KB - Last synced: about 1 year ago - Pushed: about 1 year ago

priyen/pinot Fork of apache/pinot
Apache Pinot (Incubating) - A realtime distributed OLAP datastore
  • 2.4.7 pinot-plugins/pinot-batch-ingestion/v0_deprecated/pinot-spark/pom.xml

Size: 270 MB - Last synced: 10 months ago - Pushed: 10 months ago

feemstr/TensorFlowOnSpark Fork of yahoo/TensorFlowOnSpark
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
  • 3.0.1 pom.xml

Size: 8.8 MB - Last synced: 9 months ago - Pushed: 9 months ago

slachiewicz/hudi Fork of apache/hudi
Upserts, Deletes And Incremental Processing on Big Data.
  • ${spark3.version} hudi-spark-datasource/hudi-spark3-common/pom.xml
  • ${spark31.version} hudi-spark-datasource/hudi-spark3.1.x/pom.xml

Size: 240 MB - Last synced: about 1 month ago - Pushed: 9 months ago

mridang/hudi Fork of apache/hudi
Upserts, Deletes And Incremental Processing on Big Data.
  • ${spark3.version} hudi-spark-datasource/hudi-spark3/pom.xml

Size: 32.5 MB - Last synced: 25 days ago - Pushed: 8 months ago

cement/ysh-CodeRecord
日常代码记录
  • 2.4.3 pom.xml

Size: 62.5 KB - Last synced: about 1 year ago - Pushed: over 1 year ago

yeqqmatlab/spark-learn
  • 2.4.0 pom.xml

Size: 494 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago

s3nt3/tispark Fork of pingcap/tispark
TiSpark is built for running Apache Spark on top of TiDB/TiKV
  • 3.0.2 spark-wrapper/spark-3.0/pom.xml
  • 3.1.1 spark-wrapper/spark-3.1/pom.xml

Size: 10.9 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago

kingrocy/javaer
java study project
  • 2.4.3 spark/pom.xml

Size: 709 KB - Last synced: 3 months ago - Pushed: 3 months ago

XJJ-YWJ/LogRealTime
基于spark的大数据日志实时分析项目
  • 2.4.4 pom.xml

Size: 31.1 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago

flow-lix/platform
  • 2.4.4 pom.xml

Size: 339 KB - Last synced: about 1 year ago - Pushed: over 1 year ago

lee-qinghua/Official-FlinkStudy
学习flink
  • 3.0.0 spark-review/pom.xml

Size: 7.25 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago

alonsoir/TestWithFrankKane
The material course from Frank Kane course from Udemy.
  • 3.0.1 pom.xml

Size: 9.81 MB - Last synced: 29 days ago - Pushed: over 1 year ago

SAtanasovv/dirigible Fork of eclipse/dirigible
Eclipse Dirigible™ Project
  • 3.1.1 ext/ext-api/api-spark/pom.xml

Size: 145 MB - Last synced: about 1 year ago - Pushed: about 1 year ago

chethanuk/deequ Fork of awslabs/deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
  • 3.2.1 pom.xml

Size: 69.2 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

rasmus93/pinot Fork of apache/pinot
Apache Pinot - A realtime distributed OLAP datastore
  • 2.4.7 pinot-plugins/pinot-batch-ingestion/v0_deprecated/pinot-spark/pom.xml

Size: 234 MB - Last synced: about 1 year ago - Pushed: about 1 year ago

nizarhejazi/pinot Fork of apache/pinot
Apache Pinot - A realtime distributed OLAP datastore
  • 2.4.7 pinot-plugins/pinot-batch-ingestion/v0_deprecated/pinot-spark/pom.xml

Size: 289 MB - Last synced: 4 months ago - Pushed: 4 months ago

cassiuscai/postfix-templates
  • 2.4.1 pom.xml

Size: 125 KB - Last synced: about 2 months ago - Pushed: about 2 months ago

fybrik/mover
Fybrik platform - Data Mover
  • 3.2.1 pom.xml

Size: 247 KB - Last synced: 8 days ago - Pushed: about 1 year ago

allwefantasy/mlsql-plugins
  • 3.1.1 pom.xml

Size: 251 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago

vmutafov/dirigible Fork of eclipse/dirigible
Eclipse Dirigible™ Project
  • 3.1.1 ext/ext-api/api-spark/pom.xml

Size: 160 MB - Last synced: 17 days ago - Pushed: 6 months ago

vpostrigan/study
  • 3.1.3 Java11/spark3_in_action_2021/pom.xml

Size: 355 MB - Last synced: 12 days ago - Pushed: 12 days ago

DepInjoy/geektime
  • 3.1.2 BigDataTraining/scala-project/pom.xml

Size: 20.1 MB - Last synced: 8 days ago - Pushed: 9 days ago

yhyyz/emr-serverless-example
emr-serverless-example
  • 3.2.0 pom.xml

Size: 4.88 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago

huangxiaopingRD/celeborn Fork of apache/celeborn
Celeborn provides an elastic and high-performance service for shuffle and spilled data.
  • 3.3.1 pom.xml

Size: 25.8 MB - Last synced: 2 days ago - Pushed: 2 days ago

ThulasitharanGT/SparkLearning
Learning JDBC, And big data concepts
  • 3.0.0 build.gradle

Size: 60.4 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

RedisLabs/spark-redis
A connector for Spark that allows reading and writing to/from Redis cluster
  • 3.2.1 pom.xml

Size: 2.18 MB - Last synced: 29 days ago - Pushed: 5 months ago

paashzj/java-dependency
  • 3.3.0 pom.xml

Size: 347 KB - Last synced: about 1 year ago - Pushed: about 1 year ago

oap-project/oap-mllib
Optimized Spark package to accelerate machine learning algorithms in Apache Spark MLlib.
  • 3.2.0 examples/als/pom.xml
  • 3.2.0 examples/correlation/pom.xml
  • 3.2.0 examples/kmeans/pom.xml
  • 3.2.0 examples/linear-regression/pom.xml
  • 3.2.0 examples/naive-bayes/pom.xml
  • 3.2.0 examples/pca/pom.xml
  • 3.2.0 examples/summarizer/pom.xml
  • 3.2.0 mllib-dal/pom.xml
  • 3.2.0 mllib-dal/pom.xml

Size: 31.9 MB - Last synced: about 2 months ago - Pushed: about 2 months ago

lethetann/iotdb Fork of apache/iotdb
Apache IoTDB
  • ${spark.version} spark-iotdb-connector/pom.xml

Size: 70.6 MB - Last synced: about 1 year ago - Pushed: about 1 year ago

tikv/migration
Migration tools for TiKV, e.g. online bulk load.
  • 3.0.2 pom.xml

Size: 8.34 MB - Last synced: 2 days ago - Pushed: 2 days ago

sinrimin/rocketmq-connect Fork of apache/rocketmq-connect
A tool for scalable and reliably streaming data between Apache RocketMQ and other systems.
  • 3.1.3 connectors/rocketmq-connect-deltalake/pom.xml

Size: 4.51 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

deepjavalibrary/djl-demo
Demo applications showcasing DJL
  • 3.0.1 aws/emr-distributed-inference/image-classification-gpu/build.gradle

Size: 15.4 MB - Last synced: 1 day ago - Pushed: 1 day ago

guochao521/SparkSQL
Add Test
  • 3.2.0 pom.xml

Size: 203 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago

bys-eric-he/com-hadoop-bigdata-demo
源码主要用于学习:1. Spring Boot+Hadoop+Hive+Hbase实现数据基本操作,Hive数据源使用Alibaba DruidDataSource,以及JDBCTemplate操作数据, Hbase使用hbase-client实现数据操作, API可视化界面集成Swagger-UI 2.9.2。2.引入Azkaban离线任务调度,实现Hive数据分层ETL过程,并结合Sqoop实现数据从Hive同步到MySQL操作。3. 引入Kafka消息服务,实现前端日志收集,将消息接收到的数据包持久化到Hive ODS原始数据层。4. 通过SpringBoot API方式提供可视化数据访问服务。
  • 2.4.5 com-hadoop-spark-demo/pom.xml

Size: 603 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago

sq-q/hudi-spark-utilities-plus
hudi-spark-utilities-plus
  • boxer-excel/pom.xml
  • 3.1.1 pom.xml

Size: 850 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago

xicm/hudi Fork of apache/hudi
Upserts, Deletes And Incremental Processing on Big Data.
  • ${spark3.version} hudi-spark-datasource/hudi-spark3-common/pom.xml
  • ${spark31.version} hudi-spark-datasource/hudi-spark3.1.x/pom.xml
  • ${spark32.version} hudi-spark-datasource/hudi-spark3.2.x/pom.xml
  • ${spark3.version} hudi-spark-datasource/hudi-spark3.2plus-common/pom.xml
  • ${spark33.version} hudi-spark-datasource/hudi-spark3.3.x/pom.xml

Size: 379 MB - Last synced: 8 days ago - Pushed: 9 days ago

mounish3loq/Grafana-Java-API
  • 2.4.5 pom.xml

Size: 433 KB - Last synced: about 1 year ago - Pushed: over 1 year ago

heedojung92/mvc_newswebproject
  • 3.0.0-preview pom.xml

Size: 6.97 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

pipeline-foundation/spark Fork of dotnet/spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
  • 3.0.0 src/scala/microsoft-spark-3-0/pom.xml
  • 3.1.1 src/scala/microsoft-spark-3-1/pom.xml
  • 3.2.0 src/scala/microsoft-spark-3-2/pom.xml

Size: 2.99 MB - Last synced: about 1 year ago - Pushed: about 1 year ago

alessioVnt/SABD_Spark_Project
  • 2.4.1 pom.xml

Size: 7.45 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago

dominik-lenda/docker-kafka-spark-stream
  • 3.4.1 spark_app/pom.xml

Size: 70.4 MB - Last synced: 6 months ago - Pushed: 6 months ago

apache/doris
Apache Doris is an easy-to-use, high performance and unified analytics database.
  • fe/fe-core/pom.xml
  • 2.4.6 fe/pom.xml
  • fe/spark-dpp/pom.xml

Size: 696 MB - Last synced: 20 days ago - Pushed: 20 days ago

apoorvanand/spark-sample
  • 3.0.0 pom.xml

Size: 15.6 KB - Last synced: 15 days ago - Pushed: over 1 year ago

mlflow/mlflow
Open source platform for the machine learning lifecycle
  • 3.0.0-preview mlflow/java/spark/pom.xml

Size: 405 MB - Last synced: about 2 hours ago - Pushed: about 2 hours ago

databrickslabs/geoscan
Geospatial clustering at massive scale
  • 3.2.1 pom.xml

Size: 2.44 MB - Last synced: 3 months ago - Pushed: 12 months ago

GoogleCloudDataproc/hive-bigquery-connector
A library enabling BigQuery as Hive storage handler
  • ${spark.version} shaded-sparksql/pom.xml

Size: 781 KB - Last synced: 20 days ago - Pushed: 20 days ago

binaim/cs523-finalproject-main
  • 3.3.2 pom.xml

Size: 31.7 MB - Last synced: 3 months ago - Pushed: 7 months ago

kangkaisen/starrocks Fork of StarRocks/starrocks
StarRocks is a next-gen sub-second MPP database for full analysis senarios, including multi-dimensional analytics, real-time analytics and ad-hoc query, formerly known as DorisDB.
  • fe/fe-core/pom.xml
  • 2.4.6 fe/pom.xml
  • fe/spark-dpp/pom.xml

Size: 245 MB - Last synced: 3 months ago - Pushed: 3 months ago

shirly121/GraphScope Fork of alibaba/GraphScope
GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba
  • analytical_engine/java/grape-graphx/pom.xml
  • 3.1.3 analytical_engine/java/pom.xml
  • 3.1.1 interactive_engine/pom.xml

Size: 117 MB - Last synced: about 1 month ago - Pushed: about 1 month ago

314649558/bigdate_learn
大数据学习案例
  • 3.0.1 bigdata_parent/bigdata_spark/pom.xml

Size: 1.58 MB - Last synced: about 1 year ago - Pushed: over 1 year ago

tieredblocks/iotdb Fork of apache/iotdb
Apache IoTDB
  • ${spark.version} spark-iotdb-connector/pom.xml

Size: 65.4 MB - Last synced: about 1 year ago - Pushed: about 1 year ago

sergiod31/repositorioSudoku
  • 3.0.1 pom.xml

Size: 52.7 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago

Vanivannan/Spark-practice
learn spark code
  • 3.0.1 pom.xml

Size: 13.2 MB - Last synced: 9 months ago - Pushed: 9 months ago

NVIDIA/spark-rapids-examples
A repo for all spark examples using Rapids Accelerator including ETL, ML/DL, etc.
  • 3.1.2 examples/ML+DL-Examples/Spark-cuML/pca/pom.xml
  • 3.2.0 examples/UDF-Examples/Spark-cuSpatial/pom.xml
  • 3.1.1 examples/XGBoost-Examples/pom.xml

Size: 9.84 MB - Last synced: 15 days ago - Pushed: 20 days ago

cobbleacademy/hdpsmoke
HDP Smoke Test utils
  • 3.1.0 delake/pom.xml

Size: 51.2 MB - Last synced: about 1 month ago - Pushed: about 1 month ago

graalsystems/examples
  • spark-examples/pom.xml
  • 3.1.1 spark-examples/pom.xml

Size: 2.43 MB - Last synced: 8 days ago - Pushed: 3 months ago