An open API service providing repository metadata for many open source software ecosystems.

Topic: "modin"

modin-project/modin

Modin: Scale your Pandas workflows by changing a single line of code

Language: Python - Size: 51 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 10,263 - Forks: 666

aws/aws-sdk-pandas

pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).

Language: Python - Size: 15.6 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 4,057 - Forks: 715

jmcarpenter2/swifter

A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner

Language: Python - Size: 2.15 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 2,611 - Forks: 104

ray-project/xgboost_ray

Distributed XGBoost on Ray

Language: Python - Size: 472 KB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 149 - Forks: 36

drshahizan/Python-big-data

Python and Pandas are known to have issues around scalability and efficiency. You will learn how to use libraries such as Modin, Dask, Ray, Vaex etc to overcome the problems faced by Pandas.

Language: Jupyter Notebook - Size: 107 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 126 - Forks: 67

intel/hdk šŸ“¦

A low-level execution library for analytic data processing.

Language: C++ - Size: 66.3 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 30 - Forks: 14

CangyuanLi/checkedframe

Lightweight, engine-agnostic dataframe validation

Language: Python - Size: 2.85 MB - Last synced at: 13 days ago - Pushed at: about 1 month ago - Stars: 17 - Forks: 0

mzjp2/kedro-dataframe-dropin

A Kedro plugin that provides pandas dropin replacements for the pandas datasets (e.g modin and cuDF)

Language: Python - Size: 516 KB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 12 - Forks: 1

unum-cloud/udsb

Unlimited Data-Science Benchmarks for Numeric, Tabular and Graph Workloads

Language: Jupyter Notebook - Size: 3.57 MB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 1

gandalf1819/NYCOpenData-Profiling-Analysis

Open Data Profiling, Quality and Analysis on NYC OpenData dataset with semantic profiling using fuzzy ratio, Levenshtein distance and regex

Language: Jupyter Notebook - Size: 17.9 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 6 - Forks: 4

PratikDavidson/intel-oneAPI-LLM

oneAPI Hackathon: The LLM Challenge

Language: Jupyter Notebook - Size: 648 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 1

SebastianMahecha/data_engineering

Generate data, load and download parquets to GCP and compare performance between Pandas, Polars and Modin with Python

Language: Python - Size: 27.3 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

oneapi-src/ai-structured-data-generation šŸ“¦

AI Starter Kit to generate structured synthetic data using IntelĀ® Distribution of Modin

Language: Python - Size: 692 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

comprakash/delta-transformation-pipeline

A transformation pipeline for Delta Lake using AWS SDK for Pandas

Language: Python - Size: 85 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

jacobceles/Movie-Recommendation-Rating-Prediction

Using the MovieLens dataset with Surprise to compare different algorithms for rating prediction, and also create a movie recommendation system on top of it.

Language: Jupyter Notebook - Size: 3.83 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

bhattbhavesh91/modin-example

Simple example on how Modin can peed up your Pandas workflows by changing a single line of code

Language: Jupyter Notebook - Size: 19.5 KB - Last synced at: 9 days ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 1

tdiprima/polars-vs-pandas

Polars does it better

Language: Jupyter Notebook - Size: 244 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

AndreBFarias/python-etl-mailing-automation

Um pipeline de ETL modular para automação de mailing com deduplicação, enriquecimento e otimização de performance.

Language: Python - Size: 136 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

AAWorks/options-pricing

Global Markets Options Pricing

Language: Python - Size: 138 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

prehensilecode/sge_accounting_stats

Simple stats on SGE accounting data

Language: Python - Size: 63.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ivanbgd/bioinf_demo

A Bioinformatics demo in Python working with FASTQ files and using the Modin library

Language: Python - Size: 27.3 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

hariprasath-v/Intel_oneAPI_Hackerearth_Predict-the-quality-of-freshwater

Build a machine model to predict whether the freshwater is safe to drink or not.Based on the measures like pH, TDS, etc.

Language: HTML - Size: 5.83 MB - Last synced at: 6 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

adrianmarino/recommendation-system-approaches

Recommendation system approaches

Language: Jupyter Notebook - Size: 34.1 MB - Last synced at: 6 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0