Topic: "modin"
modin-project/modin
Modin: Scale your Pandas workflows by changing a single line of code
Language: Python - Size: 51 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 10,263 - Forks: 666

aws/aws-sdk-pandas
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Language: Python - Size: 15.6 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 4,057 - Forks: 715

jmcarpenter2/swifter
A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner
Language: Python - Size: 2.15 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 2,611 - Forks: 104

ray-project/xgboost_ray
Distributed XGBoost on Ray
Language: Python - Size: 472 KB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 149 - Forks: 36

drshahizan/Python-big-data
Python and Pandas are known to have issues around scalability and efficiency. You will learn how to use libraries such as Modin, Dask, Ray, and Vaex to overcome the problems faced by Pandas.
Language: Jupyter Notebook - Size: 107 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 126 - Forks: 67

intel/hdk
A low-level execution library for analytic data processing.
Language: C++ - Size: 66.3 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 30 - Forks: 14

CangyuanLi/checkedframe
Lightweight, engine-agnostic dataframe validation
Language: Python - Size: 2.85 MB - Last synced at: 13 days ago - Pushed at: about 1 month ago - Stars: 17 - Forks: 0

mzjp2/kedro-dataframe-dropin
A Kedro plugin that provides drop-in replacements for the pandas datasets (e.g. Modin and cuDF)
Language: Python - Size: 516 KB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 12 - Forks: 1

unum-cloud/udsb
Unlimited Data-Science Benchmarks for Numeric, Tabular and Graph Workloads
Language: Jupyter Notebook - Size: 3.57 MB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 1

gandalf1819/NYCOpenData-Profiling-Analysis
Open Data Profiling, Quality and Analysis on NYC OpenData dataset with semantic profiling using fuzzy ratio, Levenshtein distance and regex
Language: Jupyter Notebook - Size: 17.9 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 6 - Forks: 4

PratikDavidson/intel-oneAPI-LLM
oneAPI Hackathon: The LLM Challenge
Language: Jupyter Notebook - Size: 648 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 1

SebastianMahecha/data_engineering
Generate data, load and download parquets to GCP and compare performance between Pandas, Polars and Modin with Python
Language: Python - Size: 27.3 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

oneapi-src/ai-structured-data-generation
AI Starter Kit to generate structured synthetic data using Intel® Distribution of Modin
Language: Python - Size: 692 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

comprakash/delta-transformation-pipeline
A transformation pipeline for Delta Lake using AWS SDK for Pandas
Language: Python - Size: 85 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

jacobceles/Movie-Recommendation-Rating-Prediction
Using the MovieLens dataset with Surprise to compare different algorithms for rating prediction, and also create a movie recommendation system on top of it.
Language: Jupyter Notebook - Size: 3.83 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

bhattbhavesh91/modin-example
Simple example of how Modin can speed up your Pandas workflows by changing a single line of code
Language: Jupyter Notebook - Size: 19.5 KB - Last synced at: 9 days ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 1

tdiprima/polars-vs-pandas
Polars does it better
Language: Jupyter Notebook - Size: 244 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

AndreBFarias/python-etl-mailing-automation
A modular ETL pipeline for mailing automation with deduplication, enrichment, and performance optimization.
Language: Python - Size: 136 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

AAWorks/options-pricing
Global Markets Options Pricing
Language: Python - Size: 138 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

prehensilecode/sge_accounting_stats
Simple stats on SGE accounting data
Language: Python - Size: 63.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ivanbgd/bioinf_demo
A Bioinformatics demo in Python working with FASTQ files and using the Modin library
Language: Python - Size: 27.3 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

hariprasath-v/Intel_oneAPI_Hackerearth_Predict-the-quality-of-freshwater
Build a machine learning model to predict whether freshwater is safe to drink, based on measures such as pH and TDS.
Language: HTML - Size: 5.83 MB - Last synced at: 6 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

adrianmarino/recommendation-system-approaches
Recommendation system approaches
Language: Jupyter Notebook - Size: 34.1 MB - Last synced at: 6 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0
