Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: dask
dask/old-dask-yarn 📦
Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead
Language: Python - Size: 32.2 KB - Last synced: about 4 hours ago - Pushed: almost 7 years ago - Stars: 7 - Forks: 2
ibis-project/ibis
the portable Python dataframe library
Language: Python - Size: 71.8 MB - Last synced: about 3 hours ago - Pushed: about 5 hours ago - Stars: 4,237 - Forks: 525
bioio-devs/bioio-ome-zarr
A BioIO reader plugin for reading Zarr files in the OME format.
Language: Python - Size: 68.4 KB - Last synced: about 4 hours ago - Pushed: 2 days ago - Stars: 0 - Forks: 1
bioio-devs/bioio
Image reading, metadata management, and image writing for Microscopy images in Python
Language: Python - Size: 5.26 MB - Last synced: about 4 hours ago - Pushed: 2 days ago - Stars: 16 - Forks: 1
Vizzuality/cog_worker
Scalable arbitrary analysis on COGs
Language: Jupyter Notebook - Size: 33.1 MB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 26 - Forks: 1
xarray-contrib/flox
Fast & furious GroupBy operations for dask.array
Language: Python - Size: 1.65 MB - Last synced: about 10 hours ago - Pushed: about 10 hours ago - Stars: 116 - Forks: 15
pytroll/satpy
Python package for earth-observing satellite data processing
Language: Python - Size: 20.8 MB - Last synced: 1 day ago - Pushed: 2 days ago - Stars: 1,009 - Forks: 283
nebari-dev/nebari-docs
📖 Documentation for Nebari
Size: 42.9 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 14 - Forks: 21
chrimerss/RainfallCamera
Rainfall Camera
Language: Python - Size: 113 MB - Last synced: about 24 hours ago - Pushed: over 4 years ago - Stars: 4 - Forks: 1
donkomura/fsspec-chfs
fsspec implementations for CHFS
Language: Python - Size: 19.5 KB - Last synced: 1 day ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
jgrss/geowombat
GeoWombat: Utilities for geospatial data
Language: Jupyter Notebook - Size: 240 MB - Last synced: about 8 hours ago - Pushed: 1 day ago - Stars: 173 - Forks: 10
discovery-unicamp/dasf-core
Framework for computing Machine Learning algorithms in Python using Dask and RAPIDS AI.
Language: Python - Size: 16.7 MB - Last synced: about 13 hours ago - Pushed: 1 day ago - Stars: 9 - Forks: 1
capitalone/datacompy
Pandas and Spark DataFrame comparison for humans and more!
Language: Python - Size: 9.11 MB - Last synced: about 7 hours ago - Pushed: 1 day ago - Stars: 386 - Forks: 122
mohgavin/code-repository
Code Repository
Language: Jupyter Notebook - Size: 125 MB - Last synced: 1 day ago - Pushed: 2 days ago - Stars: 0 - Forks: 0
casangi/graphviper
Dask Based MapReduce for Multi Xarray Datasets.
Language: Python - Size: 2.24 MB - Last synced: about 4 hours ago - Pushed: 2 days ago - Stars: 1 - Forks: 0
basnijholt/adaptive-scheduler
Run many functions (adaptively) on many cores (>10k-100k) using mpi4py.futures, ipyparallel, loky, or dask-mpi. :tada:
Language: Python - Size: 931 KB - Last synced: about 5 hours ago - Pushed: 2 days ago - Stars: 26 - Forks: 9
nebari-dev/nebari
🪴 Nebari - your open source data science platform
Language: Python - Size: 15.1 MB - Last synced: about 7 hours ago - Pushed: about 13 hours ago - Stars: 257 - Forks: 86
fschuch/xcompact3d_toolbox
A set of tools for pre and postprocessing prepared for the high-order Navier-Stokes solver XCompact3d
Language: Python - Size: 17.8 MB - Last synced: 2 days ago - Pushed: 3 days ago - Stars: 8 - Forks: 5
geoxarray/geoxarray
Geolocation utilities for xarray
Language: Python - Size: 363 KB - Last synced: about 13 hours ago - Pushed: 3 days ago - Stars: 95 - Forks: 7
dask-contrib/dask-sql
Distributed SQL Engine in Python using Dask
Language: Python - Size: 3.33 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 366 - Forks: 70
umr-lops/xsarsea
scientific functions to compute radar or geophysical parameters from satellite images over ocean
Language: Python - Size: 1.67 MB - Last synced: 3 days ago - Pushed: about 1 month ago - Stars: 9 - Forks: 6
TimeEval/TimeEval
Evaluation Tool for Anomaly Detection Algorithms on Time Series
Language: Jupyter Notebook - Size: 24.8 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 73 - Forks: 13
ESDS-Leipzig/cubo
On-Demand Earth System Data Cubes (ESDCs) in Python
Language: Python - Size: 1.64 MB - Last synced: about 19 hours ago - Pushed: 9 days ago - Stars: 151 - Forks: 9
polyaxon/traceml
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.
Language: Python - Size: 118 MB - Last synced: 17 days ago - Pushed: 21 days ago - Stars: 492 - Forks: 43
jmcarpenter2/swifter
A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner
Language: Python - Size: 2.15 MB - Last synced: 3 days ago - Pushed: about 1 month ago - Stars: 2,466 - Forks: 101
Nixtla/mlforecast
Scalable machine 🤖 learning for time series forecasting.
Language: Python - Size: 27 MB - Last synced: 3 days ago - Pushed: 17 days ago - Stars: 720 - Forks: 68
raw-lab/mercat2
MerCat2: python code for versatile k-mer counting and diversity estimation for database independent property analysis for metaome data
Language: HTML - Size: 105 MB - Last synced: 4 days ago - Pushed: 5 days ago - Stars: 7 - Forks: 1
RichardScottOZ/xarray-notes
Notes on working with xarray
Size: 67.4 KB - Last synced: 4 days ago - Pushed: 5 days ago - Stars: 0 - Forks: 0
DataCanvasIO/HyperGBM
A full pipeline AutoML tool for tabular data
Language: Python - Size: 11 MB - Last synced: 3 days ago - Pushed: 2 months ago - Stars: 323 - Forks: 45
NVIDIA-Merlin/models
Merlin Models is a collection of deep learning recommender system model reference implementations
Language: Python - Size: 113 MB - Last synced: 4 days ago - Pushed: 5 days ago - Stars: 241 - Forks: 48
itamarst/eliot
Eliot: the logging system that tells you *why* it happened
Language: Python - Size: 1.91 MB - Last synced: about 9 hours ago - Pushed: 2 months ago - Stars: 1,087 - Forks: 65
shauryashaurya/learn-data-munging
Notes on Data Engineering with Pandas, PySpark, Dask, Ray, Arrow DataFusion, Polars etc.
Language: Jupyter Notebook - Size: 582 MB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 41 - Forks: 21
aurelienmorgan/french_text_sentiment
Sentiment Analysis in texts written in French language using Tensorflow/Keras (and using XGBoost for hyperparameters optimization)
Language: Python - Size: 21.9 MB - Last synced: 6 days ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0
elcorto/psweep
Loop like a pro, make parameter studies fun.
Language: Python - Size: 5.81 MB - Last synced: 7 days ago - Pushed: 8 days ago - Stars: 12 - Forks: 2
TDAmeritrade/stumpy
STUMPY is a powerful and scalable Python library for modern time series analysis
Language: Python - Size: 129 MB - Last synced: 7 days ago - Pushed: about 1 month ago - Stars: 2,986 - Forks: 282
umr-lops/xsar
Synthetic Aperture Radar (SAR) Level-1 GRD python mapper for efficient xarray/dask based processing
Language: Python - Size: 20 MB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 24 - Forks: 8
milesgranger/cluster-clyde
Python tool for launching EC2 clusters with AWS within your script/notebook
Language: Python - Size: 264 KB - Last synced: 8 days ago - Pushed: about 7 years ago - Stars: 0 - Forks: 1
miniufo/xgrads
Parse and read ctl and associated binary file commonly used by GrADS into xarray
Language: Jupyter Notebook - Size: 16.5 MB - Last synced: 1 day ago - Pushed: 8 months ago - Stars: 69 - Forks: 25
bioio-devs/bioio-base
Typing, base classes, and more for BioIO projects.
Language: Python - Size: 1.18 MB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 0 - Forks: 0
mapsacosta/htcdaskgateway
A Dask Gateway client extension for heterogeneous cluster mode combining the Kubernetes backend for pain-free scheduler networking, with COFFEA-powered HTCondor workers
Language: Python - Size: 67.4 KB - Last synced: about 2 hours ago - Pushed: about 19 hours ago - Stars: 2 - Forks: 2
nilsoncunha/nilsoncunha
Repositório para criação do readme personalizado e listagem dos meus projetos
Size: 29.3 KB - Last synced: 26 days ago - Pushed: 26 days ago - Stars: 0 - Forks: 0
dask-contrib/dask-awkward
Native Dask collection for awkward arrays, and the library to use it.
Language: Python - Size: 1.29 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 56 - Forks: 15
climate-service-center/index_calculator
Calculate climate indicators based on xclim
Language: Jupyter Notebook - Size: 113 MB - Last synced: 11 days ago - Pushed: about 2 months ago - Stars: 1 - Forks: 4
godaai/distributed-python
Python 分布式编程
Language: Jupyter Notebook - Size: 17.5 MB - Last synced: 11 days ago - Pushed: 11 days ago - Stars: 9 - Forks: 5
ranaroussi/pystore
Fast data store for Pandas time-series data
Language: Python - Size: 138 KB - Last synced: 10 days ago - Pushed: about 1 month ago - Stars: 539 - Forks: 97
Quansight/qhub-ops 📦
A tool for initialising and maintaining the state of QHub deployments on Digital Ocean, Amazon Web Services, and Google Cloud Platform
Language: Python - Size: 27.6 MB - Last synced: 12 days ago - Pushed: over 3 years ago - Stars: 8 - Forks: 4
Quansight/ibis-posts
Programs about the ibis sql productivity framework.
Language: Jupyter Notebook - Size: 25.7 MB - Last synced: 12 days ago - Pushed: about 4 years ago - Stars: 0 - Forks: 2
dask-contrib/dask-deltatable
A Delta Lake reader for Dask
Language: Python - Size: 249 KB - Last synced: 8 days ago - Pushed: 29 days ago - Stars: 42 - Forks: 13
fugue-project/fugue
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
Language: Python - Size: 6.3 MB - Last synced: 18 days ago - Pushed: 19 days ago - Stars: 1,866 - Forks: 92
JulianWgs/dask-log-server
Preserve all necessary runtime data of a Dask client in order to "replay" and analyze the performance and behavior of the client after the fact
Language: Python - Size: 289 KB - Last synced: 12 days ago - Pushed: over 3 years ago - Stars: 3 - Forks: 0
quantori/scip-dockingfactory
Docking Factory is a tool to automate molecular docking runs on an HPC cluster using the Dask framework. DockingFactory provides unified way of running molecular docking with different software backends: AutoDock Vina, Smina, Qvina2, and rDock.
Language: MATLAB - Size: 1.69 MB - Last synced: 13 days ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
casangi/astrohack
Antenna panel and position corrections.
Language: Python - Size: 77.4 MB - Last synced: about 4 hours ago - Pushed: about 19 hours ago - Stars: 9 - Forks: 3
mpes-kit/mpes
Distributed data processing routines for multidimensional photoemission spectroscopy (MPES)
Language: Python - Size: 27.5 MB - Last synced: 7 days ago - Pushed: over 1 year ago - Stars: 27 - Forks: 6
CoffeaTeam/coffea-casa
Repository with configuration setup of a prototype of analysis facility - "coffea-casa"
Language: Python - Size: 11.3 MB - Last synced: 13 days ago - Pushed: 13 days ago - Stars: 16 - Forks: 17
ratt-ru/dask-ms
Implementation of a dask/xarray dataset backed by a CASA MS
Language: Python - Size: 6.67 MB - Last synced: about 19 hours ago - Pushed: 16 days ago - Stars: 18 - Forks: 6
ORNL/flowcept Fork of renan-souza/flowcept
Runtime data integration system that empowers any data processing system to capture and query workflow provenance using data observability.
Language: Python - Size: 53.1 MB - Last synced: 13 days ago - Pushed: 13 days ago - Stars: 1 - Forks: 2
MITgcm/xmitgcm
Read MITgcm mds binary files into xarray
Language: Python - Size: 117 MB - Last synced: 2 days ago - Pushed: 3 months ago - Stars: 54 - Forks: 64
thewtex/ngff-zarr
A lean and kind Open Microscopy Environment (OME) Next Generation File Format (NGFF) Zarr implementation.
Language: Python - Size: 223 KB - Last synced: 13 days ago - Pushed: 14 days ago - Stars: 18 - Forks: 3
dask/dask
Parallel computing with task scheduling
Language: Python - Size: 66.7 MB - Last synced: 17 days ago - Pushed: 17 days ago - Stars: 11,971 - Forks: 1,665
beartell/anymlops
🐻❄️ Anymlops: A data science platform that literally works !
Language: HCL - Size: 13.2 MB - Last synced: 15 days ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
coiled/dask-snowflake
Dask integration for Snowflake
Language: Python - Size: 57.6 KB - Last synced: 13 days ago - Pushed: about 1 month ago - Stars: 28 - Forks: 7
dask/distributed
A distributed task scheduler for Dask
Language: Python - Size: 191 MB - Last synced: 18 days ago - Pushed: 18 days ago - Stars: 1,539 - Forks: 703
NCAR/ncar-python-tutorial 📦
Numerical & Scientific Computing with Python Tutorial
Language: Jupyter Notebook - Size: 49.4 MB - Last synced: 16 days ago - Pushed: about 4 years ago - Stars: 63 - Forks: 32
BrunoKreiner/cic-mc2
Testing SaturnCloud's GPU Cluster using Dask and PyTorch Parallelized training methods. Test model is a simple convolutional neural network.
Language: Jupyter Notebook - Size: 215 KB - Last synced: 17 days ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
saturncloud/workshop-dask-pytorch 📦
Introduction to Dask for PyTorch Workflows
Language: Jupyter Notebook - Size: 7.12 MB - Last synced: 17 days ago - Pushed: about 3 years ago - Stars: 13 - Forks: 1
saturncloud/workshop-lightgbm-dask
Saturn Cloud workshop on using LightGBM with Dask
Language: Jupyter Notebook - Size: 114 KB - Last synced: 17 days ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
pydata/xarray
N-D labeled arrays and datasets in Python
Language: Python - Size: 41.3 MB - Last synced: 18 days ago - Pushed: 18 days ago - Stars: 3,396 - Forks: 1,016
rapidsai/cudf
cuDF - GPU DataFrame Library
Language: C++ - Size: 134 MB - Last synced: 19 days ago - Pushed: 19 days ago - Stars: 7,236 - Forks: 829
Ouranosinc/xclim
Library of derived climate variables, ie climate indicators, based on xarray.
Language: Python - Size: 57.1 MB - Last synced: 18 days ago - Pushed: 20 days ago - Stars: 296 - Forks: 49
ratt-ru/codex-africanus
Radio Astronomy Algorithms Library
Language: Python - Size: 1.42 MB - Last synced: 23 days ago - Pushed: 23 days ago - Stars: 16 - Forks: 10
TGSAI/mdio-python
Cloud native, scalable storage engine for various types of energy data.
Language: Python - Size: 3.48 MB - Last synced: about 7 hours ago - Pushed: 1 day ago - Stars: 28 - Forks: 10
scipp/sciline
Build scientific pipelines for your data
Language: Python - Size: 2.16 MB - Last synced: 20 days ago - Pushed: 20 days ago - Stars: 7 - Forks: 1
xarray-contrib/xeofs
Comprehensive EOF analysis in Python with xarray: A versatile, multidimensional, and scalable tool for advanced climate data analysis
Language: Python - Size: 33.3 MB - Last synced: 19 days ago - Pushed: 20 days ago - Stars: 82 - Forks: 16
gjbex/Python-for-HPC
Repository for participants of the "Python for HPC" training
Language: Jupyter Notebook - Size: 6.63 MB - Last synced: 17 days ago - Pushed: 20 days ago - Stars: 30 - Forks: 18
carpentries-incubator/lesson-parallel-python
Parallel Programming in Python
Language: Python - Size: 5.12 MB - Last synced: 8 days ago - Pushed: about 1 year ago - Stars: 10 - Forks: 14
hi-primus/optimus
:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Language: Python - Size: 110 MB - Last synced: 17 days ago - Pushed: about 1 month ago - Stars: 1,441 - Forks: 233
NCAR/ncar-jobqueue
Utilities for configuring dask-jobqueue with appropriate settings for NCAR clusters
Language: Python - Size: 140 KB - Last synced: 16 days ago - Pushed: about 1 month ago - Stars: 12 - Forks: 2
aws-samples/amazon-sagemaker-local-mode
Amazon SageMaker Local Mode Examples
Language: Python - Size: 5.98 MB - Last synced: 21 days ago - Pushed: 21 days ago - Stars: 228 - Forks: 55
msoechting/lexcube
Lexcube: 3D Data Cube Visualization in Jupyter Notebooks
Language: TypeScript - Size: 4.6 MB - Last synced: 18 days ago - Pushed: about 1 month ago - Stars: 89 - Forks: 3
LDO-CERT/orochi
The Volatility Collaborative GUI
Language: JavaScript - Size: 35.7 MB - Last synced: 14 days ago - Pushed: 14 days ago - Stars: 189 - Forks: 17
treebeardtech/kubeflow-bootstrap
🪐 1-click Kubeflow using ArgoCD
Language: Shell - Size: 2.67 MB - Last synced: 24 days ago - Pushed: 24 days ago - Stars: 18 - Forks: 5
anovv/svoe
A scalable, declarative, low-code framework for real-time and batch feature calculation/management (quant finance, anomaly/fraud detection, etc.), predictive ML training/inference and simulation. Built on top of Ray
Language: Python - Size: 78.6 MB - Last synced: 15 days ago - Pushed: 3 months ago - Stars: 15 - Forks: 10
octoenergy/dask-remote 📦
Procurement: Dask Cluster as a Process.
Language: Python - Size: 389 KB - Last synced: 27 days ago - Pushed: almost 2 years ago - Stars: 5 - Forks: 0
aertslab/arboreto
A scalable python-based framework for gene regulatory network inference using tree-based ensemble regressors.
Language: Jupyter Notebook - Size: 63.9 MB - Last synced: 27 days ago - Pushed: 27 days ago - Stars: 45 - Forks: 24
bioio-devs/bioio-czi
A BioIO reader plugin for reading CZI files.
Language: Python - Size: 305 KB - Last synced: about 1 month ago - Pushed: 2 months ago - Stars: 0 - Forks: 0
pangeo-data/climpred
:earth_americas: Verification of weather and climate forecasts :earth_africa:
Language: Python - Size: 58.1 MB - Last synced: 18 days ago - Pushed: 30 days ago - Stars: 217 - Forks: 48
pytroll/pyresample
Geospatial image resampling in Python
Language: Python - Size: 16.4 MB - Last synced: about 1 month ago - Pushed: 2 months ago - Stars: 324 - Forks: 94
facultyai/lens
Summarise and explore Pandas DataFrames
Language: Python - Size: 229 KB - Last synced: about 1 month ago - Pushed: almost 4 years ago - Stars: 102 - Forks: 9
dymaxionlabs/dask-rasterio
Read and write rasters in parallel using Rasterio and Dask
Language: Python - Size: 813 KB - Last synced: 8 days ago - Pushed: over 3 years ago - Stars: 94 - Forks: 8
dask/dask-jobqueue
Deploy Dask on job schedulers like PBS, SLURM, and SGE
Language: Python - Size: 667 KB - Last synced: about 4 hours ago - Pushed: about 1 month ago - Stars: 230 - Forks: 137
CNES/zcollection
Python library allowing to manipulate data splited into a collection of groups stored in Zarr format.
Language: Python - Size: 640 KB - Last synced: 21 days ago - Pushed: about 2 months ago - Stars: 12 - Forks: 3
IsaacCheng9/machine-learning-in-chess
A final year project for the University of Exeter, using machine learning to study patterns in millions of chess games (~350 GB). Ranked 1st in the cohort for undergraduate projects (85%).
Language: Jupyter Notebook - Size: 1.27 GB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 1 - Forks: 1
Becksteinlab/Parallel-analysis-in-the-MDAnalysis-Library
Benchmarking MDAnalysis with Dask (and MPI). Supplementary Information for SciPy 2017 paper.
Language: Python - Size: 78.1 KB - Last synced: about 1 month ago - Pushed: over 6 years ago - Stars: 3 - Forks: 4
timkpaine/paperboy
A web frontend for scheduling Jupyter notebook reports
Language: Python - Size: 12.5 MB - Last synced: 19 days ago - Pushed: about 2 years ago - Stars: 248 - Forks: 26
splunk/deep-learning-toolkit
Deep Learning Toolkit for Splunk
Language: Python - Size: 15.4 MB - Last synced: 18 days ago - Pushed: about 1 month ago - Stars: 15 - Forks: 5
mariusvniekerk/dask-hivemetastore
Language: Python - Size: 122 KB - Last synced: about 1 month ago - Pushed: over 6 years ago - Stars: 1 - Forks: 1
ivanbgd/dask_demo_reins
A Dask library for Big Data processing in Python demo
Language: Python - Size: 11.7 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0
NucciTheBoss/DS330_final_project 📦
Visualizing some Spotify data!
Language: Jupyter Notebook - Size: 28.3 KB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0
AllenCellModeling/aicsimageio
Image Reading, Metadata Conversion, and Image Writing for Microscopy Images in Python
Language: Python - Size: 173 MB - Last synced: 23 days ago - Pushed: about 2 months ago - Stars: 190 - Forks: 49
vbprojects/ukraine_war_sentiment
Measuring how events shape discourse on twitter surrounding the Ukraine War in 2022 using piecewise exponential decay models.
Language: Jupyter Notebook - Size: 230 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0