Topic: "data-processing"
pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Language: Python - Size: 132 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 24,595 - Forks: 363

onceupon/Bash-Oneliner
A collection of handy Bash One-Liners and terminal tricks for data processing and Linux system maintenance.
Size: 919 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 10,423 - Forks: 628

johnkerl/miller
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Language: Go - Size: 201 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 9,279 - Forks: 222

TomWright/dasel
Select, put and delete data from JSON, TOML, YAML, XML and CSV files with a single tool. Supports conversion between formats and can be used as a Go package.
Language: Go - Size: 8.56 MB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 7,448 - Forks: 146

NVIDIA/DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Language: C++ - Size: 394 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 5,383 - Forks: 634

deepseek-ai/smallpond
A lightweight data processing framework built on DuckDB and 3FS.
Language: Python - Size: 1.77 MB - Last synced at: 5 days ago - Pushed at: 2 months ago - Stars: 4,605 - Forks: 409

unionai-oss/pandera
A light-weight, flexible, and expressive statistical data testing library
Language: Python - Size: 4.09 MB - Last synced at: 3 days ago - Pushed at: 6 days ago - Stars: 3,784 - Forks: 333

dashbitco/broadway
Concurrent and multi-stage data ingestion and data processing with Elixir
Language: Elixir - Size: 718 KB - Last synced at: 4 days ago - Pushed at: 11 days ago - Stars: 2,513 - Forks: 166

asyml/texar
Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/
Language: Python - Size: 13.6 MB - Last synced at: 28 days ago - Pushed at: over 3 years ago - Stars: 2,388 - Forks: 372

microsoft/DialoGPT
Large-scale pretraining for dialogue
Language: Python - Size: 43.6 MB - Last synced at: 4 days ago - Pushed at: over 2 years ago - Stars: 2,383 - Forks: 347

numaproj/numaflow
Kubernetes-native platform to run massively parallel data/streaming jobs
Language: Go - Size: 38.8 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,864 - Forks: 131

bytewax/bytewax
Python Stream Processing
Language: Python - Size: 12 MB - Last synced at: 4 days ago - Pushed at: about 2 months ago - Stars: 1,726 - Forks: 79

python-bonobo/bonobo
Extract Transform Load for Python 3.5+
Language: Python - Size: 1.46 MB - Last synced at: 29 days ago - Pushed at: almost 2 years ago - Stars: 1,590 - Forks: 145

GoogleCloudPlatform/data-science-on-gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Language: Jupyter Notebook - Size: 6.51 MB - Last synced at: 28 days ago - Pushed at: about 2 months ago - Stars: 1,383 - Forks: 721

allenai/dolma
Data and tools for generating and inspecting OLMo pre-training data.
Language: Python - Size: 62.8 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1,190 - Forks: 133

pyper-dev/pyper
Concurrent Python made simple
Language: Python - Size: 462 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 1,185 - Forks: 24

cocoindex-io/cocoindex
ETL framework to make your data AI-ready, with real-time incremental updates and support for custom logic that snaps together like Lego.
Language: Rust - Size: 6.88 MB - Last synced at: about 24 hours ago - Pushed at: about 24 hours ago - Stars: 1,088 - Forks: 66

NVIDIA/NeMo-Curator
Scalable data preprocessing and curation toolkit for LLMs
Language: Jupyter Notebook - Size: 7.73 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 900 - Forks: 125

microsoft/GODEL
Large-scale pretrained models for goal-directed dialog
Language: Python - Size: 49.8 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 869 - Forks: 112

GoogleCloudPlatform/DataflowJavaSDK 📦
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Size: 12.9 MB - Last synced at: 10 days ago - Pushed at: over 4 years ago - Stars: 857 - Forks: 320

asyml/texar-pytorch
Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/
Language: Python - Size: 3.08 MB - Last synced at: 1 day ago - Pushed at: about 3 years ago - Stars: 745 - Forks: 115

hstreamdb/hstream
HStreamDB is an open-source, cloud-native streaming database for IoT and beyond. Modernize your data stack for real-time applications.
Language: Haskell - Size: 6.28 MB - Last synced at: 29 days ago - Pushed at: 5 months ago - Stars: 722 - Forks: 55

benibela/xidel
Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
Language: Pascal - Size: 2.09 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 720 - Forks: 42

jofpin/synthBTC
A tool that uses advanced Monte Carlo simulations and Turbit parallel processing to create possible Bitcoin prediction scenarios.
Language: JavaScript - Size: 6.46 MB - Last synced at: 29 days ago - Pushed at: 9 months ago - Stars: 684 - Forks: 414

SebKrantz/collapse
Advanced and Fast Data Transformation in R
Language: C - Size: 106 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 679 - Forks: 35

ChenghaoMou/text-dedup
All-in-one text de-duplication
Language: Python - Size: 5.87 MB - Last synced at: 6 days ago - Pushed at: 12 months ago - Stars: 674 - Forks: 75

infoslack/awesome-kafka
A list about Apache Kafka
Size: 96.7 KB - Last synced at: 11 days ago - Pushed at: about 2 months ago - Stars: 581 - Forks: 163

kousun12/eternal
👾~ music, eternal ~ 👾
Language: JavaScript - Size: 91.3 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 511 - Forks: 31

Puchaczov/Musoq
SQL Syntax without any database
Language: C# - Size: 15.8 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 480 - Forks: 21

constellation-rs/amadeus
Harmonious distributed data analysis in Rust.
Language: Rust - Size: 2.46 MB - Last synced at: 4 days ago - Pushed at: almost 4 years ago - Stars: 479 - Forks: 25

polyaxon/haupt
Lineage metadata API, artifacts streams, sandbox, API, and spaces for Polyaxon
Language: Python - Size: 1.14 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 453 - Forks: 209

maykulkarni/Machine-Learning-Notebooks
Machine Learning notebooks for refreshing concepts.
Language: Jupyter Notebook - Size: 13.2 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 420 - Forks: 218

msamogh/nonechucks
Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!
Language: Python - Size: 25.4 KB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 377 - Forks: 27

flow-php/etl
PHP - ETL (Extract Transform Load) data processing library
Language: PHP - Size: 3.5 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 357 - Forks: 20

ml6team/fondant
Production-ready data processing made easy and shareable
Language: Python - Size: 23 MB - Last synced at: 30 days ago - Pushed at: 12 months ago - Stars: 352 - Forks: 26

keithorange/PatternPy
📈 PatternPy: A Python package revolutionizing trading analysis with high-speed pattern recognition, leveraging Pandas & Numpy. Effortlessly spot Head & Shoulders, Tops & Bottoms, Supports & Resistances. For experts & beginners. #TradingMadeEasy 🔥
Language: Python - Size: 404 KB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 335 - Forks: 78

lithops-cloud/lithops
A multi-cloud framework for big data analytics and embarrassingly parallel jobs that provides a universal API for building parallel applications in the cloud ☁️🚀
Language: Python - Size: 12.9 MB - Last synced at: about 3 hours ago - Pushed at: 2 months ago - Stars: 330 - Forks: 113

matousc89/padasip
Python Adaptive Signal Processing
Language: Python - Size: 5.93 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 311 - Forks: 52

alttch/rapidtables
Super fast list of dicts to pre-formatted tables conversion library for Python 2/3
Language: Python - Size: 254 KB - Last synced at: 6 days ago - Pushed at: over 5 years ago - Stars: 287 - Forks: 8

PytLab/VASPy
Manipulating VASP files with Python.
Language: Python - Size: 21.1 MB - Last synced at: 26 days ago - Pushed at: almost 3 years ago - Stars: 281 - Forks: 98

streamnative/pulsar-flink 📦
Elastic data processing with Apache Pulsar and Apache Flink
Language: Java - Size: 2.16 MB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 279 - Forks: 119

Yord/pxi
🧚 pxi (pixie) is a small, fast, and magical command-line data processor similar to jq, mlr, and awk.
Language: JavaScript - Size: 19.6 MB - Last synced at: 12 days ago - Pushed at: over 4 years ago - Stars: 269 - Forks: 3

svenkreiss/pysparkling
A pure Python implementation of Apache Spark's RDD and DStream interfaces.
Language: Python - Size: 3.45 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 268 - Forks: 44

ColasGael/Machine-Learning-for-Solar-Energy-Prediction
Predict the Power Production of a solar panel farm from Weather Measurements using Machine Learning
Language: Python - Size: 922 MB - Last synced at: 2 days ago - Pushed at: over 5 years ago - Stars: 266 - Forks: 112

scramjetorg/scramjet
Public tracker for Scramjet Cloud Platform, a platform that brings data from many environments together.
Size: 2.71 MB - Last synced at: 9 months ago - Pushed at: over 2 years ago - Stars: 253 - Forks: 20

asyml/forte
Forte is a flexible and powerful ML workflow builder. This is part of the CASL project: http://casl-project.ai/
Language: Python - Size: 17.8 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 244 - Forks: 60

airscholar/e2e-data-engineering
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.
Language: Python - Size: 289 KB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 240 - Forks: 117

helmholtz-analytics/heat
Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
Language: Python - Size: 21 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 221 - Forks: 54

apache/incubator-wayang
Apache Wayang (incubating) is the first cross-platform data processing system.
Language: Java - Size: 18.8 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 220 - Forks: 97

mech-lang/mech
🦾 Mech is a programming language for building data-driven systems like robots, games, and interfaces. Start here!
Language: Rust - Size: 10.9 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 217 - Forks: 12

senbox-org/snap-engine
ESA Earth Observation Toolbox and Java Development Platform
Language: Java - Size: 911 MB - Last synced at: 2 days ago - Pushed at: 4 days ago - Stars: 193 - Forks: 102

hxz393/BrutalityExtractor
Multiprocess decompression software for high-performance systems
Language: Python - Size: 4.91 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 181 - Forks: 12

LibreCat/Catmandu
Catmandu - a data processing toolkit
Language: Perl - Size: 53.1 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 176 - Forks: 31

markus-wa/cq
Clojure Query: A Command-line Data Processor for JSON, YAML, EDN, XML and more
Language: Clojure - Size: 202 KB - Last synced at: 7 days ago - Pushed at: 8 months ago - Stars: 174 - Forks: 11

remotesensinginfo/rsgislib
Remote Sensing and GIS Software Library; Python module tools for processing spatial data.
Language: C++ - Size: 140 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 152 - Forks: 27

senbox-org/snap-desktop
Desktop GUI for SNAP based on NetBeans Platform
Language: Java - Size: 77.2 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 141 - Forks: 64

tollwerk/data-processing-agreements
Collection of Data Processing Agreement (DPA) and GDPR compliance resources
Language: SCSS - Size: 98.6 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 134 - Forks: 24

iam-mhaseeb/Skytrax-Data-Warehouse 📦
A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for cloud data warehouse and Metabase to serve the needs of data visualizations such as analytical dashboards.
Language: Python - Size: 1.34 MB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 133 - Forks: 28

luckylittle/blinkist-m4a-downloader
Grabs all of the audio files from all of the Blinkist books
Language: Go - Size: 101 KB - Last synced at: 12 days ago - Pushed at: over 2 years ago - Stars: 132 - Forks: 25

thu-coai/cotk
Conversational Toolkit. An Open-Source Toolkit for Fast Development and Fair Evaluation of Text Generation
Language: Python - Size: 10.5 MB - Last synced at: 29 days ago - Pushed at: over 4 years ago - Stars: 127 - Forks: 26

kfultz07/go-dataframe
A simple package to abstract away the process of creating usable DataFrames for data analytics. This package is heavily inspired by the amazing Python library, Pandas.
Language: Go - Size: 3.93 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 120 - Forks: 7

dream-num/univer-clipsheet
A powerful Chrome extension for web scraping
Language: TypeScript - Size: 5.72 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 119 - Forks: 17

utdemir/distributed-dataset
A distributed data processing framework in Haskell.
Language: Haskell - Size: 875 KB - Last synced at: 6 days ago - Pushed at: almost 5 years ago - Stars: 116 - Forks: 5

Siteimprove/alfa
♿ Suite of open and standards-based tools for performing reliable accessibility conformance testing at scale
Language: TypeScript - Size: 52.5 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 115 - Forks: 13

LiberTEM/LiberTEM
Open pixelated STEM framework
Language: Python - Size: 224 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 114 - Forks: 68

drshahizan/HPDP
High performance data processing employs high performance computing (HPC) to process data, which is then translated into information and knowledge. The advent of high-performance computing and data analytics enabled real-time interrogation of extremely large data sets.
Language: Jupyter Notebook - Size: 188 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 113 - Forks: 85

streamnative/pulsar-spark
Spark Connector to read and write with Pulsar
Language: Scala - Size: 711 KB - Last synced at: 29 days ago - Pushed at: 6 months ago - Stars: 113 - Forks: 50

zengwangfa/2019-Electronic-Design-Competition
[Electronic Design Contest] 2019 National Undergraduate Electronic Design Contest (Problem F): paper-count detection device (based on STM32F407 & FDC2214 & USART HMI)
Language: C - Size: 80.9 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 105 - Forks: 41

whoiskatrin/financial-statement-pdf-extractor
Python script to extract as much structured information as possible from annual/quarterly reports.
Language: Python - Size: 17.6 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 99 - Forks: 24

NVIDIA/nvImageCodec
nvImageCodec: a library of GPU- and CPU-accelerated codecs featuring a unified interface
Language: Jupyter Notebook - Size: 22.3 MB - Last synced at: 30 days ago - Pushed at: about 2 months ago - Stars: 97 - Forks: 8

asavinov/prosto
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Language: Python - Size: 1.95 MB - Last synced at: 7 days ago - Pushed at: over 3 years ago - Stars: 90 - Forks: 5

docwire/docwire
DocWire SDK: Award-winning modern data processing in C++20. SourceForge Community Choice & Microsoft support. AI-driven processing. Supports nearly 100 data formats, including email boxes and OCR. Boost efficiency in text extraction, web data extraction, data mining, document analysis. Offline processing is possible for security and confidentiality
Language: C++ - Size: 35.8 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 83 - Forks: 18

DRUMNICORN/Visio
Visio is an AI-powered IDE concept that turns software development into a visual, code-free experience, making programming accessible to everyone.
Size: 1020 KB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 83 - Forks: 5

MDSplus/mdsplus
The MDSplus data management system
Language: Java - Size: 148 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 82 - Forks: 48

akashlevy/Deep-Learn-Oil
Deep learning tools for predicting oil well data
Language: Python - Size: 512 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 81 - Forks: 51

aces/cbrain
CBRAIN is a flexible Ruby on Rails framework for accessing and processing large data on high-performance computing infrastructures.
Language: Ruby - Size: 20.1 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 77 - Forks: 51

duoan/ijcai18-mama-ads-competition
IJCAI-18 Alimama search ad conversion prediction: preliminary-round solution
Language: Jupyter Notebook - Size: 1.11 MB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 72 - Forks: 22

vortex-exoplanet/VIP
VIP is a Python package/library for angular, reference star and spectral differential imaging for exoplanet/disk detection through high-contrast imaging.
Language: Python - Size: 407 MB - Last synced at: 9 days ago - Pushed at: 18 days ago - Stars: 70 - Forks: 60

alirezatheh/perke
A keyphrase extractor for Persian
Language: Python - Size: 143 KB - Last synced at: 19 days ago - Pushed at: about 1 month ago - Stars: 69 - Forks: 8

pauliacomi/pyGAPS
A framework for processing adsorption data and isotherm fitting
Language: Python - Size: 26.4 MB - Last synced at: 29 days ago - Pushed at: 2 months ago - Stars: 69 - Forks: 24

JusperLee/LRS3-For-Speech-Separation
Multi-modal speech separation task data generation script on LRS3 data set.
Language: MATLAB - Size: 3.4 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 69 - Forks: 14

p-ranav/pipeline
Pipelines for Modern C++
Language: C++ - Size: 245 KB - Last synced at: 7 days ago - Pushed at: over 4 years ago - Stars: 67 - Forks: 8

UrbanOS-Public/smartcitiesdata
The core micro services of UrbanOS as an umbrella project with component documentation
Language: Elixir - Size: 14.1 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 65 - Forks: 11

unidentifieddeveloper/blaze
A blazing fast exporter for your Elasticsearch data.
Language: C++ - Size: 34.2 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 62 - Forks: 9

BjoernKW/ZenQuery
Enterprise backend as a service
Language: Java - Size: 5.84 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 60 - Forks: 15

AtomGraph/Processor
Ontology-driven Linked Data processor and server for SPARQL backends. Apache License.
Language: Java - Size: 1.51 MB - Last synced at: 29 days ago - Pushed at: almost 2 years ago - Stars: 59 - Forks: 7

josephmachado/online_store
End to end data engineering project
Language: Python - Size: 1.53 MB - Last synced at: 27 days ago - Pushed at: over 2 years ago - Stars: 54 - Forks: 18

31z4/storm-docker 📦
Docker image packaging for Apache Storm
Language: Dockerfile - Size: 81.1 KB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 52 - Forks: 27

wq/itertable
⇔ IterTable is a Pythonic API for iterating through tabular data formats, including CSV, XLSX, XML, and JSON.
Language: Python - Size: 248 KB - Last synced at: 2 days ago - Pushed at: almost 2 years ago - Stars: 52 - Forks: 11

TirendazAcademy/Data-Visualization-with-Python
Data Visualization Tutorial | Matplotlib | Seaborn | Pandas
Language: Jupyter Notebook - Size: 25.5 MB - Last synced at: 22 days ago - Pushed at: almost 2 years ago - Stars: 51 - Forks: 34

luisbelloch/data_processing_course
Some class materials for a data processing course using PySpark
Language: Python - Size: 563 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 51 - Forks: 24

Samson-Mano/Fast_Fourier_Transform
C# implementation of Cooley–Tukey's FFT algorithm.
Language: C# - Size: 1.44 MB - Last synced at: 29 days ago - Pushed at: almost 2 years ago - Stars: 48 - Forks: 17

adelekuzmiakova/CS229-machine-learning-solar-energy-predictions
Predicting solar energy using machine learning (LSTM, PCA, boosting). This is our CS 229 project from autumn 2017. Report and poster are included.
Language: Python - Size: 922 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 48 - Forks: 12

gabyx/ExecutionGraph
Fast Generic Execution Graph/Network
Language: C++ - Size: 24.8 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 46 - Forks: 7

AutodeskAILab/BRepNet
BRepNet: A topological message passing system for solid models
Language: Python - Size: 24.3 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 45 - Forks: 15

jqnpm/jqnpm 📦
A package manager built for the command-line JSON processor jq.
Language: Shell - Size: 158 KB - Last synced at: 5 months ago - Pushed at: almost 4 years ago - Stars: 45 - Forks: 3

soumyadip007/Data-Science-Using-Python-University-Course-Module
“Data science” is just about as broad of a term as they come. It may be easiest to describe what it is by listing its more concrete components: Data exploration & analysis. Included here: Pandas; NumPy; SciPy; a helping hand from Python's Standard Library.
Language: Jupyter Notebook - Size: 34.1 MB - Last synced at: about 1 month ago - Pushed at: about 5 years ago - Stars: 45 - Forks: 46

jeffgrunewald/stargate
An Apache Pulsar client written in Elixir
Language: Elixir - Size: 83 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 43 - Forks: 12

CityofToronto/bdit_data-sources
Data sources used by the Big Data Innovation Team
Language: Jupyter Notebook - Size: 119 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 40 - Forks: 8

industrial-edge/Developer-Guide-Hands-on-App
Hands-on application for Industrial Edge Developer Guide
Language: Python - Size: 3.91 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 40 - Forks: 21
