GitHub topics: data-processing
havrak/fmcw-surveillance-radar
Repository collecting all files for my bachelor's thesis on constructing a surveillance radar based on the FMCW SiRad Easy r4
Language: MATLAB - Size: 89.7 MB - Last synced at: about 8 hours ago - Pushed at: about 8 hours ago - Stars: 0 - Forks: 0

alimghmi/bdlc
Bloomberg API integration, handling data requests, processing, and SQL database insertion.
Language: Python - Size: 42 KB - Last synced at: about 8 hours ago - Pushed at: about 8 hours ago - Stars: 0 - Forks: 0

cocoindex-io/cocoindex
ETL framework to make your data AI-ready, with real-time incremental updates and support for custom logic composed like Lego.
Language: Rust - Size: 6.88 MB - Last synced at: about 24 hours ago - Pushed at: about 24 hours ago - Stars: 1,088 - Forks: 66

Joanna20Carrion/Generador-De-Oficios
Flask web application that generates personalized official letters in Word from a template, using recipient data stored in an Excel business directory.
Language: HTML - Size: 27.3 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1 - Forks: 0

patricksferraz/cep2address
A high-performance Python tool for batch processing Brazilian postal codes (CEP) into complete addresses. Features parallel processing, multiple API sources, and flexible I/O formats. Perfect for data enrichment and address validation.
Language: Python - Size: 9.77 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1 - Forks: 1

patricksferraz/pcr-analysis
Machine learning-powered PCR data analysis toolkit featuring transfer learning, time series forecasting, and SHAP-based model interpretability. Built with TensorFlow and scikit-learn for advanced biological data processing.
Language: Jupyter Notebook - Size: 9.66 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

polyaxon/haupt
Lineage metadata API, artifacts streams, sandbox, API, and spaces for Polyaxon
Language: Python - Size: 1.14 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 453 - Forks: 209

hemicharly/on-demand-archive-process-nodejs
This project demonstrates an example Node.js application, with the goal of applying the power of Node.js stream and pipeline in data processing, aiming to efficiently process large data sets in batches, minimizing memory consumption.
Language: JavaScript - Size: 146 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1 - Forks: 1

Djirlic/raw-transactions-handler
AWS Lambda function for validating and transforming CSV data to Parquet format using Polars. Valid data is ingested into an S3 bucket for refined data, while invalid data is quarantined and logged for further analysis.
Language: Python - Size: 84 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

numaproj/numaflow
Kubernetes-native platform to run massively parallel data/streaming jobs
Language: Go - Size: 38.8 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,864 - Forks: 131

apache/incubator-wayang
Apache Wayang (incubating) is the first cross-platform data processing system.
Language: Java - Size: 18.8 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 220 - Forks: 97

helmholtz-analytics/heat
Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
Language: Python - Size: 21 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 221 - Forks: 54

Efidieeieiddidfkkfkfkf/Generador-De-Oficios
Flask web application that generates personalized official letters in Word from a template, using recipient data stored in an Excel business directory.
Language: Python - Size: 14.6 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

flow-php/etl-adapter-xml
PHP ETL Adapter: XML
Language: PHP - Size: 1.76 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 5 - Forks: 2

24greyhat/JSORM
Python JSON ORM (simple module all in one file)
Language: Python - Size: 11.7 KB - Last synced at: 27 minutes ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

venis-majkofci/Log2Csv
A PowerShell script designed to parse and convert unstructured log files into structured CSV format, facilitating easier analysis and processing.
Language: PowerShell - Size: 26.4 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

liblaf/awesome
🌟 A curated collection of awesome tools, libraries, and resources for developers
Language: MDX - Size: 2.24 MB - Last synced at: 3 days ago - Pushed at: 12 days ago - Stars: 2 - Forks: 0

earthai-tech/gofast
gofast: AIO machine learning package
Language: Python - Size: 38.8 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 2 - Forks: 2

speedcell4/torchglyph
Data Processor Combinators for Natural Language Processing
Language: Python - Size: 546 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 7 - Forks: 1

SunilKuruba/Data-Science-Project-Social-Aware-Movie-Revenue-Prediction-Using-Metadata-and-Sentiment-Signals Fork of nithish-kumar-t/movie-box-office-prediction
A machine learning pipeline that predicts movie box office revenue by combining traditional metadata (e.g., budget, genre, cast) with sentiment and emotion scores extracted from Reddit and YouTube using transformer-based NLP models. Achieves up to 15% accuracy improvement using LightGBM, CatBoost, and XGBoost.
Language: Jupyter Notebook - Size: 43.6 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

zakarialaoui10/ZikoMatrix
Arduino library for creating and manipulating matrices of arbitrary size and data type. The library provides a Matrix class that can be used to create matrices, perform basic matrix operations
Language: C++ - Size: 334 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 38 - Forks: 2

Tyson-cyber/GetMerlin2Api
GetMerlin2Api is a versatile API that allows users to seamlessly integrate Merlin2 software capabilities into their own applications, enabling enhanced project management and collaboration features. With its comprehensive documentation and user-friendly endpoints, developers can easily leverage the power of Merlin2 within their projects for optimal
Size: 1000 Bytes - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

ndjapic/mat7-2024
Materials for the seventh-grade mathematics course in the 2024/2025 school year
Language: TeX - Size: 1.24 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

Pig85236/45K-Udemy-Course-WordPress-Posts
XML files of 45K+ Udemy courses for WordPress—Share Knowledge, Drive Traffic, & Make Money! 🔥🚀
Size: 1.95 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 3 - Forks: 1

legend-exp/legend-dataflow
LEGEND data flow management
Language: Python - Size: 1.25 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 2 - Forks: 13

bytewax/bytewax
Python Stream Processing
Language: Python - Size: 12 MB - Last synced at: 4 days ago - Pushed at: about 2 months ago - Stars: 1,726 - Forks: 79

drshahizan/HPDP
High performance data processing employs high performance computing (HPC) to process data, which is then translated into information and knowledge. The advent of high-performance computing and data analytics enabled real-time interrogation of extremely large data sets.
Language: Jupyter Notebook - Size: 188 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 113 - Forks: 85

code-mike-code/excursions-order-panel
This project is a JavaScript-based travel order system that allows users to: • Upload a CSV file containing travel offers • Select trips, specify the number of adults/children, and add them to an order list • Review and remove items from the order summary • Submit the order with validated customer details
Language: JavaScript - Size: 22.5 KB - Last synced at: 3 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

johnkerl/miller
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Language: Go - Size: 201 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 9,279 - Forks: 222

NVIDIA/NeMo-Curator
Scalable data preprocessing and curation toolkit for LLMs
Language: Jupyter Notebook - Size: 7.73 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 900 - Forks: 125

abhimehro/Seatek_Analysis
R-based analysis tier for Seatek sensor data processing and Excel workbook generation. Part of a three-tier analysis system working in conjunction with a Python-based visualization project.
Language: Python - Size: 49.2 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1 - Forks: 0

NVIDIA/DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Language: C++ - Size: 394 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 5,383 - Forks: 634

etsap-TIMES/xl2times
Open source tool to convert TIMES models specified in Excel
Language: Python - Size: 872 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 17 - Forks: 9

dashbitco/broadway
Concurrent and multi-stage data ingestion and data processing with Elixir
Language: Elixir - Size: 718 KB - Last synced at: 4 days ago - Pushed at: 11 days ago - Stars: 2,513 - Forks: 166

deepseek-ai/smallpond
A lightweight data processing framework built on DuckDB and 3FS.
Language: Python - Size: 1.77 MB - Last synced at: 5 days ago - Pushed at: 2 months ago - Stars: 4,605 - Forks: 409

tathithienthanh/WomenFashionProductRecommendationSystem
Build a recommendation system for women's fashion products on e-commerce platforms
Language: Jupyter Notebook - Size: 45.1 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

unionai-oss/pandera
A light-weight, flexible, and expressive statistical data testing library
Language: Python - Size: 4.09 MB - Last synced at: 3 days ago - Pushed at: 6 days ago - Stars: 3,784 - Forks: 333

dream-num/univer-clipsheet
A powerful Chrome extension for web scraping
Language: TypeScript - Size: 5.72 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 119 - Forks: 17

microsoft/GODEL
Large-scale pretrained models for goal-directed dialog
Language: Python - Size: 49.8 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 869 - Forks: 112

KikiBoum4980/2025-One-Billion-Row-Challenge
One Billion Row project updated for 2025
Language: Python - Size: 438 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

Elinoup/-TeraCopy-Pro
TeraCopy Pro is a powerful file transfer utility designed to accelerate the copying and moving of files. With advanced error recovery, pause and resume capabilities, and detailed transfer progress information, it optimizes file management for improved efficiency and reliability
Size: 3.91 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

graphbookai/graphbook
Visual AI development framework for training and inference of ML models, scaling pipelines, and automating workflows with Python. ⭐ Leave a star to support us!
Language: Python - Size: 1.9 MB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 39 - Forks: 3

aces/cbrain
CBRAIN is a flexible Ruby on Rails framework for accessing and processing large data on high-performance computing infrastructures.
Language: Ruby - Size: 20.1 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 77 - Forks: 51

UCSB-Library-Research-Data-Services/openrefine
Tidying Messy Spreadsheets with OpenRefine
Language: HTML - Size: 8.67 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

loggdme/kyro
Kyro is a collection of utilities and examples for creating efficient data pipelines in Go with parallel queues, rate limiters, and much more.
Language: Go - Size: 15.6 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

Siteimprove/alfa
♿ Suite of open and standards-based tools for performing reliable accessibility conformance testing at scale
Language: TypeScript - Size: 52.5 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 115 - Forks: 13

pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Language: Python - Size: 132 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 24,595 - Forks: 363

flow-php/etl
PHP - ETL (Extract Transform Load) data processing library
Language: PHP - Size: 3.5 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 357 - Forks: 20

TomWright/dasel
Select, put and delete data from JSON, TOML, YAML, XML and CSV files with a single tool. Supports conversion between formats and can be used as a Go package.
Language: Go - Size: 8.56 MB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 7,448 - Forks: 146

gireeshbharmshetty/scala-log-analyzer
A simple log analyzer in Scala using regex and functional programming.
Language: Scala - Size: 2.93 KB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

gireeshbharmshetty/scala-data-pipeline
A simple data transformation pipeline in Scala reading CSVs, joining data, and aggregating results.
Language: Scala - Size: 3.91 KB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

ParitKansal/quora-question-pairs
The goal of this project is to predict which of the provided pairs of questions contain two questions with the same meaning.
Language: Jupyter Notebook - Size: 14 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

markus-wa/cq
Clojure Query: A Command-line Data Processor for JSON, YAML, EDN, XML and more
Language: Clojure - Size: 202 KB - Last synced at: 7 days ago - Pushed at: 8 months ago - Stars: 174 - Forks: 11

alekLukanen/ChapterhouseQE
A simple distributed SQL query engine written in Rust
Language: Rust - Size: 4.85 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 2 - Forks: 1

ColasGael/Machine-Learning-for-Solar-Energy-Prediction
Predict the Power Production of a solar panel farm from Weather Measurements using Machine Learning
Language: Python - Size: 922 MB - Last synced at: 2 days ago - Pushed at: over 5 years ago - Stars: 266 - Forks: 112

hedisam/pipeline
A simple data processing pipeline supporting FIFO, fixed & dynamic worker pools, and broadcast stages.
Language: Go - Size: 1.45 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 5 - Forks: 0

CityofToronto/bdit_data-sources
Data sources used by the Big Data Innovation Team
Language: Jupyter Notebook - Size: 119 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 40 - Forks: 8

SAIL-Labs/AMICAL
Extraction pipeline and analysis tools for Aperture Masking Interferometry mode of latest generation instruments (ground-based and space).
Language: Python - Size: 113 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 9 - Forks: 7

Benji377/PyScripts
A friendly, open-source collection of standalone Python scripts for automation, data processing, and everyday utilities.
Language: Python - Size: 34.2 KB - Last synced at: 3 days ago - Pushed at: 11 days ago - Stars: 4 - Forks: 1

AmirAli104/Text2Excel
A GUI desktop application that extracts data from a text file and writes it to an Excel or CSV file using regular expression (regex) patterns
Language: Python - Size: 123 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 3 - Forks: 0

BenjaminDanker/RLStatistics-ML
Python pipeline for processing Rocket League replay data, extracting player and match statistics, and training a Random Forest model to predict match outcomes
Language: Python - Size: 9.73 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

keithorange/PatternPy
📈 PatternPy: A Python package revolutionizing trading analysis with high-speed pattern recognition, leveraging Pandas & Numpy. Effortlessly spot Head & Shoulders, Tops & Bottoms, Supports & Resistances. For experts & beginners. #TradingMadeEasy 🔥
Language: Python - Size: 404 KB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 335 - Forks: 78

eyabesbes/Traitement-donnees-manquantes
Preprocessing of missing data within the Wiki4HE dataset to enhance data quality and enable effective analysis. This includes handling missing values through various techniques.
Language: Jupyter Notebook - Size: 1.38 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 1 - Forks: 0

LiberTEM/LiberTEM
Open pixelated STEM framework
Language: Python - Size: 224 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 114 - Forks: 68

asyml/ForteHealth
The project is in the incubation stage and still under development. ForteHealth is a flexible and powerful ML workflow builder for biomedical and clinical scenarios. This is part of the CASL project: http://casl-project.ai/
Language: Python - Size: 1.98 MB - Last synced at: 10 days ago - Pushed at: about 2 years ago - Stars: 13 - Forks: 5

tomaszmrugalski/hevelius-web
🔭 Web interface for Hevelius, an astronomy image processing system.
Language: TypeScript - Size: 5.64 MB - Last synced at: about 15 hours ago - Pushed at: about 16 hours ago - Stars: 0 - Forks: 0

fmartinache/xara
A Python package for eXtreme Angular Resolution Astronomy
Language: Python - Size: 26.3 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 11 - Forks: 7

ion-fusion/fusion-java
Ion Fusion is a programmable programming language for working with JSON and Amazon Ion data.
Language: Java - Size: 4.42 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 6 - Forks: 3

docwire/docwire
DocWire SDK: Award-winning modern data processing in C++20. SourceForge Community Choice & Microsoft support. AI-driven processing. Supports nearly 100 data formats, including email boxes and OCR. Boost efficiency in text extraction, web data extraction, data mining, document analysis. Offline processing is possible for security and confidentiality
Language: C++ - Size: 35.8 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 83 - Forks: 18

geoschem/GEOS_IT
Code to process/regrid the GMAO GEOS-IT data for input into GEOS-Chem
Language: Fortran - Size: 1.23 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 2

ull0sm/Drawer
Drawer is a Python tool for automating single-elimination draw systems in karate events. It ensures fairness by filtering player data, forming balanced groups, and generating match brackets with minimal bias. Designed for efficiency, it streamlines tasks such as data organization and bracket creation, making event management seamless and reliable.
Language: Python - Size: 488 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

KILOGAS/kilogas_imaging
Imaging pipeline for KILOGAS ALMA data.
Language: Python - Size: 3.63 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

crate/cratedb-toolkit
CrateDB Toolkit, an SDK for CrateDB and CrateDB Cloud.
Language: Python - Size: 981 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 9 - Forks: 4

forieux/qmm
Python Quadratic Majorization-Minimization (MM) optimization algorithms for half-quadratic criteria. Inverse problems, image restoration, denoising, ...
Language: Python - Size: 797 KB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 17 - Forks: 3

Puchaczov/Musoq
SQL Syntax without any database
Language: C# - Size: 15.8 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 480 - Forks: 21

frutik/awesome-e-commerce
Size: 27.3 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 35 - Forks: 1

GoogleCloudPlatform/DataflowJavaSDK 📦
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Size: 12.9 MB - Last synced at: 10 days ago - Pushed at: over 4 years ago - Stars: 857 - Forks: 320

microsoft/DialoGPT
Large-scale pretraining for dialogue
Language: Python - Size: 43.6 MB - Last synced at: 4 days ago - Pushed at: over 2 years ago - Stars: 2,383 - Forks: 347

remotesensinginfo/rsgislib
Remote Sensing and GIS Software Library; Python module tools for processing spatial data.
Language: C++ - Size: 140 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 152 - Forks: 27

UrbanOS-Public/smartcitiesdata
The core microservices of UrbanOS as an umbrella project with component documentation
Language: Elixir - Size: 14.1 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 65 - Forks: 11

flow-php/etl-adapter-parquet
PHP ETL Adapter: Parquet
Language: PHP - Size: 3.14 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 7 - Forks: 0

flow-php/etl-adapter-http
PHP ETL Adapter: Http
Language: PHP - Size: 270 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 4 - Forks: 2

flow-php/etl-adapter-csv
PHP ETL Adapter: CSV
Language: PHP - Size: 1.31 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 5 - Forks: 2

flow-php/etl-adapter-avro
PHP ETL Adapter: Avro
Language: PHP - Size: 1.68 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 2 - Forks: 0

flow-php/etl-adapter-json
PHP ETL Adapter: JSON
Language: PHP - Size: 1.74 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 6 - Forks: 3

flow-php/etl-adapter-elasticsearch
PHP ETL Adapter: Elasticsearch
Language: PHP - Size: 284 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 2 - Forks: 1

flow-php/etl-adapter-text
PHP ETL Adapter: Text
Language: PHP - Size: 1.09 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 3 - Forks: 0

flow-php/etl-adapter-logger
PHP ETL Adapter: Logger
Language: PHP - Size: 206 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 4 - Forks: 1

mech-lang/mech
🦾 Mech is a programming language for building data-driven systems like robots, games, and interfaces. Start here!
Language: Rust - Size: 10.9 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 217 - Forks: 12

ChenghaoMou/text-dedup
All-in-one text de-duplication
Language: Python - Size: 5.87 MB - Last synced at: 6 days ago - Pushed at: 12 months ago - Stars: 674 - Forks: 75

Siteimprove/alfa-act-r
📋 Acceptance testing of rules authored by the ACT Rules Community Group (@act-rules) and implemented by Alfa
Language: TypeScript - Size: 31.6 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 1 - Forks: 2

MDSplus/mdsplus
The MDSplus data management system
Language: Java - Size: 148 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 82 - Forks: 48

Noursalem2005/Web-Scraping-Project
A basic web scraping project for educational purposes. It demonstrates how to extract data from a website and export it into multiple formats such as CSV, PDF, HTML, JPG, and TXT using Python.
Language: HTML - Size: 39.1 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

senbox-org/snap-engine
ESA Earth Observation Toolbox and Java Development Platform
Language: Java - Size: 911 MB - Last synced at: 2 days ago - Pushed at: 4 days ago - Stars: 193 - Forks: 102

stastnypremysl/lsql-csv
lsql-csv is a tool for querying small CSV files from a shell with short queries. It makes it possible to work with small CSV files as with read-only relational databases. The tool implements a new language, LSQL, similar to SQL and specifically designed for working with CSV files in a shell. LSQL aims to be a more lapidary language than SQL.
Language: Haskell - Size: 2.78 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 1 - Forks: 0

flow-php/etl-adapter-doctrine
PHP ETL Adapter: Doctrine
Language: PHP - Size: 422 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 3 - Forks: 2

SebKrantz/collapse
Advanced and Fast Data Transformation in R
Language: C - Size: 106 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 679 - Forks: 35

ReusJimenez/python-data-engineering
Hands-on data engineering labs with Python. ⚙️
Language: Jupyter Notebook - Size: 27.7 MB - Last synced at: 16 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

thibaultmeyer/mupipe
Pipeline microframework for data processing
Language: Java - Size: 104 KB - Last synced at: 1 day ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

senbox-org/snap-desktop
Desktop GUI for SNAP based on NetBeans Platform
Language: Java - Size: 77.2 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 141 - Forks: 64
