An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: data-processing

havrak/fmcw-surveillance-radar

Respository to concentrate all files concerning my bachelor's thesis about constructing an surveillance radar based on FMCW SiRad Easy r4

Language: MATLAB - Size: 89.7 MB - Last synced at: about 8 hours ago - Pushed at: about 8 hours ago - Stars: 0 - Forks: 0

alimghmi/bdlc

Bloomberg API integration, handling data requests, processing, and SQL database insertion.

Language: Python - Size: 42 KB - Last synced at: about 8 hours ago - Pushed at: about 8 hours ago - Stars: 0 - Forks: 0

cocoindex-io/cocoindex

ETL framework to turn your data AI-ready - with realtime incremental updates and support custom logic like lego.

Language: Rust - Size: 6.88 MB - Last synced at: about 24 hours ago - Pushed at: about 24 hours ago - Stars: 1,088 - Forks: 66

Joanna20Carrion/Generador-De-Oficios

Aplicación web en Flask que genera oficios personalizados en Word desde una plantilla, usando datos de destinatarios almacenados en un Excel de directorio empresarial.

Language: HTML - Size: 27.3 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1 - Forks: 0

patricksferraz/cep2address

A high-performance Python tool for batch processing Brazilian postal codes (CEP) into complete addresses. Features parallel processing, multiple API sources, and flexible I/O formats. Perfect for data enrichment and address validation.

Language: Python - Size: 9.77 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1 - Forks: 1

patricksferraz/pcr-analysis

Machine learning-powered PCR data analysis toolkit featuring transfer learning, time series forecasting, and SHAP-based model interpretability. Built with TensorFlow and scikit-learn for advanced biological data processing.

Language: Jupyter Notebook - Size: 9.66 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

polyaxon/haupt

Lineage metadata API, artifacts streams, sandbox, API, and spaces for Polyaxon

Language: Python - Size: 1.14 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 453 - Forks: 209

hemicharly/on-demand-archive-process-nodejs

This project demonstrates an example Node.js application, with the goal of applying the power of Node.js stream and pipeline in data processing, aiming to efficiently process large data sets in batches, minimizing memory consumption.

Language: JavaScript - Size: 146 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1 - Forks: 1

Djirlic/raw-transactions-handler

AWS Lambda function for validating and transforming CSV data to Parquet format using Polars. Valid data is ingested into an S3 bucket for refined data, while invalid data is quarantined and logged for further analysis.

Language: Python - Size: 84 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

numaproj/numaflow

Kubernetes-native platform to run massively parallel data/streaming jobs

Language: Go - Size: 38.8 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,864 - Forks: 131

apache/incubator-wayang

Apache Wayang(incubating) is the first cross-platform data processing system.

Language: Java - Size: 18.8 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 220 - Forks: 97

helmholtz-analytics/heat

Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python

Language: Python - Size: 21 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 221 - Forks: 54

Efidieeieiddidfkkfkfkf/Generador-De-Oficios

Aplicación web en Flask que genera oficios personalizados en Word desde una plantilla, usando datos de destinatarios almacenados en un Excel de directorio empresarial.

Language: Python - Size: 14.6 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

flow-php/etl-adapter-xml

PHP ETL Adapter: XML

Language: PHP - Size: 1.76 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 5 - Forks: 2

24greyhat/JSORM

Python JSON ORM (simple module all in one file)

Language: Python - Size: 11.7 KB - Last synced at: 27 minutes ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

venis-majkofci/Log2Csv

A PowerShell script designed to parse and convert unstructured log files into structured CSV format, facilitating easier analysis and processing.

Language: PowerShell - Size: 26.4 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

liblaf/awesome

🌟 A curated collection of awesome tools, libraries, and resources for developers

Language: MDX - Size: 2.24 MB - Last synced at: 3 days ago - Pushed at: 12 days ago - Stars: 2 - Forks: 0

earthai-tech/gofast

gofast: AIO machine learning package

Language: Python - Size: 38.8 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 2 - Forks: 2

speedcell4/torchglyph

Data Processor Combinators for Natural Language Processing

Language: Python - Size: 546 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 7 - Forks: 1

SunilKuruba/Data-Science-Project-Social-Aware-Movie-Revenue-Prediction-Using-Metadata-and-Sentiment-Signals Fork of nithish-kumar-t/movie-box-office-prediction

A machine learning pipeline that predicts movie box office revenue by combining traditional metadata (e.g., budget, genre, cast) with sentiment and emotion scores extracted from Reddit and YouTube using transformer-based NLP models. Achieves up to 15% accuracy improvement using LightGBM, CatBoost, and XGBoost.

Language: Jupyter Notebook - Size: 43.6 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

zakarialaoui10/ZikoMatrix

Arduino library for creating and manipulating matrices of arbitrary size and data type. The library provides a Matrix class that can be used to create matrices, perform basic matrix operations

Language: C++ - Size: 334 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 38 - Forks: 2

Tyson-cyber/GetMerlin2Api

GetMerlin2Api is a versatile API that allows users to seamlessly integrate Merlin2 software capabilities into their own applications, enabling enhanced project management and collaboration features. With its comprehensive documentation and user-friendly endpoints, developers can easily leverage the power of Merlin2 within their projects for optimal

Size: 1000 Bytes - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

ndjapic/mat7-2024

Материјали за предмет математика у седмом разреду у школској 2024/2025. години

Language: TeX - Size: 1.24 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

Pig85236/45K-Udemy-Course-WordPress-Posts

XML files of 45K+ Udemy courses for WordPress—Share Knowledge, Drive Traffic, & Make Money! 🔥🚀

Size: 1.95 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 3 - Forks: 1

legend-exp/legend-dataflow

LEGEND data flow management

Language: Python - Size: 1.25 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 2 - Forks: 13

bytewax/bytewax

Python Stream Processing

Language: Python - Size: 12 MB - Last synced at: 4 days ago - Pushed at: about 2 months ago - Stars: 1,726 - Forks: 79

drshahizan/HPDP

High performance data processing employs high performance computing (HPC) to process data, which is then translated into information and knowledge. The advent of high-performance computing and data analytics enabled real-time interrogation of extremely large data sets.

Language: Jupyter Notebook - Size: 188 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 113 - Forks: 85

code-mike-code/excursions-order-panel

This project is a JavaScript-based travel order system that allows users to: • Upload a CSV file containing travel offers • Select trips, specify the number of adults/children, and add them to an order list • Review and remove items from the order summary • Submit the order with validated customer details

Language: JavaScript - Size: 22.5 KB - Last synced at: 3 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

johnkerl/miller

Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON

Language: Go - Size: 201 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 9,279 - Forks: 222

NVIDIA/NeMo-Curator

Scalable data pre processing and curation toolkit for LLMs

Language: Jupyter Notebook - Size: 7.73 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 900 - Forks: 125

abhimehro/Seatek_Analysis

R-based analysis tier for Seatek sensor data processing and Excel workbook generation. Part of a three-tier analysis system working in conjunction with Python-based visualization project.

Language: Python - Size: 49.2 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1 - Forks: 0

NVIDIA/DALI

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

Language: C++ - Size: 394 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 5,383 - Forks: 634

etsap-TIMES/xl2times

Open source tool to convert TIMES models specified in Excel

Language: Python - Size: 872 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 17 - Forks: 9

dashbitco/broadway

Concurrent and multi-stage data ingestion and data processing with Elixir

Language: Elixir - Size: 718 KB - Last synced at: 4 days ago - Pushed at: 11 days ago - Stars: 2,513 - Forks: 166

deepseek-ai/smallpond

A lightweight data processing framework built on DuckDB and 3FS.

Language: Python - Size: 1.77 MB - Last synced at: 5 days ago - Pushed at: 2 months ago - Stars: 4,605 - Forks: 409

tathithienthanh/WomenFashionProductRecommendationSystem

Build a recommendation system for recommending woman fashion's products on e-commerce platforms

Language: Jupyter Notebook - Size: 45.1 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

unionai-oss/pandera

A light-weight, flexible, and expressive statistical data testing library

Language: Python - Size: 4.09 MB - Last synced at: 3 days ago - Pushed at: 6 days ago - Stars: 3,784 - Forks: 333

dream-num/univer-clipsheet

A powerful Chrome extension for web scraping

Language: TypeScript - Size: 5.72 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 119 - Forks: 17

microsoft/GODEL

Large-scale pretrained models for goal-directed dialog

Language: Python - Size: 49.8 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 869 - Forks: 112

KikiBoum4980/2025-One-Billion-Row-Challenge

Projeto One Billion Row atualizado para 2025

Language: Python - Size: 438 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

Elinoup/-TeraCopy-Pro

TeraCopy Pro is a powerful file transfer utility designed to accelerate the copying and moving of files. With advanced error recovery, pause and resume capabilities, and detailed transfer progress information, it optimizes file management for improved efficiency and reliability

Size: 3.91 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

graphbookai/graphbook

Visual AI development framework for training and inference of ML models, scaling pipelines, and automating workflows with Python.⭐ Leave a star to support us!

Language: Python - Size: 1.9 MB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 39 - Forks: 3

aces/cbrain

CBRAIN is a flexible Ruby on Rails framework for accessing and processing of large data on high-performance computing infrastructures.

Language: Ruby - Size: 20.1 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 77 - Forks: 51

UCSB-Library-Research-Data-Services/openrefine

Tidying Messy Spreadsheets with OpenRefine

Language: HTML - Size: 8.67 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

loggdme/kyro

Kyro is a collection of utilities and examples for creating efficient data pipelines in go with parallel queues and, rate limitiers and much more.

Language: Go - Size: 15.6 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

Siteimprove/alfa

:wheelchair: Suite of open and standards-based tools for performing reliable accessibility conformance testing at scale

Language: TypeScript - Size: 52.5 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 115 - Forks: 13

pathwaycom/pathway

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

Language: Python - Size: 132 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 24,595 - Forks: 363

flow-php/etl

PHP - ETL (Extract Transform Load) data processing library

Language: PHP - Size: 3.5 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 357 - Forks: 20

TomWright/dasel

Select, put and delete data from JSON, TOML, YAML, XML and CSV files with a single tool. Supports conversion between formats and can be used as a Go package.

Language: Go - Size: 8.56 MB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 7,448 - Forks: 146

gireeshbharmshetty/scala-log-analyzer

A simple log analyzer in Scala using regex and functional programming.

Language: Scala - Size: 2.93 KB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

gireeshbharmshetty/scala-data-pipeline

A simple data transformation pipeline in Scala reading CSVs, joining data, and aggregating results.

Language: Scala - Size: 3.91 KB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

ParitKansal/quora-question-pairs

The goal of this project is to predict which of the provided pairs of questions contain two questions with the same meaning.

Language: Jupyter Notebook - Size: 14 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

markus-wa/cq

Clojure Query: A Command-line Data Processor for JSON, YAML, EDN, XML and more

Language: Clojure - Size: 202 KB - Last synced at: 7 days ago - Pushed at: 8 months ago - Stars: 174 - Forks: 11

alekLukanen/ChapterhouseQE

A simple distributed SQL query engine written in Rust

Language: Rust - Size: 4.85 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 2 - Forks: 1

ColasGael/Machine-Learning-for-Solar-Energy-Prediction

Predict the Power Production of a solar panel farm from Weather Measurements using Machine Learning

Language: Python - Size: 922 MB - Last synced at: 2 days ago - Pushed at: over 5 years ago - Stars: 266 - Forks: 112

hedisam/pipeline

A simple data processing pipeline supporting FIFO, fixed & dynamic worker pools, and broadcast stages.

Language: Go - Size: 1.45 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 5 - Forks: 0

CityofToronto/bdit_data-sources

Data sources used by the Big Data Innovation Team

Language: Jupyter Notebook - Size: 119 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 40 - Forks: 8

SAIL-Labs/AMICAL

Extraction pipeline and analysis tools for Aperture Masking Interferometry mode of latest generation instruments (ground-based and space).

Language: Python - Size: 113 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 9 - Forks: 7

Benji377/PyScripts

A friendly, open-source collection of standalone Python scripts for automation, data processing, and everyday utilities.

Language: Python - Size: 34.2 KB - Last synced at: 3 days ago - Pushed at: 11 days ago - Stars: 4 - Forks: 1

AmirAli104/Text2Excel

A GUI desktop application that can extract data from a text file and put them in an Excel or CSV file using regular expression (regex) patterns

Language: Python - Size: 123 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 3 - Forks: 0

BenjaminDanker/RLStatistics-ML

Python pipeline for processing Rocket League replay data, extracting player and match statistics, and training a Random Forest model to predict match outcomes

Language: Python - Size: 9.73 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

keithorange/PatternPy

📈 PatternPy: A Python package revolutionizing trading analysis with high-speed pattern recognition, leveraging Pandas & Numpy. Effortlessly spot Head & Shoulders, Tops & Bottoms, Supports & Resistances. For experts & beginners. #TradingMadeEasy 🔥

Language: Python - Size: 404 KB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 335 - Forks: 78

eyabesbes/Traitement-donnees-manquantes

Preprocessing of missing data within the Wiki4HE dataset to enhance data quality and enable effective analysis. This includes handling missing values through various techniques.

Language: Jupyter Notebook - Size: 1.38 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 1 - Forks: 0

LiberTEM/LiberTEM

Open pixelated STEM framework

Language: Python - Size: 224 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 114 - Forks: 68

asyml/ForteHealth

The project is in the incubation stage and still under development. ForteHealth is a flexible and powerful ML workflow builder for biomedical and clinical scenarios. This is part of the CASL project: http://casl-project.ai/

Language: Python - Size: 1.98 MB - Last synced at: 10 days ago - Pushed at: about 2 years ago - Stars: 13 - Forks: 5

tomaszmrugalski/hevelius-web

🔭 Web interface for the Hevelius, an astronomy image processing system.

Language: TypeScript - Size: 5.64 MB - Last synced at: about 15 hours ago - Pushed at: about 16 hours ago - Stars: 0 - Forks: 0

fmartinache/xara

a python package for eXtreme Angular Resolution Astronomy

Language: Python - Size: 26.3 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 11 - Forks: 7

ion-fusion/fusion-java

Ion Fusion is a programmable programming language for working with JSON and Amazon Ion data.

Language: Java - Size: 4.42 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 6 - Forks: 3

docwire/docwire

DocWire SDK: Award-winning modern data processing in C++20. SourceForge Community Choice & Microsoft support. AI-driven processing. Supports nearly 100 data formats, including email boxes and OCR. Boost efficiency in text extraction, web data extraction, data mining, document analysis. Offline processing is possible for security and confidentiality

Language: C++ - Size: 35.8 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 83 - Forks: 18

geoschem/GEOS_IT

Code to process/regrid the GMAO GEOS-IT data for input into GEOS-Chem

Language: Fortran - Size: 1.23 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 2

ull0sm/Drawer

Drawer is a Python tool for automating single-elimination draw systems in karate events. It ensures fairness by filtering player data, forming balanced groups, and generating match brackets with minimal bias. Designed for efficiency, it streamlines tasks such as data organization and bracket creation, making event management seamless and reliable.

Language: Python - Size: 488 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

KILOGAS/kilogas_imaging

Imaging pipeline for KILOGAS ALMA data.

Language: Python - Size: 3.63 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

crate/cratedb-toolkit

CrateDB Toolkit, an SDK for CrateDB and CrateDB Cloud.

Language: Python - Size: 981 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 9 - Forks: 4

forieux/qmm

Python Quadratic Majorization-Minimization (MM) optimization algorithms of half-quadratic criteria. Inverses problems, image restoration, denoising, ...

Language: Python - Size: 797 KB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 17 - Forks: 3

Puchaczov/Musoq

SQL Syntax without any database

Language: C# - Size: 15.8 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 480 - Forks: 21

frutik/awesome-e-commerce

Size: 27.3 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 35 - Forks: 1

GoogleCloudPlatform/DataflowJavaSDK 📦

Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.

Size: 12.9 MB - Last synced at: 10 days ago - Pushed at: over 4 years ago - Stars: 857 - Forks: 320

microsoft/DialoGPT

Large-scale pretraining for dialogue

Language: Python - Size: 43.6 MB - Last synced at: 4 days ago - Pushed at: over 2 years ago - Stars: 2,383 - Forks: 347

remotesensinginfo/rsgislib

Remote Sensing and GIS Software Library; python module tools for processing spatial data.

Language: C++ - Size: 140 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 152 - Forks: 27

UrbanOS-Public/smartcitiesdata

The core micro services of UrbanOS as an umbrella project with component documentation

Language: Elixir - Size: 14.1 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 65 - Forks: 11

flow-php/etl-adapter-parquet

PHP ETL Adapter: Parquet

Language: PHP - Size: 3.14 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 7 - Forks: 0

flow-php/etl-adapter-http

PHP ETL Adapter: Http

Language: PHP - Size: 270 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 4 - Forks: 2

flow-php/etl-adapter-csv

PHP ETL Adapter: CSV

Language: PHP - Size: 1.31 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 5 - Forks: 2

flow-php/etl-adapter-avro

PHP ETL Adapter: Avro

Language: PHP - Size: 1.68 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 2 - Forks: 0

flow-php/etl-adapter-json

PHP ETL Adapter: JSON

Language: PHP - Size: 1.74 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 6 - Forks: 3

flow-php/etl-adapter-elasticsearch

PHP ETL Adapter: Elasticsearch

Language: PHP - Size: 284 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 2 - Forks: 1

flow-php/etl-adapter-text

PHP ETL Adapter: Text

Language: PHP - Size: 1.09 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 3 - Forks: 0

flow-php/etl-adapter-logger

PHP ETL Adapter: Logger

Language: PHP - Size: 206 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 4 - Forks: 1

mech-lang/mech

🦾 Mech is a programming language for building data-driven systems like robots, games, and interfaces. Start here!

Language: Rust - Size: 10.9 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 217 - Forks: 12

ChenghaoMou/text-dedup

All-in-one text de-duplication

Language: Python - Size: 5.87 MB - Last synced at: 6 days ago - Pushed at: 12 months ago - Stars: 674 - Forks: 75

Siteimprove/alfa-act-r

:clipboard: Acceptance testing of rules authored by the ACT Rules Community Group (@act-rules) and implemented by Alfa

Language: TypeScript - Size: 31.6 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 1 - Forks: 2

MDSplus/mdsplus

The MDSplus data management system

Language: Java - Size: 148 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 82 - Forks: 48

Noursalem2005/Web-Scraping-Project

A basic web scraping project for educational purposes. It demonstrates how to extract data from a website and export it into multiple formats such as CSV, PDF, HTML, JPG, and TXT using Python.

Language: HTML - Size: 39.1 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

senbox-org/snap-engine

ESA Earth Observation Toolbox and Java Development Platform

Language: Java - Size: 911 MB - Last synced at: 2 days ago - Pushed at: 4 days ago - Stars: 193 - Forks: 102

stastnypremysl/lsql-csv

lsql-csv is a tool for small CSV file data querying from a shell with short queries. It makes it possible to work with small CSV files like with a read-only relational databases. The tool implements a new language LSQL similar to SQL, specifically designed for working with CSV files in a shell. LSQL aims to be a more lapidary language than SQL.

Language: Haskell - Size: 2.78 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 1 - Forks: 0

flow-php/etl-adapter-doctrine

PHP ETL Adapter: Doctrine

Language: PHP - Size: 422 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 3 - Forks: 2

SebKrantz/collapse

Advanced and Fast Data Transformation in R

Language: C - Size: 106 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 679 - Forks: 35

ReusJimenez/python-data-engineering

Laboratorios prácticos de ingeniería de datos con Python. ⚙️

Language: Jupyter Notebook - Size: 27.7 MB - Last synced at: 16 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

thibaultmeyer/mupipe

Pipeline microframework for data processing

Language: Java - Size: 104 KB - Last synced at: 1 day ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

senbox-org/snap-desktop

Desktop GUI for SNAP based on NetBeans Platform

Language: Java - Size: 77.2 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 141 - Forks: 64