GitHub topics: data-loading
get-convex/convex-js
TypeScript/JavaScript client library for Convex
Language: TypeScript - Size: 1.43 MB - Last synced at: about 17 hours ago - Pushed at: about 18 hours ago - Stars: 246 - Forks: 48

warp-drive-data/warp-drive
WarpDrive is a lightweight data library for web apps — universal, typed, reactive, and ready to scale.
Language: TypeScript - Size: 360 MB - Last synced at: about 19 hours ago - Pushed at: about 21 hours ago - Stars: 3,089 - Forks: 1,343

planet-a-ventures/dlt-source-personio
DLT (www.github.com/dlt-hub/dlt) source for Personio (www.personio.com)
Language: Python - Size: 212 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1 - Forks: 0

dlt-hub/dlt
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Language: Python - Size: 95.1 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 4,073 - Forks: 320

Venecer/Perspective-AI
🤖 Set up and run your AI-powered bot easily with this user-friendly guide, minimizing terminal use for a streamlined experience.
Language: Shell - Size: 15.6 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

Ajinkya-99/snowflake_proj9_table_types
📊 Explore and manage different table types in Snowflake, including Permanent, Temporary, Transient, and External Tables, using AWS S3 data.
Size: 7.81 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

Khateeb21/snowflake_proj2_stages_and_transformations
🌨️ Load and transform data from Amazon S3 into Snowflake efficiently using stages, enhancing your data ingestion practices without altering source files.
Size: 11.7 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

Andrii04/Automated-Weather-ETL-Pipeline
Data engineering project simulating an end-to-end ETL pipeline for weather data. Automates extraction from the OpenWeatherMap API, data cleaning and transformation in Python, and loading into PostgreSQL, all orchestrated with Airflow. Delivers analysis-ready datasets for further exploration or visualization
Language: Python - Size: 178 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

debashisdash1999/snowflake_proj3_error_handling
Hands-on project showcasing Snowflake data loading with error handling using VALIDATION_MODE, ON_ERROR = CONTINUE, ON_ERROR = SKIP_FILE, and ON_ERROR = SKIP_FILE_% while ingesting CSV files from AWS S3.
Size: 11.7 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 0

debashisdash1999/snowflake_proj4_validation_modes_copy_options
Hands-on project covering Snowflake data loading with custom file formats, validation modes, error handling, string length limits, TRUNCATECOLUMNS, and analyzing load history using account_usage.load_history.
Size: 8.79 KB - Last synced at: 20 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

debashisdash1999/snowflake_proj2_stages_and_transformations
This project demonstrates how to use Snowflake stages for loading data from Amazon S3 into Snowflake tables. It also covers applying transformations during loading and selecting only specific columns from the source data.
Size: 9.77 KB - Last synced at: 20 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

kai-tub/rico-hdl
A fast and easy-to-use Remote sensing Image format COnverter for High-throughput Deep-Learning (rico-hdl).
Language: Python - Size: 76.7 MB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 10 - Forks: 2

scalecraft-dev/preen
Local-first federated analytics query engine using DuckDB.
Language: Go - Size: 3.37 MB - Last synced at: 11 days ago - Pushed at: 7 months ago - Stars: 17 - Forks: 0

SyedaNimraFatima/Coffee-Shop-Sales-Analysis-SQL-PowerBI
A dynamic Power BI dashboard for analyzing sales, product trends, and customer behavior across NYC coffee shops. Built using Excel, DAX, and custom visuals to support business intelligence and decision-making.
Size: 8.33 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

hegongshan/Storage-for-AI-Paper
Accelerating AI Training and Inference from Storage Perspective (Must-read Papers on Storage for AI)
Size: 28.3 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 30 - Forks: 3

Estif-X/Complete-Data-Engineering-and-Analysis-Project
A team-friendly data pipeline: engineers automate data flows (Airbyte → PostgreSQL → dbt) while analysts create Power BI dashboards. Perfect for reliable, on-premise data workflows.
Size: 43.9 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

DevExpress-Examples/winforms-scheduler-optimize-performance-large-dataset
Optimize Scheduler performance for large datasets.
Language: C# - Size: 1.65 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 3

DarkStarStrix/DataVolt
Reusable data engineering toolkit. My personal data infrastructure.
Language: Jupyter Notebook - Size: 56.6 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 17 - Forks: 2

planet-a-ventures/dlt-source-affinity
DLT (www.github.com/dlt-hub/dlt) Source for Affinity (www.affinity.co)
Language: Python - Size: 195 KB - Last synced at: 19 days ago - Pushed at: 26 days ago - Stars: 2 - Forks: 0

Bmonter7/Online-Retail-EDA
This repository contains an end-to-end exploratory data analysis of transactional data from a UK-based online retail store covering the period from December 2010 to November 2011. The goal is to uncover sales trends, customer behavior, and product performance, and to provide actionable recommendations that can guide strategic business decisions.
Language: Jupyter Notebook - Size: 120 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

fversaci/cassandra-dali-plugin
Cassandra plugin for NVIDIA DALI
Language: C++ - Size: 778 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 2

planet-a-ventures/dlt-source-google-workspace
DLT (www.github.com/dlt-hub/dlt) source for Google Workspace
Language: Python - Size: 66.4 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

planet-a-ventures/dlt-source-airtable
DLT (www.github.com/dlt-hub/dlt) source for Airtable (www.airtable.com)
Language: Python - Size: 63.5 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

planet-a-ventures/dlt-source-slack
DLT (www.github.com/dlt-hub/dlt) source for Slack (www.slack.com)
Language: Python - Size: 74.2 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

planet-a-ventures/dlt-source-notion
DLT (www.github.com/dlt-hub/dlt) source for Notion (www.notion.com)
Language: Python - Size: 304 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

planet-a-ventures/dlt-source-morphais
DLT (www.github.com/dlt-hub/dlt) source for Morphais (www.morphais.com)
Language: Python - Size: 186 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

npuichigo/tarzan
High-level API for tar-based dataset
Language: Python - Size: 27.3 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 1

suyashkumar/deeplesion-gcp-loader
Get the DeepLesion CT Image data set into a GCP Storage Bucket
Language: Go - Size: 8.79 KB - Last synced at: 5 months ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 1

Zenoo/slick-loader
A slick loader to use during your AJAX calls or data processing
Language: JavaScript - Size: 1.09 MB - Last synced at: 22 days ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

rtimbro185/syr_mads_ist722_data_warehouse
Syracuse University, Masters of Applied Data Science - IST 722 Data Warehouse
Language: TSQL - Size: 50.6 MB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 5

Andre3002/cmu-week2-pandas-seaborn
CMU week 2 - Stats, Data Load, Pandas, Visualization, Seaborn
Language: Jupyter Notebook - Size: 13.4 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

maksymsur/spltr 📦
`Spltr` is a simple PyTorch-based data loader and splitter. It can load arrays, matrices, Pandas DataFrames, and CSV files containing numerical data, then split them into train and test (validation) subsets in the form of PyTorch DataLoader objects.
Language: Python - Size: 99.6 KB - Last synced at: 12 days ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

TakeLab/podium
Podium: a framework agnostic Python NLP library for data loading and preprocessing
Language: Python - Size: 2.19 MB - Last synced at: 10 days ago - Pushed at: over 2 years ago - Stars: 60 - Forks: 2

harmanveer-2546/World-Best-Cities
Ranking of cities on social, environmental and economic factors.
Language: Jupyter Notebook - Size: 707 KB - Last synced at: 6 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

Suhas-H-C/batch-processing-ms-v2
Spring Batch processing with multiple data sources such as MySQL and H2
Language: Java - Size: 14.6 KB - Last synced at: 5 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

MelvinJWallace/MelvinJW.github.io
A portfolio of a host of projects completed using python and sql.
Language: CSS - Size: 8.56 MB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

YZenia/Pandas-Data-Analysis
This repository provides an introduction to essential data analysis libraries, including NumPy and Pandas.
Language: HTML - Size: 1.32 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

diyapratheep/EDA-on-Retail-Sales-Data
The goal is to perform exploratory data analysis (EDA) to uncover patterns, trends, and insights that can help the retail business make informed decisions.
Language: Jupyter Notebook - Size: 718 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

npuichigo/snake
Data loading with combined async Rust stream and Python
Language: Rust - Size: 211 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 1

motorro/RxLceModel
An Android library for data load with cache and loading state
Language: Kotlin - Size: 2.1 MB - Last synced at: 9 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

SheronRunodamoto/Mexico-Toy-Sales-Data-Warehouse
Dimensional Data Warehouse Project
Size: 109 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

brunodifranco/project-star-jeans-data-engineering
ETL building for an e-commerce Jeans company. Feel free to access the Streamlit App in the link below.
Language: Jupyter Notebook - Size: 178 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 1

mrrustemka/posts
Create Posts Form
Language: TypeScript - Size: 1.11 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

VishanthSurresh/Spotify-Capstone-Project---Data-Engineering
This repository is a working ETL framework that uses user data from the Spotify API: Python for extraction and transformation, SQL for data loading and staging, Airflow for data orchestration and monitoring, and Power BI for reporting.
Language: Python - Size: 2.1 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 1

JoaoHenriqueRX7/ETL--Data-Scrapping-Python-MySQL-
A Python-based project that automates data extraction, transformation, and loading. It focuses on ETL pipelines, web scraping, and a MySQL database, leveraging Python libraries for processing and MySQL for storage.
Language: Jupyter Notebook - Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Andrew-Mysaka/fast-react-pizza
The website to order pizza and track your orders using React, React router, Tailwind and Redux
Language: JavaScript - Size: 551 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

emSpot/ImohEdet.github.io Fork of jekyllt/vitae
👨💼 Personal resume
Language: CSS - Size: 2.45 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

aleksandr-miheichev/review_ratings_platform
An interactive platform for collecting user reviews of various kinds of artwork, classifying them into "Books", "Films", and "Music", and computing an average rating for each work from user reviews. Built on Django and DjangoRestFramework.
Language: Python - Size: 178 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Darthdevv/Fast-React-Pizza
A website to order pizza and track your orders using React, React router, Tailwind and Redux
Language: JavaScript - Size: 148 KB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 1

lbhm/dl2
An experiment sandbox for Deep Learning Data Loading analysis.
Language: Python - Size: 329 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

SunDoge/utfrecord
Fast TFRecord Reader powered by io-uring.
Language: Python - Size: 27.3 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

richardwarepam16/ETL-Data_Pipelining_Project_using_AWSservice
Streamline your data flow with AWS Data Pipelining - a reliable and scalable solution for seamless data ingestion, processing, and storage
Language: Jupyter Notebook - Size: 487 KB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

satishgunjal/House-Price-Prediction-Project
Contains all my data science projects.
Language: Jupyter Notebook - Size: 478 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 7

anushkachauhxn/react-software-architecture
React software architecture techniques and examples. Includes server-side rendering, data loading and code splitting.
Language: JavaScript - Size: 332 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

SyncfusionExamples/Xamarin-Populate-Accordion-Items-using-Bindable-Layout
This repository contains a sample that shows how to populate accordion items using a bindable layout
Language: C# - Size: 505 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 2

dburles/suspense-data-loader
An experimental React suspense and concurrent mode compatible data loading library
Language: JavaScript - Size: 22.5 KB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Mindful/m2data
A Python package for working with GEC data in .m2 files
Language: Python - Size: 27.3 KB - Last synced at: 8 days ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

RSKothari/Data2H5
This tool rapidly converts loose files scattered within any folder into a consolidated H5 file. This allows for faster read operations with lower memory requirement.
Language: Python - Size: 13.7 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

chrishutchinson/react-async-status
A simple React hook for managing the status of an async action and an associated message
Language: TypeScript - Size: 1.05 MB - Last synced at: 3 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

rutvik5/ncsu-csc591-fds
Homeworks and R projects for the course Foundations of Data Science
Language: R - Size: 8.98 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

BrainPlugAI/bp-storage
Library to load various vision datasets from disk
Language: Python - Size: 26.4 KB - Last synced at: 14 days ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0
