An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: data-loading

get-convex/convex-js

TypeScript/JavaScript client library for Convex

Language: TypeScript - Size: 1.43 MB - Last synced at: about 17 hours ago - Pushed at: about 18 hours ago - Stars: 246 - Forks: 48

warp-drive-data/warp-drive

WarpDrive is a lightweight data library for web apps — universal, typed, reactive, and ready to scale.

Language: TypeScript - Size: 360 MB - Last synced at: about 19 hours ago - Pushed at: about 21 hours ago - Stars: 3,089 - Forks: 1,343

planet-a-ventures/dlt-source-personio

DLT (www.github.com/dlt-hub/dlt) source for Personio (www.personio.com)

Language: Python - Size: 212 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1 - Forks: 0

dlt-hub/dlt

data load tool (dlt) is an open source Python library that makes data loading easy 🛠️

Language: Python - Size: 95.1 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 4,073 - Forks: 320

Venecer/Perspective-AI

🤖 Set up and run your AI-powered bot easily with this user-friendly guide, minimizing terminal use for a streamlined experience.

Language: Shell - Size: 15.6 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

Ajinkya-99/snowflake_proj9_table_types

📊 Explore and manage different table types in Snowflake, including Permanent, Temporary, Transient, and External Tables, using AWS S3 data.

Size: 7.81 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

Khateeb21/snowflake_proj2_stages_and_transformations

🌨️ Load and transform data from Amazon S3 into Snowflake efficiently using stages, enhancing your data ingestion practices without altering source files.

Size: 11.7 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

Andrii04/Automated-Weather-ETL-Pipeline

Data engineering project simulating an end-to-end ETL pipeline for weather data. Automates extraction from the OpenWeatherMap API, data cleaning and transformation in Python, and loading into PostgreSQL, all orchestrated with Airflow. Delivers analysis-ready datasets for further exploration or visualization

Language: Python - Size: 178 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

debashisdash1999/snowflake_proj3_error_handling

Error Handling Hands-on project showcasing Snowflake data loading with error handling using VALIDATION_MODE, ON_ERROR = CONTINUE, ON_ERROR = SKIP_FILE, and ON_ERROR = SKIP_FILE_% while ingesting CSV files from AWS S3.

Size: 11.7 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 0

debashisdash1999/snowflake_proj4_validation_modes_copy_options

Hands-on project covering Snowflake data loading with custom file formats, validation modes, error handling, string length limits, TRUNCATECOLUMNS, and analyzing load history using account_usage.load_history.

Size: 8.79 KB - Last synced at: 20 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

debashisdash1999/snowflake_proj2_stages_and_transformations

This project demonstrates how to use Snowflake stages for loading data from Amazon S3 into Snowflake tables. It also covers applying transformations during loading and selecting only specific columns from the source data.

Size: 9.77 KB - Last synced at: 20 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

kai-tub/rico-hdl

A fast and easy-to-use Remote sensing Image format COnverter for High-throughput Deep-Learning (rico-hdl).

Language: Python - Size: 76.7 MB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 10 - Forks: 2

scalecraft-dev/preen

Local-first federated analytics query engine using DuckDB.

Language: Go - Size: 3.37 MB - Last synced at: 11 days ago - Pushed at: 7 months ago - Stars: 17 - Forks: 0

SyedaNimraFatima/Coffee-Shop-Sales-Analysis-SQL-PowerBI

A dynamic Power BI dashboard for analyzing sales, product trends, and customer behavior across NYC coffee shops. Built using Excel, DAX, and custom visuals to support business intelligence and decision-making.

Size: 8.33 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

hegongshan/Storage-for-AI-Paper

Accelerating AI Training and Inference from Storage Perspective (Must-read Papers on Storage for AI)

Size: 28.3 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 30 - Forks: 3

Estif-X/Complete-Data-Engineering-and-Analysis-Project

A team-friendly data pipeline: engineers automate data flows (Airbyte → PostgreSQL → dbt) while analysts create Power BI dashboards. Perfect for reliable, on-premise data workflows.

Size: 43.9 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

DevExpress-Examples/winforms-scheduler-optimize-performance-large-dataset

Optimize Scheduler performance for large datasets.

Language: C# - Size: 1.65 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 3

DarkStarStrix/DataVolt

Reusable data engineering toolkit My personal data infrastructure

Language: Jupyter Notebook - Size: 56.6 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 17 - Forks: 2

planet-a-ventures/dlt-source-affinity

DLT (www.github.com/dlt-hub/dlt) Source for Affinity (www.affinity.co)

Language: Python - Size: 195 KB - Last synced at: 19 days ago - Pushed at: 26 days ago - Stars: 2 - Forks: 0

Bmonter7/Online-Retail-EDA

This repository contains an end-to-end exploratory data analysis of transactional data from a UK-based online retail store covering the period from December 2010 to November 2011. The goal is to uncover sales trends, customer behavior, and product performance, and to provide actionable recommendations that can guide strategic business decisions.

Language: Jupyter Notebook - Size: 120 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

fversaci/cassandra-dali-plugin

Cassandra plugin for NVIDIA DALI

Language: C++ - Size: 778 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 2

planet-a-ventures/dlt-source-google-workspace

DLT (www.github.com/dlt-hub/dlt) source for Google Workspace

Language: Python - Size: 66.4 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

planet-a-ventures/dlt-source-airtable

DLT (www.github.com/dlt-hub/dlt) source for airtable (www.airtable.com)

Language: Python - Size: 63.5 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

planet-a-ventures/dlt-source-slack

DLT (www.github.com/dlt-hub/dlt) source for Slack (www.slack.com)

Language: Python - Size: 74.2 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

planet-a-ventures/dlt-source-notion

DLT (www.github.com/dlt-hub/dlt) source for Personio (www.notion.com)

Language: Python - Size: 304 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

planet-a-ventures/dlt-source-morphais

DLT (www.github.com/dlt-hub/dlt) source for Morphais (www.morphais.com)

Language: Python - Size: 186 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

npuichigo/tarzan

High-level API for tar-based dataset

Language: Python - Size: 27.3 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 1

suyashkumar/deeplesion-gcp-loader

Get the DeepLesion CT Image data set into a GCP Storage Bucket

Language: Go - Size: 8.79 KB - Last synced at: 5 months ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 1

Zenoo/slick-loader

A slick loader to use during your AJAX calls or data processing

Language: JavaScript - Size: 1.09 MB - Last synced at: 22 days ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

rtimbro185/syr_mads_ist722_data_warehouse

Syracuse University, Masters of Applied Data Science - IST 722 Data Warehouse

Language: TSQL - Size: 50.6 MB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 5

Andre3002/cmu-week2-pandas-seaborn

CMU week 2 - Stats, Data Load, Pandas, Visualization, Seaborn

Language: Jupyter Notebook - Size: 13.4 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

maksymsur/spltr 📦

`Spltr` is a simple PyTorch-based data loader and splitter. It may be used to load arrays and matrices or Pandas DataFrames and CSV files containing numerical data with subsequent split it into train, test (validation) subsets in the form of PyTorch DataLoader objects.

Language: Python - Size: 99.6 KB - Last synced at: 12 days ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

TakeLab/podium

Podium: a framework agnostic Python NLP library for data loading and preprocessing

Language: Python - Size: 2.19 MB - Last synced at: 10 days ago - Pushed at: over 2 years ago - Stars: 60 - Forks: 2

harmanveer-2546/World-Best-Cities

Ranking of cities on social, environmental and economic factors.

Language: Jupyter Notebook - Size: 707 KB - Last synced at: 6 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

Suhas-H-C/batch-processing-ms-v2

Spring batch processing with multiple datasources like mysql and h2

Language: Java - Size: 14.6 KB - Last synced at: 5 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

MelvinJWallace/MelvinJW.github.io

A portfolio of a host of projects completed using python and sql.

Language: CSS - Size: 8.56 MB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

YZenia/Pandas-Data-Analysis

This repository provides an introduction to essential data analysis libraries, including Numpy and Pandas.

Language: HTML - Size: 1.32 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

diyapratheep/EDA-on-Retail-Sales-Data

The goal is to perform exploratory data analysis (EDA) to uncover patterns, trends, and insights that can help the retail business make informed decisions.

Language: Jupyter Notebook - Size: 718 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

npuichigo/snake

Data loading with combined async Rust stream and Python

Language: Rust - Size: 211 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 1

motorro/RxLceModel

An Android library for data load with cache and loading state

Language: Kotlin - Size: 2.1 MB - Last synced at: 9 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

SheronRunodamoto/Mexico-Toy-Sales-Data-Warehouse

Dimensional Data Warehouse Project

Size: 109 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

brunodifranco/project-star-jeans-data-engineering

ETL building for an e-commerce Jeans company. Feel free to access the Streamlit App in the link below.

Language: Jupyter Notebook - Size: 178 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 1

mrrustemka/posts

Create Posts Form

Language: TypeScript - Size: 1.11 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

VishanthSurresh/Spotify-Capstone-Project---Data-Engineering

This repository is a working ETL framework which utilizes user data from Spotify API using ➲Python for Extraction and Transformation ➲SQL for Data Loading and Staging ➲Airflow for Data Orchestration and Monitoring ➲PowerBI for Reporting

Language: Python - Size: 2.1 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 1

JoaoHenriqueRX7/ETL--Data-Scrapping-Python-MySQL-

A Python-based, automates data extraction, transformation, and loading. It focuses ETL pipelines, web scrapping and MySQL database, leveraging Python libraries for processing and MySQL for storage.

Language: Jupyter Notebook - Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Andrew-Mysaka/fast-react-pizza

The website to order pizza and track your orders using React, React router, Tailwind and Redux

Language: JavaScript - Size: 551 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

emSpot/ImohEdet.github.io Fork of jekyllt/vitae

👨‍💼 Personal resume

Language: CSS - Size: 2.45 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

aleksandr-miheichev/review_ratings_platform

Интерактивная платформа для сбора пользовательских отзывов о различных видах искусства, классификации их на "Книги", "Фильмы" и "Музыку" и вычисления среднего рейтинга для каждого произведения на основе отзывов пользователей, работающая на основе Django и DjangoRestFramework.

Language: Python - Size: 178 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Darthdevv/Fast-React-Pizza

a website to order pizza and track your orders using React, React router, Tailwind and Redux

Language: JavaScript - Size: 148 KB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 1

lbhm/dl2

An experiment sandbox for Deep Learning Data Loading analysis.

Language: Python - Size: 329 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

SunDoge/utfrecord

Fast TFRecord Reader powered by io-uring.

Language: Python - Size: 27.3 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

richardwarepam16/ETL-Data_Pipelining_Project_using_AWSservice

Streamline your data flow with AWS Data Pipelining - a reliable and scalable solution for seamless data ingestion, processing, and storage

Language: Jupyter Notebook - Size: 487 KB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

satishgunjal/House-Price-Prediction-Project

Contains all my data science projects.

Language: Jupyter Notebook - Size: 478 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 7

anushkachauhxn/react-software-architecture

React software architecture techniques and examples. Includes server-side rendering, data loading and code splitting.

Language: JavaScript - Size: 332 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

SyncfusionExamples/Xamarin-Populate-Accordion-Items-using-Bindable-Layout

This repository contains the sample which showcases how to populate the accordion items using bindable layout

Language: C# - Size: 505 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 2

dburles/suspense-data-loader

An experimental React suspense and concurrent mode compatible data loading library

Language: JavaScript - Size: 22.5 KB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Mindful/m2data

A Python package for working with GEC data in .m2 files

Language: Python - Size: 27.3 KB - Last synced at: 8 days ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

RSKothari/Data2H5

This tool rapidly converts loose files scattered within any folder into a consolidated H5 file. This allows for faster read operations with lower memory requirement.

Language: Python - Size: 13.7 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

chrishutchinson/react-async-status

A simple React hook for managing the status of an async action and an associated message

Language: TypeScript - Size: 1.05 MB - Last synced at: 3 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

rutvik5/ncsu-csc591-fds

Homeworks and R projects for the course Foundations of Data Science

Language: R - Size: 8.98 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

BrainPlugAI/bp-storage

Library to load various vision datasets from disk

Language: Python - Size: 26.4 KB - Last synced at: 14 days ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

Related Keywords
data-loading 61 data-engineering 16 python 16 data-transformation 12 data-visualization 8 data-extraction 7 data-cleaning 7 sql 7 deep-learning 7 data-load-tool 7 dlt 7 dlthub 7 pandas 6 aws-s3 6 etl 6 data-science 6 cloud-data-warehouse 5 react 5 analytics 5 snowflake 5 portfolio-project 5 stages 4 etl-pipeline 4 data-preparation 4 matplotlib 3 javascript 3 seaborn 3 typescript 3 tensorflow 3 exploratory-data-analysis 3 dataset 3 data 3 machine-learning 3 tailwindcss 3 data-analysis 3 data-storage 3 react-router 3 redux-toolkit 2 react-hooks 2 preprocessing 2 airflow 2 postgresql 2 data-quality 2 error-handling 2 on-error 2 validation-mode 2 data-processing 2 time-series-analysis 2 pandas-python 2 outlier-detection 2 database 2 data-preprocessing 2 pytorch 2 query 2 data-modelling 2 async 2 csv 2 business-intelligence 2 datasets 2 natural-language-processing 2 data-exploration 2 data-warehouse 2 loader 2 numpy 2 model-training 2 mlsys 2 linear-regression 2 data-loader 2 checkpoint 2 dataloader 2 kitti 1 workbook 1 schema 1 erdiagram 1 invalidation 1 handling-missing-values-and-duplicates 1 disklrucache 1 data-cache 1 descriptive-analysis 1 lru-cache 1 lrucache 1 android 1 structured-concurrency 1 ui-status 1 rxjava2 1 data-preprocess 1 data-split 1 data-split-pytorch 1 easy-data-split 1 easy-split 1 easy-to-use 1 neural-networks 1 pytorch-dataloader-objects 1 pytorch-dataset-split 1 splitter 1 train-split-pytorch 1 train-test-split 1 train-test-validation 1 nlp 1 best-cities-to-live-in 1