Topic: "data-loading"
dlt-hub/dlt
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Language: Python - Size: 91.1 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3,687 - Forks: 283

emberjs/data
WarpDrive is a lightweight data library for web apps — universal, typed, reactive, and ready to scale.
Language: TypeScript - Size: 332 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 3,076 - Forks: 1,335

get-convex/convex-js
TypeScript/JavaScript client library for Convex
Language: TypeScript - Size: 1.55 MB - Last synced at: about 5 hours ago - Pushed at: about 6 hours ago - Stars: 189 - Forks: 33

TakeLab/podium
Podium: a framework agnostic Python NLP library for data loading and preprocessing
Language: Python - Size: 2.19 MB - Last synced at: 21 days ago - Pushed at: over 2 years ago - Stars: 60 - Forks: 2

hegongshan/Storage-for-AI-Paper
Accelerating AI Training and Inference from Storage Perspective (Must-read Papers on Storage for AI)
Size: 19.5 KB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 18 - Forks: 2

DarkStarStrix/DataVolt
Reusable data engineering toolkit. My personal data infrastructure.
Language: Jupyter Notebook - Size: 13.9 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 17 - Forks: 2

preendata/preen
Local-first federated analytics query engine using DuckDB.
Language: Go - Size: 3.37 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 17 - Forks: 0

npuichigo/tarzan
High-level API for tar-based dataset
Language: Python - Size: 27.3 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 0

kai-tub/rico-hdl
A fast and easy-to-use Remote sensing Image format COnverter for High-throughput Deep-Learning (rico-hdl).
Language: Python - Size: 76.7 MB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 9 - Forks: 2

VishanthSurresh/Spotify-Capstone-Project---Data-Engineering
A working ETL framework that uses user data from the Spotify API: ➲ Python for extraction and transformation ➲ SQL for data loading and staging ➲ Airflow for data orchestration and monitoring ➲ Power BI for reporting
Language: Python - Size: 2.1 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 7 - Forks: 1

motorro/RxLceModel
An Android library for data load with cache and loading state
Language: Kotlin - Size: 2.1 MB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 0

npuichigo/snake
Data loading with combined async Rust stream and Python
Language: Rust - Size: 211 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

satishgunjal/House-Price-Prediction-Project
Contains all my data science projects.
Language: Jupyter Notebook - Size: 478 KB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 5 - Forks: 7

rtimbro185/syr_mads_ist722_data_warehouse
Syracuse University, Masters of Applied Data Science - IST 722 Data Warehouse
Language: TSQL - Size: 50.6 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 4

planet-a-ventures/dlt-source-morphais
DLT (www.github.com/dlt-hub/dlt) source for Morphais (www.morphais.com)
Language: Python - Size: 186 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

Zenoo/slick-loader
A slick loader to use during your AJAX calls or data processing
Language: JavaScript - Size: 1.09 MB - Last synced at: 11 days ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

lbhm/dl2
An experiment sandbox for Deep Learning Data Loading analysis.
Language: Python - Size: 329 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

RSKothari/Data2H5
This tool rapidly converts loose files scattered within any folder into a consolidated H5 file. This allows for faster read operations with lower memory requirement.
Language: Python - Size: 13.7 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

fversaci/cassandra-dali-plugin
Cassandra plugin for NVIDIA DALI
Language: C++ - Size: 777 KB - Last synced at: 30 days ago - Pushed at: 30 days ago - Stars: 1 - Forks: 2

planet-a-ventures/dlt-source-affinity
DLT (www.github.com/dlt-hub/dlt) Source for Affinity (www.affinity.co)
Language: Python - Size: 190 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

planet-a-ventures/dlt-source-notion
DLT (www.github.com/dlt-hub/dlt) source for Notion (www.notion.com)
Language: Python - Size: 304 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

JoaoHenriqueRX7/ETL--Data-Scrapping-Python-MySQL-
A Python-based project that automates data extraction, transformation, and loading. It focuses on ETL pipelines, web scraping, and a MySQL database, leveraging Python libraries for processing and MySQL for storage.
Language: Jupyter Notebook - Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Andrew-Mysaka/fast-react-pizza
The website to order pizza and track your orders using React, React router, Tailwind and Redux
Language: JavaScript - Size: 551 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Darthdevv/Fast-React-Pizza
A website to order pizza and track your orders using React, React Router, Tailwind and Redux
Language: JavaScript - Size: 148 KB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

brunodifranco/project-star-jeans-data-engineering
ETL building for an e-commerce Jeans company. Feel free to access the Streamlit App in the link below.
Language: Jupyter Notebook - Size: 178 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

richardwarepam16/ETL-Data_Pipelining_Project_using_AWSservice
Streamline your data flow with AWS Data Pipelining - a reliable and scalable solution for seamless data ingestion, processing, and storage
Language: Jupyter Notebook - Size: 487 KB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

chrishutchinson/react-async-status
A simple React hook for managing the status of an async action and an associated message
Language: TypeScript - Size: 1.05 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

maksymsur/spltr 📦
`Spltr` is a simple PyTorch-based data loader and splitter. It can load arrays, matrices, Pandas DataFrames, or CSV files containing numerical data and then split them into train and test (validation) subsets in the form of PyTorch DataLoader objects.
Language: Python - Size: 99.6 KB - Last synced at: 19 days ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

BrainPlugAI/bp-storage
Library to load various vision datasets from disk
Language: Python - Size: 26.4 KB - Last synced at: 30 days ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

suyashkumar/deeplesion-gcp-loader
Get the DeepLesion CT Image data set into a GCP Storage Bucket
Language: Go - Size: 8.79 KB - Last synced at: 2 months ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

Estif-X/Complete-Data-Engineering-and-Analysis-Project
This is a team project involving data engineers and data analysts. It covers data extraction, ingestion, cleaning, transformation, and loading, through to data analysis and visualization. We will use a variety of on-premise and cloud platforms to make this happen.
Size: 17.6 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 0

Bmonter7/Online-Retail-EDA
This repository contains an end-to-end exploratory data analysis of transactional data from a UK-based online retail store covering the period from December 2010 to November 2011. The goal is to uncover sales trends, customer behavior, and product performance, and to provide actionable recommendations that can guide strategic business decisions.
Language: Jupyter Notebook - Size: 120 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 0

DevExpress-Examples/winforms-scheduler-optimize-performance-large-dataset
Optimize Scheduler performance for large datasets.
Language: C# - Size: 1.76 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 3

planet-a-ventures/dlt-source-google-workspace
DLT (www.github.com/dlt-hub/dlt) source for Google Workspace
Language: Python - Size: 66.4 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

planet-a-ventures/dlt-source-airtable
DLT (www.github.com/dlt-hub/dlt) source for Airtable (www.airtable.com)
Language: Python - Size: 63.5 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

planet-a-ventures/dlt-source-slack
DLT (www.github.com/dlt-hub/dlt) source for Slack (www.slack.com)
Language: Python - Size: 74.2 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

planet-a-ventures/dlt-source-personio
DLT (www.github.com/dlt-hub/dlt) source for Personio (www.personio.com)
Language: Python - Size: 205 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Andre3002/cmu-week2-pandas-seaborn
CMU week 2 - Stats, Data Load, Pandas, Visualization, Seaborn
Language: Jupyter Notebook - Size: 13.4 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

harmanveer-2546/World-Best-Cities
Ranking of cities on social, environmental and economic factors.
Language: Jupyter Notebook - Size: 707 KB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

Suhas-H-C/batch-processing-ms-v2
Spring batch processing with multiple datasources like mysql and h2
Language: Java - Size: 14.6 KB - Last synced at: 2 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

SyedaNimraFatima/Coffee-Shop-Sales-Analysis-SQL-PowerBI
This Repository's details will be updated in a while.
Size: 8.32 MB - Last synced at: 2 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

MelvinJWallace/MelvinJW.github.io
A portfolio of a host of projects completed using Python and SQL.
Language: CSS - Size: 8.56 MB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

YZenia/Pandas-Data-Analysis
This repository provides an introduction to essential data analysis libraries, including NumPy and Pandas.
Language: HTML - Size: 1.32 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

diyapratheep/EDA-on-Retail-Sales-Data
The goal is to perform exploratory data analysis (EDA) to uncover patterns, trends, and insights that can help the retail business make informed decisions.
Language: Jupyter Notebook - Size: 718 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

SheronRunodamoto/Mexico-Toy-Sales-Data-Warehouse
Dimensional Data Warehouse Project
Size: 109 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

SyncfusionExamples/Xamarin-Populate-Accordion-Items-using-Bindable-Layout
This repository contains the sample which showcases how to populate the accordion items using bindable layout
Language: C# - Size: 505 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 2

mrrustemka/posts
Create Posts Form
Language: TypeScript - Size: 1.11 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

aleksandr-miheichev/review_ratings_platform
An interactive platform for collecting user reviews of various kinds of art, classifying them into "Books", "Films", and "Music", and computing an average rating for each work based on user reviews, built on Django and DjangoRestFramework.
Language: Python - Size: 178 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

SunDoge/utfrecord
Fast TFRecord Reader powered by io-uring.
Language: Python - Size: 27.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

anushkachauhxn/react-software-architecture
React software architecture techniques and examples. Includes server-side rendering, data loading and code splitting.
Language: JavaScript - Size: 332 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

dburles/suspense-data-loader
An experimental React suspense and concurrent mode compatible data loading library
Language: JavaScript - Size: 22.5 KB - Last synced at: 5 days ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Mindful/m2data
A Python package for working with GEC data in .m2 files
Language: Python - Size: 27.3 KB - Last synced at: 7 days ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

emSpot/ImohEdet.github.io (fork of jekyllt/vitae)
👨💼 Personal resume
Language: CSS - Size: 2.45 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

rutvik5/ncsu-csc591-fds
Homework and R projects for the course Foundations of Data Science
Language: R - Size: 8.98 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0
