GitHub topics: scalable-data-analysis
parashardhapola/scarf
Toolkit for highly memory efficient analysis of single-cell RNA-Seq, scATAC-Seq and CITE-Seq data. Analyze atlas scale datasets with millions of cells on laptop.
Language: Python - Size: 32.4 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 105 - Forks: 15
COM6012/ScalableML
COM6012 Scalable Machine Learning - University of Sheffield. Enjoy our resources? ⭐ Star this repository to show your support and help others discover it!
Language: HTML - Size: 268 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 88 - Forks: 85
kaydotdev/stochastic-quantization
Robust and Scalable Stochastic Quasi-Gradient Clustering
Language: Jupyter Notebook - Size: 17.7 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 1
efeag/aga-MSDA
This repository contain projects completed during my graduate study in Data Science & Analytics at the J. Mack Robinson College of Business, Georgia State University. I worked as part of a team of 4 or 6 members and we equally contributed in completing tasks and preparing final documentations (code file, report & PowerPoint presentation).
Language: Jupyter Notebook - Size: 6.87 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0
emmalanguage/emma
A quotation-based Scala DSL for scalable data analysis.
Language: Scala - Size: 9.16 MB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 63 - Forks: 19
terilios/automated_data_scientist
Automated Data Scientist: An intelligent, adaptive data analysis tool that leverages AI-driven automation to dynamically plan, execute, and refine data science workflows. Automatically handles data preparation, analysis planning, code generation, and result interpretation using advanced language models.
Language: Python - Size: 207 KB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1
YogiOnBioinformatics/Computational-Drug-Discovery-Internship-at-Merck
Description of work done at Merck pharmaceutical company in the summer of 2018 as a Computational Drug Discovery Intern at West Point, PA. Information excludes all proprietary information belonging to Merck & Co.
Language: Python - Size: 293 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 2
Caleydo/lineupjs Fork of lineupjs/lineupjs
Fork and custom implementation of LineUp Library for Visual Analysis of Multi-Attribute
Language: TypeScript - Size: 11.3 MB - Last synced at: 3 months ago - Pushed at: over 6 years ago - Stars: 70 - Forks: 4
Caleydo/taggle 📦
deprecated use lineup.js develop branch instead
Language: TypeScript - Size: 416 KB - Last synced at: 11 months ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0
mmaguero/cloud-based-tool-SA
A cloud-based tool for sentiment analysis in reviews about restaurants on TripAdvisor
Language: Python - Size: 12.9 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0
lapets/course-data-mechanics
Lecture notes and other materials for a one-semester course on data mechanics.
Language: HTML - Size: 73.2 KB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 1
JayLohokare/sparkGIS
Spark GIS (Docker + Flask Webserver + SparkGIS)
Language: Java - Size: 11.4 MB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0
manuparra/knowledgegraphs
Knowledge data processing
Language: HTML - Size: 92.8 KB - Last synced at: 6 months ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 1
emmalanguage/emma-lib
Language: Scala - Size: 146 KB - Last synced at: 3 months ago - Pushed at: about 8 years ago - Stars: 2 - Forks: 4