An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: ml-pipelines

Ark-kun/pipeline_components

Components that I have created for Kubeflow Pipelines. Try them in https://cloud-pipelines.net/pipeline-editor/

Language: Python - Size: 1020 KB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 14 - Forks: 4

whylabs/whylogs

An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈

Language: Jupyter Notebook - Size: 181 MB - Last synced at: 9 days ago - Pushed at: 6 months ago - Stars: 2,730 - Forks: 127

souravlouha/ML_Practice_Studio

🧠 A hands-on workspace for practicing machine learning concepts, data preprocessing, and experimenting with small ML projects. This repo includes foundational Python scripts, real-world mini-projects, and experiments that reflect a progressive learning journey in applied machine learning.

Language: Jupyter Notebook - Size: 417 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 3 - Forks: 0

sematic-ai/sematic

An open-source ML pipeline development platform

Language: Python - Size: 20.2 MB - Last synced at: 15 days ago - Pushed at: 6 months ago - Stars: 991 - Forks: 63

zetane/ZetaForge

Open source AI platform for rapid development of advanced AI and AGI pipelines.

Language: Python - Size: 152 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 176 - Forks: 18

evidentlyai/ml_observability_course

Free open-source ML observability course for data scientists and ML engineers. Learn how to monitor and debug your ML models in production.

Language: Jupyter Notebook - Size: 25.2 MB - Last synced at: 18 days ago - Pushed at: over 1 year ago - Stars: 89 - Forks: 30

saurabh-kudesia/real-world-ai-projects

A collection of real-world machine learning and AI projects. Explore hands-on implementations of cutting-edge models, practical solutions, and techniques to tackle real-world challenges using AI.

Language: Jupyter Notebook - Size: 6.67 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 2 - Forks: 0

chrisliatas/dsnd-ml-pipeline

ML pipeline to categorize emergency messages based on the needs communicated by the sender.

Language: Jupyter Notebook - Size: 2.98 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 2 - Forks: 0

udellgroup/oboe

An AutoML pipeline selection system to quickly select a promising pipeline for a new dataset.

Language: Python - Size: 325 MB - Last synced at: 26 days ago - Pushed at: over 3 years ago - Stars: 83 - Forks: 17

IBM/sail

Library for streaming data and incremental learning algorithms.

Language: Python - Size: 28.7 MB - Last synced at: 8 days ago - Pushed at: 2 months ago - Stars: 24 - Forks: 12

SeekAI-786/Electricity_Theft_Detection

Electricity theft is a major issue in regions like Karachi, where unauthorized consumption of electricity leads to significant losses for utility companies. This project provides a solution for detecting electricity theft using machine learning models. By analyzing various factors such as electricity usage, voltage fluctuations, and historical data

Language: Python - Size: 3.3 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

Shogun93/sematic

Platform to build resource-intensive pipelines with simple Python.

Language: Python - Size: 13.8 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

mixail0916/sematic

Platform to build resource-intensive pipelines with simple Python.

Language: Python - Size: 16.5 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

zacharyvunguyen/Production-Ready-ML-Pipeline-on-GCP-Baby-Weight-Prediction

In this project, I developed a complete pipeline using the Vertex AI and Kubeflow Pipelines SDKs to build and deploy an AutoML / BigQuery ML regression model for online predictions. With this ML pipeline, I was able to develop, deploy, and manage the production ML lifecycle efficiently and reliably.

Language: Jupyter Notebook - Size: 16.1 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

opctl/opctl

Free and open source automation platform

Language: Go - Size: 440 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 50 - Forks: 17

banzuzi-carioni/cross-border-electricity-flow-prediction

Serverless ML system to predict the direction and volume of electricity flows to and from the Netherlands and its energy transmission partners.

Language: Python - Size: 44 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 11 - Forks: 0

martynas-subonis/ml-workflows

Guide on how to structure and implement machine learning pipelines.

Language: Python - Size: 923 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

bodywork-ml/ml-pipeline-engineering

Best practices for engineering ML pipelines.

Language: Jupyter Notebook - Size: 244 KB - Last synced at: 4 months ago - Pushed at: about 3 years ago - Stars: 37 - Forks: 9

mopechowski/mlops-case-study

Example solution of the MLOps Case Study covering both online and batch processing.

Language: Pkl - Size: 98.6 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

mbalcerzak/the-warsaw-project

Website built in JavaScript & React as a "blog" to document an ML pipeline I built for the Apartment Price Scraping project

Language: Python - Size: 48.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

msunda17/sml-project

Epic-Diffusion

Language: Python - Size: 383 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Fahlevi20/CI-CD-for-Machine-Learning-Github-Actions

Learning to create CI/CD for machine learning pipelines with GitHub Actions

Language: Jupyter Notebook - Size: 4.51 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

ismailsimsek/StoreSalesTimeSeriesForecasting

Testing preprocessing capabilities of different ML libraries

Language: Python - Size: 111 KB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

oozdal/Mobile-Price-Classification-with-AWS-SageWaker

This project focuses on building an end-to-end machine learning pipeline using AWS SageMaker to predict the price range of mobile phones based on their specifications, enhancing consumer decision-making and streamlining the development process.

Language: Jupyter Notebook - Size: 2.15 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Elkinmt19/airflow-master

This is a repo that was created to learn more about Airflow and develop awesome data engineering projects. 🚀🚀

Language: Python - Size: 3.33 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 3

tbsraja/Personalized_Cancer_Treatment

Develop algorithms to classify genetic mutations based on clinical evidence (text).

Language: Jupyter Notebook - Size: 1.28 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

yvgupta03/Big_Data_Project_US-Airlines_Tweet_Processing_and_Analysis

Big data application of machine learning concepts for sentiment classification of US Airlines tweets. The focus is on using pyspark libraries (ml-lib) on big data to solve a problem with machine learning algorithms, rather than on the choice of algorithm used to create the ML model. It also involves data pre-processing using NLP techniques, cross-validation and parameter-grid builder.

Language: Jupyter Notebook - Size: 1.83 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

JZMNE/ML_Pipelines

This shows the machine learning pipeline for regression, classification and clustering using PyCaret 3.0 in a Jupyter notebook

Size: 14.6 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

leosmerling-hopeit/fraud-poc

Fraud detection ML pipeline and serving POC using Dask and hopeit.engine. Project created with nbdev: https://www.fast.ai/2019/12/02/nbdev/

Language: Jupyter Notebook - Size: 8.06 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 4

SteliosGian/model-workflow (fork of udacity/nd0821-c2-build-model-workflow-starter)

Course 2 project of the Udacity ML DevOps Nanodegree Program

Language: Python - Size: 5.81 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

prateeksawhney97/Disaster-Response-Pipeline

This project is part of the Data Science Nanodegree Program by Udacity in collaboration with Figure Eight. The initial dataset contains pre-labelled tweets and messages from real-life disasters. The aim of this project is to build a Natural Language Processing tool that categorizes messages.

Language: Jupyter Notebook - Size: 8.53 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 6 - Forks: 2

siddarthaThentu/Disaster-Response-Pipeline

A deployed machine learning model that automatically classifies incoming disaster messages into 36 related categories. Project developed as a part of Udacity's Data Science Nanodegree program.

Language: Python - Size: 9.57 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

Related Keywords
ml-pipelines (32), machine-learning (16), mlops (11), python (8), data-science (7), ml (5), ai (3), ml-ops (3), automl (3), deep-learning (3), python3 (3), model-performance (2), llm (2), nlp (2), data-analysis (2), fine-tuning (2), docker (2), pipeline (2), ml-pipeline (2), etl-pipeline (2), devops (2), kubeflow-pipelines (2), data-quality (2), end-to-end-machine-learning (2), vertex-ai (2), aws-sagemaker (1), pre-processing (1), predictive-analysis (1), linear-regression (1), github-actions (1), javascript (1), ml-production (1), ci-cd (1), react (1), real-estate (1), actions (1), video (1), diffusion-models (1), bigquery (1), bigqueryml (1), google (1), google-cloud-platform (1), machine (1), productionml (1), streamlit (1), automation (1), containers (1), development (1), energy-data (1), etl (1), hopsworks (1), serverless (1), xgboost-regression (1), model-training (1), tutorial (1), containerization (1), ml-serving (1), packaging-python (1), docusaurus (1), render-deployment (1), dask-distributed (1), dask-ml (1), fraud-detection (1), microservices (1), nbdev (1), hydra (1), mlflow (1), udacity-nanodegree (1), weights-and-biases (1), disaster-management (1), disaster-response (1), flask-application (1), flask-sqlalchemy (1), bootstrap (1), data-analytics (1), ensemble-models (1), etl-pipelines (1), feature-engineering (1), flask (1), hyperparameter-optimization (1), plotly (1), sagemaker-deployment (1), streamlit-webapp (1), airflow (1), dags (1), data-engineering (1), data-pipelines (1), orchestration (1), logistic-regression (1), nlp-machine-learning (1), random-forest-classifier (1), svm-classifier (1), big-data (1), databricks-notebooks (1), pyspark-mllib (1), twitter-sentiment-analysis (1), classification-model (1), clustering (1), machine-learning-algorithms (1), pycaret (1)