Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: stratified-sampling

Lefteris-Souflas/Business-Analytics-Case-Studies

Three business analytics case studies were undertaken, encompassing market basket analysis, customer segmentation, and campaign management. SAS Visual Data Mining and Machine Learning on SAS Viya was utilized to explore data and provide insights. A comprehensive report addressing both technical and business aspects was delivered.

Size: 1.31 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

Divya-Bhargavi/wids_datathon_2019

Women in Data Science Competition

Language: Jupyter Notebook - Size: 88.3 MB - Last synced: 4 months ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0

RimTouny/Image-Classification-using-Chars74K-dataset

Employing advanced techniques, the project seamlessly integrates binary and multiclass classifiers for character classification. It offers a comprehensive analysis and adeptly addresses challenges in the realm of computer vision. This project was part of my uOttawa Master's in Computer Vision course (2023).

Language: Jupyter Notebook - Size: 374 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

MsTao-68/Debt-Churn-Data-Analysis

Predicts customer credit churn using anonymized data provided by the competition organizers.

Language: Jupyter Notebook - Size: 15 MB - Last synced: 5 months ago - Pushed: about 2 years ago - Stars: 5 - Forks: 0

langthom/sirasac

A C library with Python bindings for efficient stratified random sampling from binary buffers or files.

Language: C - Size: 909 KB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

Nikhilkohli1/Natural-Language-Processing

This repository contains Natural Language Processing Projects like Sarcasm Detection, Quora Insincere Questions Classification & Edgar Sentiment Analysis

Language: Jupyter Notebook - Size: 22.6 MB - Last synced: 7 months ago - Pushed: almost 4 years ago - Stars: 1 - Forks: 2

anthonyli01/R-Derivatives-Pricing

University Project: simulation techniques to price derivatives. It will involve Monte-Carlo, variance-reduction techniques, and advanced simulation methods.

Size: 1.38 MB - Last synced: 9 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

anthonyli01/Advanced-Simulation-Methods

This project focuses on applying advanced simulation methods for derivatives pricing. It includes Monte-Carlo, Variance Reduction Techniques, Distribution Sampling Methods, Euler Schemes, and Milstein Schemes.

Language: Jupyter Notebook - Size: 1.37 MB - Last synced: 4 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 1
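
The two derivatives-pricing entries above list Monte-Carlo and variance-reduction techniques, and both carry the stratified-sampling topic. As a rough illustration of how stratification can reduce variance in that setting (not code taken from either repository), the sketch below prices a European call under geometric Brownian motion twice: once with plain normal draws, and once with one uniform draw per equal-width stratum of [0, 1) mapped through the inverse normal CDF. All function names and parameter values are hypothetical.

```python
import math
import random
from statistics import NormalDist


def call_payoff(z, s0, strike, rate, sigma, maturity):
    """Discounted call payoff for one standard-normal draw z (GBM terminal price)."""
    drift = (rate - 0.5 * sigma ** 2) * maturity
    terminal = s0 * math.exp(drift + sigma * math.sqrt(maturity) * z)
    return math.exp(-rate * maturity) * max(terminal - strike, 0.0)


def price_plain(n, rng, **params):
    """Plain Monte Carlo: average the payoff over n independent normal draws."""
    return sum(call_payoff(rng.gauss(0.0, 1.0), **params) for _ in range(n)) / n


def price_stratified(n, rng, **params):
    """Stratified Monte Carlo: one uniform draw in each of n equal strata of [0, 1)."""
    inv_cdf = NormalDist().inv_cdf
    total = 0.0
    for i in range(n):
        u = (i + rng.random()) / n           # uniform within stratum i
        u = min(max(u, 1e-12), 1.0 - 1e-12)  # keep inv_cdf away from 0 and 1
        total += call_payoff(inv_cdf(u), **params)
    return total / n


# Hypothetical contract and market parameters, chosen only for illustration.
params = dict(s0=100.0, strike=100.0, rate=0.05, sigma=0.2, maturity=1.0)
print(price_plain(1000, random.Random(0), **params))
print(price_stratified(1000, random.Random(0), **params))
```

Because every stratum contributes exactly one draw, the stratified estimate cannot over- or under-sample any region of the distribution, which is where the variance reduction comes from in this one-dimensional case.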

garciparedes/statistical-sampling-stratified

Language: TeX - Size: 1.22 MB - Last synced: 10 months ago - Pushed: over 6 years ago - Stars: 1 - Forks: 0

pagoma3/Sampling

Sprint 6, Task 1

Language: Jupyter Notebook - Size: 156 KB - Last synced: 10 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

StarlangSoftware/Sampling-CPP

Data sampling library

Language: C++ - Size: 185 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 2 - Forks: 1

dataditya/US-Airlines-Delay-Analysis-2023

The objective is to analyze flight delays in the United States. Data from airlines, airports, and runways will be collected and processed. Machine learning models will be built using logistic regression, decision trees, and XGB classifiers. Visualizations will be created in Tableau, and Excel dashboards and SQL queries will be used for analysis.

Language: Jupyter Notebook - Size: 35 MB - Last synced: 11 months ago - Pushed: 12 months ago - Stars: 2 - Forks: 0

iamkirankumaryadav/Loan

Loan Approval Prediction

Language: Jupyter Notebook - Size: 968 KB - Last synced: 12 months ago - Pushed: about 3 years ago - Stars: 0 - Forks: 1

zca21/Statistical_Consultancy

Code to help the presentation to the client. The Sampling code.Rmd file contains the code performing the sampling method and produces the visualisations and diagnostics seen in the presentation.

Size: 85.9 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

StarlangSoftware/Sampling-Py

Data sampling library

Language: Python - Size: 72.3 KB - Last synced: 28 days ago - Pushed: over 1 year ago - Stars: 3 - Forks: 0

aktgpt/onlinetripletmining

Fast Online Triplet mining in Pytorch

Language: Python - Size: 6.84 KB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 5 - Forks: 1

rochitasundar/Twitter-Sentiment-Analysis

Data consists of tweets scraped using the Twitter API. The objective is sentiment labelling using a lexicon approach, performing text pre-processing (such as language detection, tokenisation, normalisation, vectorisation), building pipelines for text classification models for sentiment analysis, followed by explainability of the final classifier.

Language: Jupyter Notebook - Size: 3.71 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 3 - Forks: 0

shivtosh/Stroke_prediction

Models implemented for stroke prediction amongst individuals

Size: 2.18 MB - Last synced: 11 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

david-garza/Credit_Risk_Analysis

Supervised machine learning model to classify loan applicants into high and low risk categories

Language: Jupyter Notebook - Size: 136 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

kristoffhernan/ProfessorSurvey

Web scraper to get professor information, and a mass emailer that sends a website with a survey.

Language: Jupyter Notebook - Size: 7.49 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

rrfsantos/Projeto-Redes-Neurais-OCT-Images

BI Master - Automated methods to detect and classify human diseases from medical images. Convolutional Neural Network, Data Augmentation, Transfer Learning, Tensorflow, Keras, Xception, ImageNet, StratifiedKFold.

Language: Jupyter Notebook - Size: 20.3 MB - Last synced: 6 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 1

nelsoncardenas/Coursera-IBM-Project

Language: Jupyter Notebook - Size: 176 KB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 3 - Forks: 0

StarlangSoftware/Sampling

Data sampling library

Language: Java - Size: 200 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 2 - Forks: 0

imane-ayouni/California-Housing-Price-Predictions

Regression algorithms to predict the median house prices in California districts

Language: Jupyter Notebook - Size: 4.51 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

StarlangSoftware/Sampling-Js

Data Sampling Library

Language: TypeScript - Size: 29.3 KB - Last synced: 4 days ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

arbasher/straSplit

Stratification of multi-label datasets

Language: HTML - Size: 66.4 MB - Last synced: 12 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

StarlangSoftware/Sampling-Cy

Data sampling library

Language: Python - Size: 48.8 KB - Last synced: 23 days ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

StarlangSoftware/Sampling-CS

Data sampling library

Language: C# - Size: 50.8 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

StarlangSoftware/Sampling-Swift

Data sampling library

Language: Swift - Size: 56.6 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

saminens/Women-in-Data-Science-2020

WiDS Datathon 2020 on patient health through data from MIT’s GOSSIS (Global Open Source Severity of Illness Score) initiative.

Language: Jupyter Notebook - Size: 929 KB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 2 - Forks: 1

jesussantana/Sampling

Perform Data Sampling with Python

Language: Jupyter Notebook - Size: 5.04 MB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 1 - Forks: 0

MasashiSode/mcs_kfold

mcs_kfold stands for "monte carlo stratified k fold". This library attempts to achieve equal distribution of discrete/categorical variables in all folds. The greatest advantage of this method is that it can be applied to multi-dimensional targets.

Language: Python - Size: 130 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 57 - Forks: 1
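
The idea in the mcs_kfold description above, evening out several categorical variables across folds at once, can be sketched with a simple Monte Carlo search. This is not the mcs_kfold API, which is not shown here; it is a minimal standard-library sketch that assumes rows are tuples of categorical values, tries many random balanced fold assignments, and keeps the one whose category shares differ least between folds. The function names and the imbalance score are hypothetical choices.

```python
import random
from collections import Counter


def fold_imbalance(rows, assignment, n_folds):
    """Sum over columns and categories of (max - min) category share across folds.

    A score of 0 means every category has the same share in every fold.
    """
    fold_sizes = Counter(assignment)
    total = 0.0
    for col in range(len(rows[0])):
        counts = [Counter() for _ in range(n_folds)]
        for row, fold in zip(rows, assignment):
            counts[fold][row[col]] += 1
        for category in {row[col] for row in rows}:
            shares = [counts[f][category] / fold_sizes[f] for f in range(n_folds)]
            total += max(shares) - min(shares)
    return total


def monte_carlo_stratified_folds(rows, n_folds=5, n_trials=200, seed=0):
    """Return (assignment, score) for the best of n_trials random balanced splits."""
    rng = random.Random(seed)
    base = [i % n_folds for i in range(len(rows))]  # equal-sized folds
    best, best_score = None, float("inf")
    for _ in range(n_trials):
        candidate = base[:]
        rng.shuffle(candidate)
        score = fold_imbalance(rows, candidate, n_folds)
        if score < best_score:
            best, best_score = candidate, score
    return best, best_score


# Hypothetical data: two categorical columns per row.
rows = [(i % 2, i % 3) for i in range(60)]
folds, score = monte_carlo_stratified_folds(rows, n_folds=5, n_trials=100)
print(score)
```

Scoring all columns together is what lets a search like this handle multi-dimensional targets; more trials lower the score at the cost of run time.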

divya21raj/RealEstate-Modelling

Predicting house prices in an area

Language: Jupyter Notebook - Size: 22 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

shanujshekhar/Visual_Data_Analytics

Performing common visual data analytic tasks using Python and D3.js.

Language: HTML - Size: 1.39 MB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0

shreyasbhatia09/Google-Analytics-Customer-Revenue-Prediction

Kaggle Challenge

Language: Jupyter Notebook - Size: 449 KB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 1 - Forks: 0

Related Keywords
stratified-sampling: 35 - kfold-cross-validation: 8 - cross-validation: 7 - bootstrap: 7 - machine-learning: 4 - sampling: 3 - python: 3 - random-forest: 3 - lightgbm: 3 - logistic-regression: 3 - stratified-cross-validation: 2 - hyperparameter-tuning: 2 - xgboost-model: 2 - kaggle-competition: 2 - clustering: 2 - hyperparameter-optimization: 2 - pandas: 2 - exploratory-data-analysis: 2 - antithetic-variates: 2 - control-variates: 2 - derivatives-pricing: 2 - inverse-transform-method: 2 - monte-carlo: 2 - variance-reduction: 2 - von-neumann: 2 - classification: 2 - xgboost: 2 - k-fold-cross-validation: 2 - data-visualization: 2 - oversampling: 2 - reservoir-sampling: 2 - pytorch: 2 - feature-importance: 2 - random-sampling: 2 - neural-network: 2 - scikit-learn: 2 - image-classification: 2 - decision-tree: 2 - eda: 2 - smote-sampling: 2 - decison-trees: 1 - support-vector-machines: 1 - k-nearest-neighbours: 1 - classification-model: 1 - undersampling: 1 - f1-score: 1 - spark: 1 - hadoop: 1 - association-rules: 1 - courier: 1 - xception-model: 1 - vgg16-filters: 1 - vgg16: 1 - css3: 1 - html5: 1 - mass-emailer: 1 - transfer-learning: 1 - tensorflow: 1 - keras-tensorflow: 1 - keras: 1 - python3: 1 - imagenet: 1 - sass: 1 - selenium: 1 - web-scraping: 1 - augmentation: 1 - convolutional-neural-networks: 1 - datascience: 1 - linear-regression: 1 - label-encoding: 1 - mice: 1 - shap-values: 1 - shapely: 1 - wids: 1 - wids-datathon: 1 - preprocessing: 1 - systematic-sampling: 1 - applied-machine-learning: 1 - real-estate: 1 - elbow-method: 1 - k-means-clustering: 1 - multidimensional-scaling: 1 - pca-analysis: 1 - scatterplot-matrix: 1 - scree-plot: 1 - customer-revenue-prediction: 1 - imbalanced-learning: 1 - one-hot-encode: 1 - predictive-modeling: 1 - mae: 1 - mse: 1 - rmse: 1 - xgboost-regression: 1 - active-learning: 1 - class-imbalance: 1 - community-detection: 1 - graph-learning: 1 - label-propagation: 1 - multi-label-imbalance: 1 - multi-label-learning: 1