GitHub topics: random-sampling
rapidsai/raft
RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.
Language: Cuda - Size: 15.4 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 929 - Forks: 214

pblischak/zprob
A Zig Module for Random Number Distributions
Language: Zig - Size: 2.11 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 10 - Forks: 1

debashisdash1999/snowflake_proj11_data_sampling
This project demonstrates data sampling techniques in Snowflake. It covers loading datasets from S3, performing RANDOM and SYSTEM sampling methods to extract subsets, validating sampled data, and optimizing analysis on datasets.
Size: 4.88 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 0

willGuimont/prosac
PROSAC algorithm in Python
Language: Python - Size: 36.1 KB - Last synced at: 14 days ago - Pushed at: 11 months ago - Stars: 45 - Forks: 7

ocramz/splitmix-distributions
Sampling procedures for some common random variables based on splitmix
Language: Haskell - Size: 30.3 KB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

probsys/optimal-approximate-sampling
Optimal approximate sampling from discrete probability distributions
Language: Python - Size: 64.5 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 18 - Forks: 0

baggepinnen/Hyperopt.jl
Hyperparameter optimization in Julia.
Language: Julia - Size: 213 KB - Last synced at: 1 day ago - Pushed at: almost 2 years ago - Stars: 202 - Forks: 19

monty-se/skellam
R package for statistical modeling with the Skellam distribution, supporting inference, random sampling, and regression for differences of independent Poisson counts.
Language: R - Size: 56.6 KB - Last synced at: 24 days ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

daqana/dqrng
Fast Pseudo Random Number Generators for R
Language: C++ - Size: 6.61 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 43 - Forks: 8

smahala02/Monte-Carlo-Simulation
A comprehensive tutorial on Monte Carlo Simulation using Python, demonstrating how random sampling and probabilistic models can be used for various real-world applications, including finance, physics, and engineering.
Language: Jupyter Notebook - Size: 149 KB - Last synced at: 5 months ago - Pushed at: 12 months ago - Stars: 2 - Forks: 0

jlumbroso/affirmative-sampling
Reference implementation of the Affirmative Sampling algorithm by Jérémie Lumbroso and Conrado Martínez (2022). 🍀
Language: Python - Size: 794 KB - Last synced at: 2 days ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 0

pngo1997/N-gram-Language-Models
Builds N-gram language models and applies them to text generation.
Language: Jupyter Notebook - Size: 4.73 MB - Last synced at: 6 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

aneeshnaik/lintsampler
Efficient random sampling via linear interpolation.
Language: Python - Size: 54.1 MB - Last synced at: 2 days ago - Pushed at: 12 months ago - Stars: 12 - Forks: 2

LKEthridge/SDA_Project
A Statistical Data Analysis project from TripleTen
Language: Jupyter Notebook - Size: 2.8 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

Jungstershark/P-Median-Problem
The P-Median Problem project uses metaheuristic optimization to solve the p-median location problem, with Jupyter notebooks implementing random sampling and local search algorithms to minimize service distances.
Language: Jupyter Notebook - Size: 3.05 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

cicirello/small-sample-experiments
Code and data for experiments for paper "Algorithms for Generating Small Random Samples"
Language: Java - Size: 49.8 KB - Last synced at: 9 days ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

org-arl/AlphaStableDistributions.jl
Alpha stable and sub-Gaussian distributions in Julia
Language: Julia - Size: 26.4 MB - Last synced at: 21 days ago - Pushed at: 8 months ago - Stars: 6 - Forks: 4

shyammanikandan/Loan_Default_Analysis
Analyze a loan default dataset to understand the factors that contribute to loan defaults.
Language: Jupyter Notebook - Size: 13.6 MB - Last synced at: 6 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

AlexBuccheri/random_sampling
Personal random sampling testing
Language: Jupyter Notebook - Size: 1.5 MB - Last synced at: 3 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

lady-bluecopper/NuDHy
Null Models for Directed Hypergraphs
Language: Jupyter Notebook - Size: 53.6 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

Snawoot/terse
Output randomly sampled lines from input stream or file
Language: Go - Size: 21.5 KB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

spatstat/spatstat.core
sub-package of spatstat containing core functionality for data analysis and modelling
Language: R - Size: 3.83 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 9

vgherard/gsample
Efficient weighted sampling without replacement in R
Language: R - Size: 99.6 KB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

amitkp57/dbms-correlated-columns-detection
Detecting correlated columns in DBMS systems using techniques like Pearson Correlation, LSH Minhashing and Random Sampling.
Language: Jupyter Notebook - Size: 594 KB - Last synced at: 4 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

gautamHCSCV/Modelling_Viscoelastic_Objects
This paper proposes an alternative data-driven haptic modeling method for homogeneous deformable objects based on CatBoost, a variant of gradient boosting. In this approach, decision trees are trained sequentially to learn the required mapping function for modeling the objects.
Language: Jupyter Notebook - Size: 55.4 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Daniel-Ze/python_scripts
Collection of Python scripts
Language: Python - Size: 313 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

AlexZasorin/fastshuf.jl
Optimal implementation of reservoir sampling algorithm in Julia.
Language: Julia - Size: 7.81 KB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

xiusu/NAS-Bench-Macro
NAS Benchmark in "Prioritized Architecture Sampling with Monto-Carlo Tree Search", CVPR2021
Language: Python - Size: 464 KB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 38 - Forks: 8

antoyang/NAS-Benchmark
[ICLR 2020] NAS evaluation is frustratingly hard
Language: Python - Size: 1.23 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 145 - Forks: 23

Jangwonjin/valid_cdm
⚡ Validation method of cognitive diagnosis models (CDMs)
Language: Jupyter Notebook - Size: 1.5 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 1

kweterings/MonteCarlo_Estimation
An introduction to Monte Carlo methods by estimating π. This code comes in the form of a Python program.
Language: Python - Size: 24.4 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

seungwoo-stat/rvMF
Fast Generation of von Mises-Fisher Distributed Pseudo-Random Vectors
Language: R - Size: 59.6 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

gstamatelat/random-sampling
A collection of algorithms in Java 8 for the problem of random sampling with a reservoir
Language: Java - Size: 428 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 33 - Forks: 6

Jayplect/credit-risk-classification
In this project, I used a dataset containing the historical lending activity from a peer-to-peer lending services company to build a model that can identify the creditworthiness of borrowers.
Language: Jupyter Notebook - Size: 693 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

EdinZiga/PneumoniaImagesANN
CS404 Artificial Intelligence final project. This project is based on the Pneumonia Images dataset found on Kaggle. The goal was to classify the images using classic Artificial Neural Networks.
Language: Jupyter Notebook - Size: 386 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

EmilienN/Data-Science-Portfolio
Language: Jupyter Notebook - Size: 54.9 MB - Last synced at: 10 months ago - Pushed at: over 5 years ago - Stars: 9 - Forks: 3

zeroboo/nodejs-random-selector
A Node.js module for randomly selecting elements.
Language: JavaScript - Size: 1.69 MB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 1

aakankshaws/numpy-exercise
numpy practice exercise with solution
Language: Jupyter Notebook - Size: 157 KB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 10 - Forks: 9

bhattbhavesh91/text-generation-huggingface
Language: Jupyter Notebook - Size: 17.6 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 1

sprcoder/Customer_Segmentation_ML
A machine learning project that classifies customers/clients into the correct segment for promotional information or product advertising.
Language: Jupyter Notebook - Size: 683 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

acdmammoths/Bavarian-code
Code for the paper "Bavarian: Betweenness Centrality Approximation with Variance-Aware Rademacher Averages", by Chloe Wohlgemuth, Cyrus Cousins, and Matteo Riondato, appearing in ACM KDD'21 and ACM TKDD'23
Language: C++ - Size: 76.3 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

GeekEast/random-sampling-based-grouping
Create two-item groups from an even number of items.
Language: TypeScript - Size: 141 KB - Last synced at: 7 days ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

nani757/cca-mean-mode-arbitary-value-end-of-distribution-missing-data-
Complete case analysis drops the whole column if there are missing values; arbitrary value imputation replaces them with a value such as -1 or 99.999 instead of the mean or median; end-of-distribution imputation replaces the values with a "missing" term.
Language: Jupyter Notebook - Size: 802 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

probcomp/TracedRandom.jl
Make Julia code probabilistic-programming-ready by allowing calls to `rand` to be annotated with traced addresses.
Language: Julia - Size: 10.7 KB - Last synced at: 13 days ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

gstamatelat/rsx
A collection of random sampling algorithms in Python.
Language: Python - Size: 135 KB - Last synced at: 11 days ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

dadavalangege/Sampling_Methods
The aim of this project was to sample a sports data set
Language: Jupyter Notebook - Size: 198 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

jesussantana/Sampling
Perform Data Sampling with Python
Language: Jupyter Notebook - Size: 5.04 MB - Last synced at: 6 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

acharles7/data-science-notebooks
Credit card fraud detection, gender classification from name etc.
Language: Jupyter Notebook - Size: 2.34 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

shanujshekhar/Visual_Data_Analytics
Performing common visual data analytic tasks using Python and D3.js.
Language: HTML - Size: 1.39 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

edxz7/Random-Sampling-With-A-Reservoir
Source code written in Java and Python for random sampling without replacement with a reservoir
Language: Java - Size: 779 KB - Last synced at: 17 days ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 1
