An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: data-programming

JieyuZ2/wrench

[NeurIPS 2021] WRENCH: Weak supeRvision bENCHmark

Language: Python - Size: 1.81 MB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 223 - Forks: 34

decile-team/spear

SPEAR: Programmatically label and build training data quickly.

Language: Jupyter Notebook - Size: 432 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 106 - Forks: 20

JieyuZ2/Awesome-Weak-Supervision

A curated list of programmatic weak supervision papers and resources

Language: TeX - Size: 322 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 190 - Forks: 27

benbo/interactive-weak-supervision

Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling

Language: Python - Size: 45.9 KB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 31 - Forks: 4

megagonlabs/ruler

Data Programming by Demonstration (DPBD) for Document Classification

Language: Jupyter Notebook - Size: 17 MB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 35 - Forks: 6

sfpugh/Naturally-Adversarial-Datasets

An approach to curating naturally adversarial datasets.

Language: Python - Size: 314 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

megagonlabs/tagruler

Data programming by demonstration for information extraction and span annotation

Language: JavaScript - Size: 82.6 MB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 35 - Forks: 6

junchenzhi/Neural-Hidden-CRF

Code for the KDD-2023 paper: Neural-Hidden-CRF: A Robust Weakly-Supervised Sequence Labeler

Language: Python - Size: 6.37 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 0

junchenzhi/Awesome-Weak-Supervision-Sequence-Labeling

A curated list of awesome Weak-Supervision-Sequence-Labeling (WSSL) papers, methods & resources.

Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

sampathkethineedi/snorkel-process

Process flow to generate labels on Text data using Snorkel and maintain DB to repurpose unlabelled data

Language: Python - Size: 30.3 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 1

ayushbits/Semi-Supervised-LFs-Subset-Selection

This repository contains source code of our ACL 2021 paper **Data Programming using Semi-Supervision and Subset Selection**

Language: Python - Size: 1.67 GB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 2

ayushbits/robust-aggregate-lfs

Source code of our ACL 2022 paper 'Learning to robustly aggregate labeling functions for semi-supervised data programming'

Language: Python - Size: 32.6 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 2

cse163/book

Source code for the CSE 163: Intermediate Data Programming book (with code for practice problems)

Language: Jupyter Notebook - Size: 39.5 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 3

MrPatrek/r-projects

One common repo for all of my R projects

Language: HTML - Size: 854 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

bayartsogt-ya/mnpolarity

Mongolian Polarity Detection in Weakly Supervised manner

Language: Python - Size: 4.03 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 0

semantic-health/deep-patient-cohorts

A tool for automatically labelling discharge summaries into disease categories.

Language: Python - Size: 539 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 3

erossini/FSharpTutorial

F# tutorial: building applications, data programming and tests

Language: HTML - Size: 136 KB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0