GitHub topics: data-programming
JieyuZ2/wrench
[NeurIPS 2021] WRENCH: Weak supeRvision bENCHmark
Language: Python - Size: 1.81 MB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 223 - Forks: 34

decile-team/spear
SPEAR: Programmatically label and build training data quickly.
Language: Jupyter Notebook - Size: 432 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 106 - Forks: 20

JieyuZ2/Awesome-Weak-Supervision
A curated list of programmatic weak supervision papers and resources
Language: TeX - Size: 322 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 190 - Forks: 27

benbo/interactive-weak-supervision
Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling
Language: Python - Size: 45.9 KB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 31 - Forks: 4

megagonlabs/ruler
Data Programming by Demonstration (DPBD) for Document Classification
Language: Jupyter Notebook - Size: 17 MB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 35 - Forks: 6

sfpugh/Naturally-Adversarial-Datasets
An approach to curating naturally adversarial datasets.
Language: Python - Size: 314 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

megagonlabs/tagruler
Data programming by demonstration for information extraction and span annotation
Language: JavaScript - Size: 82.6 MB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 35 - Forks: 6

junchenzhi/Neural-Hidden-CRF
Code for the KDD-2023 paper: Neural-Hidden-CRF: A Robust Weakly-Supervised Sequence Labeler
Language: Python - Size: 6.37 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 0

junchenzhi/Awesome-Weak-Supervision-Sequence-Labeling
A curated list of awesome Weak-Supervision-Sequence-Labeling (WSSL) papers, methods & resources.
Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

sampathkethineedi/snorkel-process
Process flow to generate labels on Text data using Snorkel and maintain DB to repurpose unlabelled data
Language: Python - Size: 30.3 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 1

ayushbits/Semi-Supervised-LFs-Subset-Selection
This repository contains source code of our ACL 2021 paper **Data Programming using Semi-Supervision and Subset Selection**
Language: Python - Size: 1.67 GB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 2

ayushbits/robust-aggregate-lfs
Source code of our ACL 2022 paper 'Learning to robustly aggregate labeling functions for semi-supervised data programming'
Language: Python - Size: 32.6 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 2

cse163/book
Source code for the CSE 163: Intermediate Data Programming book (with code for practice problems)
Language: Jupyter Notebook - Size: 39.5 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 3

MrPatrek/r-projects
One common repo for all of my R projects
Language: HTML - Size: 854 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

bayartsogt-ya/mnpolarity
Mongolian Polarity Detection in Weakly Supervised manner
Language: Python - Size: 4.03 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 0

semantic-health/deep-patient-cohorts
A tool for automatically labelling discharge summaries into disease categories.
Language: Python - Size: 539 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 3

erossini/FSharpTutorial
F# tutorial: building applications, data programming and tests
Language: HTML - Size: 136 KB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0
