An open API service providing repository metadata for many open source software ecosystems.

Topic: "text-augmentation"

textflint/textflint

Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing

Language: Python - Size: 11.6 MB - Last synced at: 24 days ago - Pushed at: over 2 years ago - Stars: 643 - Forks: 95

prakhar21/TextAugmentation-GPT2

Fine-tuned pre-trained GPT2 for custom topic specific text generation. Such system can be used for Text Augmentation.

Language: Python - Size: 655 KB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 188 - Forks: 43

beyondguo/genius

💡GENIUS – generating text using sketches! A strong text generation & data augmentation tool.

Language: Python - Size: 6.02 MB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 174 - Forks: 16

KennethEnevoldsen/augmenty

Augmenty is an augmentation library based on spaCy for augmenting texts.

Language: Python - Size: 6.12 MB - Last synced at: 4 days ago - Pushed at: 11 months ago - Stars: 153 - Forks: 11

varunkumar-dev/TransformersDataAugmentation

Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper

Language: Python - Size: 824 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 127 - Forks: 23

misha345a/E-commerce_Reviews_Classifier

Text augmentation, deep learning, and aspect-based sentiment analysis.

Language: Jupyter Notebook - Size: 12.6 MB - Last synced at: 2 months ago - Pushed at: about 3 years ago - Stars: 14 - Forks: 6

sagorbrur/bnaug

Bangla Text Augmentation

Language: Jupyter Notebook - Size: 51.8 KB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 1

Schlampig/HanziGraph

Chinese Characters Visualization & Chinese Text Augmentation.

Language: Python - Size: 25.2 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 2

AbhishekSalian/Random-Word-Generator

This library helps you to create random words i.e noise in text data. Helpful in many tasks like the generation of random authorization token generation of constant or variable length, text data augmentation, etc.

Language: Python - Size: 174 KB - Last synced at: 19 days ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 1

jaaack-wang/linguistic-knowledge-in-DA-for-NLP

Source Code, data, and results for my paper titled Linguistic Knowledge in Data Augmentation for Natural Language Processing: An Example on Chinese Question Matching.

Language: Jupyter Notebook - Size: 16.8 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

mohsin-riad/automation-bijoy-to-avro

ANSI and Unicode are encoding standards used across the world by writers and common users. ANSI is an older encoding version and is used in operating systems like Windows 95/ 98 and much older systems. Unicode is a newer version of encoding used in the current day operating systems

Language: Jupyter Notebook - Size: 3.81 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 1

the-black-knight-01/NLP-Models-Tensorflow Fork of huseinzol05/NLP-Models-Tensorflow

Gathers machine learning and Tensorflow deep learning models for NLP problems

Language: Jupyter Notebook - Size: 44.7 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 3

wyu-du/Self-Training-Dialogue-Generation

This repository contains the data and code for the paper "Self-training with Two-phase Self-augmentation for Few-shot Dialogue Generation" (EMNLP2022-Findings).

Language: Python - Size: 5.8 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

k4black/fast-aug

Fast Augmentation library for NLP

Language: Rust - Size: 807 KB - Last synced at: 24 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

tznurmin/TEA

Taxonomic Entity Augmentation makes biomedical texts less repetitive

Language: Python - Size: 161 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

suhail-chand/industrial-accident-severity-assessment

NLP based Industrial Accident Severity Assessment

Language: HTML - Size: 28.4 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

kimdanny/Olfactory-NLP

Language: Jupyter Notebook - Size: 21.7 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 1

Shizu-ka/Easy-NLP-Augmentation

A PyPI package for augmenting text data using NLP techniques directly in your pandas dataframe.

Language: Python - Size: 45.9 KB - Last synced at: 12 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

mosh98/Text_Aug_Low_Res

AAAI Knowledge NLP Submission

Language: Jupyter Notebook - Size: 104 KB - Last synced at: 1 day ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

mosh98/swe_aug

Dritributed Text Augmentation Techniques (Appeared AAAI 2023)

Language: Python - Size: 532 KB - Last synced at: 1 day ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

mosh98/Feature_Space_Augmentation

Feature space Augmentation

Language: Python - Size: 25.4 KB - Last synced at: 1 day ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

jaaack-wang/text-augmentation-techniques

Common approaches to text augmentation, from random text-editing perturbations, back translation, to model-based transformations.

Language: Jupyter Notebook - Size: 8.39 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

ijhrecto/Taglish-Emotion-Recognition-of-Students-during-COVID-19

A study that aims to unfold what emotions did Filipino students manifest during a year of Covid-19 quarantines.

Language: Jupyter Notebook - Size: 7.3 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

yurayli/google-quest-challenge

solution for https://www.kaggle.com/c/google-quest-challenge

Language: Jupyter Notebook - Size: 438 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

scairesearch/agrelin

augmented reinforment learning in tensorflow with open gym and blender or unity

Size: 2.93 KB - Last synced at: 11 months ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

Related Topics
nlp 9 augmentation 6 natural-language-processing 6 text-classification 6 data-augmentation 5 python 5 machine-learning 3 transformers 3 bert 3 back-translation 2 gpt-2 2 data-augmentation-strategies 2 deep-learning 2 nlp-machine-learning 2 aspect-based-sentiment-analysis 1 lstm-neural-networks 1 covid-19 1 emotion-recognition 1 social-media 1 swedish 1 code-generation 1 augmentation-libraries 1 bangla 1 bangla-text-augmentation 1 bengali 1 bengali-nlp 1 adversarial-samples 1 attack 1 model-robustness 1 robustness-analysis 1 subpopulation 1 text-transformations 1 transformation 1 conditional-text-generation 1 keywords-to-text 1 named-entities-recognition 1 sketch-to-text 1 text-classificaiton 1 text-generation 1 noise 1 password 1 password-generator 1 python3 1 random-generator 1 random-word-generator 1 text-mining 1 nlp-library 1 pypi 1 pypi-package 1 text-augment 1 nlproc 1 spacy 1 spacy-extension 1 spacy-nlp 1 training-data 1 swedish-language 1 low-resource-languages 1 low-resource-nlp 1 transformers-models 1 word2vec 1 natural-language-generation 1 textclassification 1 transformer-architecture 1 ensemble-learning 1 industrial-safety 1 lstm 1 neural-network 1 risk-assessment 1 svm 1 vectorization 1 rust 1 authorization 1 word-embeddings 1 natural-language-understanding 1 huggingface-transformers 1 unicode 1 mohsin-riad 1 bijoytounicode 1 bijoy2unicode 1 bijoy-to-avro 1 bijoy 1 avro 1 automation 1 ansi-to-unicode 1 ansi 1 topic-generator 1 text-to-speech 1 text-similarity 1 stemming 1 squad-question-answers 1 speech-to-text 1 sentence-pair-classification 1 question-answers 1 pos-tagging 1 optical-character-recognition 1 ocr 1 neural-machine-translation 1 language-detection 1 extractive-summarization 1 entity-tagging 1