An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: token-classification

ufal/factgenie

Lightweight self-hosted span annotation tool

Language: Python - Size: 30.4 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 29 - Forks: 6

KRLabsOrg/LettuceDetect

LettuceDetect is a hallucination detection framework for RAG applications.

Language: Python - Size: 2.22 MB - Last synced at: 12 days ago - Pushed at: 16 days ago - Stars: 207 - Forks: 16

modelscope/AdaSeq

AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models

Language: Python - Size: 5.03 MB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 434 - Forks: 41

4AI/LS-LLaMA

A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning

Language: Python - Size: 3.54 MB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 154 - Forks: 25

satya77/Transformer_Temporal_Tagger

Code and data from the paper BERT Got a Date: Introducing Transformers to Temporal Tagging

Language: Python - Size: 365 KB - Last synced at: 2 days ago - Pushed at: almost 3 years ago - Stars: 66 - Forks: 5

Kardbord/hfapigo

Unofficial (Golang) Go bindings for the Hugging Face Inference API

Language: Go - Size: 3.35 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 62 - Forks: 5

TiagoSanti/LID-token-classification

Scraping, token classification, and model deployment for a selection process.

Language: Python - Size: 415 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

nlp4se/RE-Miner-Dashboard

NLP interactive dashboard for users to interact with the RE-Miner Ecosystem for data analysis, visualization, and NLP-based insights.

Language: SCSS - Size: 5.48 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

fido-ai/ua-datasets

A collection of datasets for Ukrainian language

Language: Python - Size: 2.08 MB - Last synced at: 10 days ago - Pushed at: 9 months ago - Stars: 58 - Forks: 2

Antarlekhaka/code

Multi-task NLP Annotation Framework

Language: JavaScript - Size: 10.6 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 6 - Forks: 2

C-bianc/NER-task

Token classification for named entities

Language: Jupyter Notebook - Size: 3.37 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

arnabd64/spacy-ner-hf-space

A web app built with Gradio to demonstrate the capabilities of the spaCy NER pipeline.

Language: Python - Size: 15.6 KB - Last synced at: 15 days ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

Ahwar/NER-NLP-with-ONNX-Java

A Java NLP application that identifies names, organizations, and locations in text by utilizing Hugging Face's RoBERTa NER model through the ONNX runtime and the Deep Java Library.

Language: Java - Size: 218 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 4 - Forks: 2

nubebytes/Yoda-API

API for the Yoda-NER and Yoda-FITS models: NLP models for Google Feed product optimization

Language: Python - Size: 1.15 MB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

matteo-stat/transformers-nlp-ner-token-classification

This repo provides scripts for fine-tuning Hugging Face Transformers models, setting up pipelines, and optimizing token classification models for inference. They are based on my experience developing a custom chatbot; I'm sharing them in the hope that they will help others quickly fine-tune and use models in their projects! 😊

Language: Python - Size: 22.5 KB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

luozhouyang/transformers-keras

Transformer-based models implemented in TensorFlow 2.x (using Keras).

Language: Python - Size: 696 KB - Last synced at: 18 days ago - Pushed at: over 3 years ago - Stars: 75 - Forks: 13

1024-m/NAACL-2024-SemEval-TASK-8C

Code for the paper: Black-Box Word-Level Text Boundary Detection in Partially Machine Generated Texts

Language: Jupyter Notebook - Size: 28.2 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

JersonGB22/TokenClassification-TensorFlow

Language: Jupyter Notebook - Size: 586 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

nachoDRT/MERIT-Dataset

The MERIT Dataset is a fully synthetic, labeled dataset created for training and benchmarking LLMs on Visually Rich Document Understanding tasks. It is also designed to help detect biases and improve interpretability in LLMs, an area we are actively working on. This repository is actively maintained, and new features are continuously being added.

Language: Python - Size: 495 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 4 - Forks: 0

aditeyabaral/maple-v2

MAPLEv2 - Multi-task Approach for generating blackout Poetry with Linguistic Evaluation

Language: Python - Size: 55 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 1

koshkidadanet/lilt-finetuning-piad-ya-ocr

Project carried out as part of a graduation thesis titled "Development of a software module for analyzing documents that confirm individual achievements"

Language: Jupyter Notebook - Size: 12.2 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

vedantMahangade/PII-Data-Detection

A reliable, automated LLM-based model for detecting PII in student writing

Language: Jupyter Notebook - Size: 650 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

Semihocakli/nlp-with-hugging-face

Language: Jupyter Notebook - Size: 247 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

TirendazAcademy/Multilingual-NER-App

Building a multilingual NER app with HuggingFace, Gradio and Comet

Language: Jupyter Notebook - Size: 23.7 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

MohammedAly22/ArabNizer

ArabiNizer is a state-of-the-art Arabic named entity recognizer (NER) leveraging the XLMR transformer model with an impressive testing accuracy of 95.00% and a remarkable testing F1-score of 88.00% on the PAN-X.AR subset from XTREME.

Language: Jupyter Notebook - Size: 147 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

MohammedAly22/Tasneef

A state-of-the-art Arabic part-of-speech tagger leveraging the XLMR transformer model, with an impressive testing accuracy of 97.49% and a remarkable testing F1-score of 96.44% on the Arabic UD Treebank.

Language: Jupyter Notebook - Size: 217 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

lucien1011/kaggle-coleridgeinitiative-show-us-the-data

Keyword extraction to automate the discovery of datasets in publications and public reports

Language: Python - Size: 504 KB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

datnnt1997/VPhoBertTagger

Token classification using PhoBERT models for Vietnamese

Language: Python - Size: 17.7 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 9 - Forks: 3

awsaf49/pii-data-detection

The Learning Agency Lab - PII Data Detection || Develop automated techniques to detect and remove PII from educational data.

Language: Jupyter Notebook - Size: 39.1 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

WikKam/roberta-pos-finetuning

Part-of-speech tagging in Polish with a fine-tuned RoBERTa model

Language: Jupyter Notebook - Size: 43.9 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

frankl1/SCIA-MMF-POS (fork of NTeALan/Sangkak-Challenge-IA)

A 16M LLM for POS tagging in African languages

Language: Jupyter Notebook - Size: 6.87 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mohammadoumar/yes_we_can_token_classification

Token Classification task on the Yes We Can dataset

Language: Jupyter Notebook - Size: 32.2 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mohammadoumar/token_classification

Token classification at the essay, paragraph, and sentence levels with BERT, DistilBERT and NER

Language: Jupyter Notebook - Size: 274 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

vineetk1/sequence-tagging

Sequence-tagging using deep learning

Language: Python - Size: 237 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

anudeepvanjavakam1/lit_or_not_on_reddit

This app searches Reddit posts and comments to determine whether a product or service has a positive or negative sentiment, and predicts top product mentions using Named Entity Recognition

Language: Jupyter Notebook - Size: 33.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

abmami/Fine-tuning-CamemBERT-for-Keyword-Extraction

Fine-tuning CamemBERT for French keyword extraction on a custom dataset.

Language: Jupyter Notebook - Size: 8.28 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 1

VirtualRoyalty/gan-plus-nlp

Generative adversarial approach to most popular NLP tasks

Language: Jupyter Notebook - Size: 169 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

Ryu0nPrivateProject/ABSA

Aspect-Based Sentiment Analysis for Korean

Language: Jupyter Notebook - Size: 4.94 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

prasoonvarshney/scientific-entity-recognition

End-to-end pipeline for (1) automatic scraping and parsing of NLP research papers, (2) token-level entity annotations in Label Studio, and (3) BERT-based models for span identification and entity recognition

Language: Jupyter Notebook - Size: 2.93 MB - Last synced at: 18 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

aditeyabaral/maple

Implementation of the paper, MAPLE - MAsking words to generate blackout Poetry using sequence-to-sequence LEarning, ICNLSP 2021

Language: Python - Size: 17.7 MB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 2

AshutoshDongare/softskill-NER

Fine-tuning a 🤗 Transformers model for a soft-skill NER task

Language: Jupyter Notebook - Size: 65.4 KB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

Ryu0nPrivateProject/POSABSA

Aspect-Based Sentiment Analysis using POSBert

Language: Python - Size: 1.6 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

mahvash-siavashpour/BERT-Token-Classification-for-Persian-Kasr-e-Ezafeh

Identify whether each word in a Persian sentence needs a kasr-e-ezafeh tag.

Language: Jupyter Notebook - Size: 48.8 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

naivenlp/rapidnlp-datasets

Data pipelines for both TensorFlow and PyTorch!

Language: Python - Size: 117 KB - Last synced at: 14 days ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

dsashulya/biobert-distillation

Summer internship project at JetBrains Research

Language: Python - Size: 12.9 MB - Last synced at: 12 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

qAp/showus

Language: Jupyter Notebook - Size: 1.22 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

Related Keywords
token-classification (46), named-entity-recognition (15), nlp (15), ner (13), bert (12), transformers (11), pytorch (9), huggingface (9), natural-language-processing (8), part-of-speech-tagging (5), sequence-classification (5), tensorflow (4), sentiment-analysis (4), sequence-labeling (4), text-classification (4), question-answering (4), huggingface-transformers (4), nlp-machine-learning (3), fine-tuning (3), kaggle (3), python (3), deep-learning (3), bert-model (3), transformer (3), ai (3), neural-network (2), machine-learning (2), transfer-learning (2), onnx (2), roberta-model (2), summarization (2), text-generation (2), bert-fine-tuning (2), arabic-language (2), gradio (2), arabic-nlp (2), part-of-speech-tagger (2), dataset (2), keras (2), keyword-extraction (2), python3 (2), finetuning (2), large-language-models (2), distilbert (2), annotation-tool (2), information-extraction (2), masked-language-models (2), data-annotation (2), simcse (2), xlm-roberta (2), llm (2), natural-language-understanding (2), blackout-poetry (2), arabic-ner (1), perplexity (1), grammar-checker (1), arabic-part-of-speech (1), part-of-speech (1), synthetic-dataset-generation (1), phobert (1), phobert-ner (1), synthetic-dataset (1), documents (1), multilingual (1), lilt (1), ocr (1), data-science (1), comet-ml (1), russian-language (1), app (1), yandex-cloud (1), bigbird (1), ml (1), tokenization (1), a (1), streamlit (1), vader-sentiment-analysis (1), camembert (1), french (1), keywords (1), nlp-keywords-extraction (1), pytorch-lightning (1), sentence-similarity (1), gan (1), gan-bert (1), multi-label-classification (1), multiple-choice (1), semi-supervised-learning (1), absa (1), electra (1), entity-recognition (1), softskills (1), training-data (1), pos (1), parsbert (1), dataset-loader (1), biobert (1), distillation (1), pytorch-token-classification (1), vietnamese-ner (1)