An open API service providing repository metadata for many open source software ecosystems.

Topic: "named-entity-recognition"

leduckhai/MultiMed

[LREC-COLING 2024 (Oral), Interspeech 2024 (Oral), NAACL 2025, ACL 2025] A Series of Multilingual Multitask Medical Speech Processing

Language: Python - Size: 22.4 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 293 - Forks: 31

sagorbrur/bnlp

BNLP is a natural language processing toolkit for Bengali Language.

Language: Jupyter Notebook - Size: 22.5 MB - Last synced at: 18 days ago - Pushed at: 6 months ago - Stars: 291 - Forks: 65

Joforde/Shukongdashi 📦

使用知识图谱,自然语言处理,卷积神经网络等技术,基于python语言,设计了一个数控领域故障诊断专家系统

Language: HTML - Size: 33.1 MB - Last synced at: 6 months ago - Pushed at: almost 2 years ago - Stars: 288 - Forks: 96

Nealcly/BiLSTM-LAN

Hierarchically-Refined Label Attention Network for Sequence Labeling

Language: Python - Size: 5.17 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 285 - Forks: 51

boat-group/fancy-nlp

NLP for human. A fast and easy-to-use natural language processing (NLP) toolkit, satisfying your imagination about NLP.

Language: Python - Size: 769 KB - Last synced at: 15 days ago - Pushed at: over 2 years ago - Stars: 284 - Forks: 40

cliang1453/BOND

BOND: BERT-Assisted Open-Domain Name Entity Recognition with Distant Supervision

Language: Python - Size: 115 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 283 - Forks: 35

Kyubyong/bert_ner

Ner with Bert

Language: Python - Size: 484 KB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 281 - Forks: 56

microsoft/vert-papers

This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguation Toolkit) project, by the Knowledge Computing group at Microsoft Research Asia (MSRA).

Language: Python - Size: 22 MB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 278 - Forks: 94

opensemanticsearch/open-semantic-etl

Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database

Language: Python - Size: 615 KB - Last synced at: 21 days ago - Pushed at: over 2 years ago - Stars: 268 - Forks: 72

hooshvare/parsbert

🤗 ParsBERT: Transformer-based Model for Persian Language Understanding

Language: Jupyter Notebook - Size: 1.3 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 267 - Forks: 33

samueldobbie/markup

A web-based document annotation tool, powered by GPT-4 :rocket:

Language: TypeScript - Size: 79.7 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 260 - Forks: 32

vngrs-ai/vnlp

State-of-the-art, lightweight NLP tools for Turkish language. Developed by VNGRS.

Language: Python - Size: 392 MB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 259 - Forks: 17

tsujuifu/pytorch_graph-rel

A PyTorch implementation of GraphRel

Language: Python - Size: 60.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 255 - Forks: 53

cooscao/Bert-BiLSTM-CRF-pytorch

bert-bilstm-crf implemented in pytorch for named entity recognition.

Language: Jupyter Notebook - Size: 3.58 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 249 - Forks: 52

monpa-team/monpa

MONPA 罔拍是一個提供正體中文斷詞、詞性標註以及命名實體辨識的多任務模型

Language: Python - Size: 8.25 MB - Last synced at: 19 days ago - Pushed at: 4 months ago - Stars: 247 - Forks: 25

oroszgy/awesome-hungarian-nlp

A curated list of NLP resources for Hungarian

Size: 125 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 245 - Forks: 18

iesl/dilated-cnn-ner

Dilated CNNs for NER in TensorFlow

Language: Python - Size: 160 KB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 242 - Forks: 58

mpuig/spacy-lookup

Named Entity Recognition based on dictionaries

Language: Python - Size: 3.55 MB - Last synced at: 14 days ago - Pushed at: over 6 years ago - Stars: 242 - Forks: 38

janlukasschroeder/nlp-cheat-sheet-python

NLP Cheat Sheet, Python, spacy, LexNPL, NLTK, tokenization, stemming, sentence detection, named entity recognition

Language: Jupyter Notebook - Size: 3.05 MB - Last synced at: 22 days ago - Pushed at: over 2 years ago - Stars: 239 - Forks: 74

26hzhang/neural_sequence_labeling

A TensorFlow implementation of Neural Sequence Labeling model, which is able to tackle sequence labeling tasks such as POS Tagging, Chunking, NER, Punctuation Restoration and etc.

Language: Python - Size: 136 MB - Last synced at: 7 days ago - Pushed at: over 6 years ago - Stars: 234 - Forks: 46

Text-Mining/Persian-NER

پیکره بزرگ شناسایی موجودیت‌های نامدار فارسی برچسب خورده

Size: 211 MB - Last synced at: 13 days ago - Pushed at: almost 4 years ago - Stars: 233 - Forks: 22

csebuetnlp/banglabert

This repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla" accpeted in Findings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics: NAACL-2022.

Language: Python - Size: 1.14 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 230 - Forks: 31

kirralabs/indonesian-NLP-resources

data resource untuk NLP bahasa indonesia

Size: 7.81 KB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 228 - Forks: 50

dice-group/gerbil

GERBIL - General Entity annotatoR Benchmark

Language: Java - Size: 120 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 226 - Forks: 58

baiyyang/medical-entity-recognition

包含传统的基于统计模型(CRF)和基于深度学习(Embedding-Bi-LSTM-CRF)下的医疗数据命名实体识别

Language: Python - Size: 211 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 218 - Forks: 70

davidsbatista/NER-Evaluation

An implementation of a full named-entity evaluation metrics based on SemEval'13 Task 9 - not at tag/token level but considering all the tokens that are part of the named-entity

Language: Python - Size: 85.9 KB - Last synced at: 6 months ago - Pushed at: 11 months ago - Stars: 217 - Forks: 48

microsoft/presidio-research

This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.

Language: Jupyter Notebook - Size: 10.8 MB - Last synced at: 4 days ago - Pushed at: 6 days ago - Stars: 216 - Forks: 65

vunb/vntk

Vietnamese NLP Toolkit for Node

Language: JavaScript - Size: 3.56 MB - Last synced at: 15 days ago - Pushed at: over 1 year ago - Stars: 216 - Forks: 63

kamalkraj/BERT-NER-TF

Named Entity Recognition with BERT using TensorFlow 2.0

Language: Python - Size: 1.16 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 213 - Forks: 71

alexandrainst/danlp 📦

DaNLP is a repository for Natural Language Processing resources for the Danish Language.

Language: Python - Size: 49.4 MB - Last synced at: 10 days ago - Pushed at: 4 months ago - Stars: 205 - Forks: 33

createmomo/CRF-Layer-on-the-Top-of-BiLSTM

The CRF Layer was implemented by using Chainer 2.0. Please see more details here: https://createmomo.github.io/2017/09/12/CRF_Layer_on_the_Top_of_BiLSTM_1/

Language: Python - Size: 19.5 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 200 - Forks: 50

Nealcly/templateNER

Source code for template-based NER

Language: Python - Size: 1.57 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 199 - Forks: 38

milaan9/Python_Natural_Language_Processing

This repository consists of a complete guide on natural language processing (NLP) in Python where we'll learn various techniques for implementing NLP including parsing & text processing and understand how to use NLP for text feature engineering.

Language: Jupyter Notebook - Size: 182 KB - Last synced at: 21 days ago - Pushed at: almost 3 years ago - Stars: 198 - Forks: 175

opensemanticsearch/open-semantic-entity-search-api

Open Source REST API for named entity extraction, named entity linking, named entity disambiguation, recommendation & reconciliation of entities like persons, organizations and places for (semi)automatic semantic tagging & analysis of documents by linked data knowledge graph like SKOS thesaurus, RDF ontology, database(s) or list(s) of names

Language: Python - Size: 146 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 194 - Forks: 34

d5555/TagEditor

🏖TagEditor - Annotation tool for spaCy

Size: 488 MB - Last synced at: 29 days ago - Pushed at: over 2 years ago - Stars: 193 - Forks: 12

dice-group/FOX

Federated Knowledge Extraction Framework

Language: Java - Size: 757 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 191 - Forks: 53

MantisAI/nervaluate

Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13

Language: Python - Size: 425 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 180 - Forks: 22

dmis-lab/bern

A neural named entity recognition and multi-type normalization tool for biomedical text mining

Language: Python - Size: 1010 KB - Last synced at: 7 days ago - Pushed at: about 3 years ago - Stars: 178 - Forks: 43

OpenSextant/SolrTextTagger

A text tagger based on Lucene / Solr, using FST technology

Language: Java - Size: 394 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 172 - Forks: 37

EuropeanaNewspapers/ner-corpora

Named Entity Recognition data for Europeana Newspapers

Size: 17.1 MB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 172 - Forks: 31

AnthonyMRios/pymetamap

Python wraper for MetaMap

Language: Python - Size: 45.9 KB - Last synced at: 21 days ago - Pushed at: almost 5 years ago - Stars: 172 - Forks: 63

INK-USC/TriggerNER

TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition (ACL 2020)

Language: Python - Size: 2.22 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 170 - Forks: 19

ZihanWangKi/CrossWeigh

CrossWeigh: Training Named Entity Tagger from Imperfect Annotations

Language: Python - Size: 1.77 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 170 - Forks: 21

taishan1994/pytorch_bert_intent_classification_and_slot_filling

基于pytorch的中文意图识别和槽位填充

Language: Python - Size: 158 KB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 169 - Forks: 27

huspacy/huspacy

HuSpaCy: industrial-strength Hungarian natural language processing

Language: Python - Size: 2.2 MB - Last synced at: 16 days ago - Pushed at: 7 months ago - Stars: 167 - Forks: 15

yanwii/ChineseNER

基于Bi-GRU + CRF 的中文机构名、人名识别, 支持google bert模型

Language: Python - Size: 4.1 MB - Last synced at: 2 months ago - Pushed at: over 6 years ago - Stars: 167 - Forks: 41

chambliss/Multilingual_NER

Applying BERT to named entity recognition in English and Russian.

Language: Python - Size: 10.1 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 162 - Forks: 24

Alibaba-NLP/KB-NER

Winner system (DAMO-NLP) of SemEval 2022 MultiCoNER shared task over 10 out of 13 tracks.

Language: Python - Size: 1.73 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 158 - Forks: 17

Anwarvic/Dan-Jurafsky--Chris-Manning--NLP

My solution to the Natural Language Processing course made by Dan Jurafsky, Chris Manning in Winter 2012.

Language: Java - Size: 49.7 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 157 - Forks: 55

lonePatient/TorchBlocks

A PyTorch-based toolkit for natural language processing

Language: Python - Size: 481 KB - Last synced at: 21 days ago - Pushed at: over 2 years ago - Stars: 156 - Forks: 26

supercoderhawk/DeepLearning_NLP

基于深度学习的自然语言处理库

Language: Python - Size: 12.2 MB - Last synced at: about 2 months ago - Pushed at: over 6 years ago - Stars: 156 - Forks: 40

taishan1994/BERT-Relation-Extraction

使用bert进行关系三元组抽取。

Language: Python - Size: 1.47 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 155 - Forks: 15

4AI/LS-LLaMA

A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning

Language: Python - Size: 3.54 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 155 - Forks: 24

savasy/Turkish-Bert-NLP-Pipeline

Bert-base NLP pipeline for Turkish, Ner, Sentiment Analysis, Question Answering etc.

Language: Jupyter Notebook - Size: 2.66 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 153 - Forks: 21

LiyuanLucasLiu/LD-Net

Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling

Language: Python - Size: 599 KB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 146 - Forks: 13

VinAIResearch/PhoNLP

PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)

Language: Python - Size: 588 KB - Last synced at: about 14 hours ago - Pushed at: 5 months ago - Stars: 143 - Forks: 19

ankane/mitie-ruby

Named-entity recognition for Ruby

Language: Ruby - Size: 85 KB - Last synced at: 17 days ago - Pushed at: about 1 month ago - Stars: 141 - Forks: 5

syuoni/eznlp

Easy Natural Language Processing

Language: Python - Size: 3.53 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 140 - Forks: 22

napsternxg/TwitterNER

Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html

Language: Jupyter Notebook - Size: 41.6 MB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 138 - Forks: 32

yahshibu/nested-ner-tacl2020-transformers

Implementation of Nested Named Entity Recognition using BERT

Language: Python - Size: 76.2 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 138 - Forks: 26

INK-USC/AlpacaTag

AlpacaTag: An Active Learning-based Crowd Annotation Framework for Sequence Tagging (ACL 2019 Demo)

Language: HTML - Size: 87.3 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 137 - Forks: 22

pkuserc/ChatGPT_for_IE

Evaluating ChatGPT’s Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness

Language: Python - Size: 5.78 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 136 - Forks: 7

rikeda71/TorchCRF

An Inplementation of CRF (Conditional Random Fields) in PyTorch 1.0

Language: Python - Size: 63.5 KB - Last synced at: 22 days ago - Pushed at: almost 5 years ago - Stars: 136 - Forks: 11

FreedomIntelligence/Evaluation-of-ChatGPT-on-Information-Extraction

An Evaluation of ChatGPT on Information Extraction task, including Named Entity Recognition (NER), Relation Extraction (RE), Event Extraction (EE) and Aspect-based Sentiment Analysis (ABSA).

Language: Python - Size: 761 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 132 - Forks: 11

yuzhimanhua/Multi-BioNER

Cross-type Biomedical Named Entity Recognition with Deep Multi-task Learning (Bioinformatics'19)

Language: Python - Size: 157 KB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 129 - Forks: 28

MagedSaeed/farasapy

A Python implementation of Farasa toolkit

Language: Python - Size: 265 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 128 - Forks: 22

weizhepei/BERT-NER

Using pre-trained BERT models for Chinese and English NER with 🤗Transformers

Language: Python - Size: 4.51 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 127 - Forks: 26

jayavardhanr/End-to-end-Sequence-Labeling-via-Bi-directional-LSTM-CNNs-CRF-Tutorial

Tutorial for End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF

Language: Jupyter Notebook - Size: 25.4 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 124 - Forks: 72

taishan1994/PointerNet_Chinese_Information_Extraction

利用指针网络进行信息抽取,包含命名实体识别、关系抽取、事件抽取。

Language: Python - Size: 5.2 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 123 - Forks: 18

NorskRegnesentral/weak-supervision-for-NER 📦

Framework to learn Named Entity Recognition models without labelled data using weak supervision.

Language: Jupyter Notebook - Size: 13.7 MB - Last synced at: 5 months ago - Pushed at: about 4 years ago - Stars: 123 - Forks: 30

SNUDerek/multiLSTM

keras attentional bi-LSTM-CRF for Joint NLU (slot-filling and intent detection) with ATIS

Language: Jupyter Notebook - Size: 18.6 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 123 - Forks: 42

ckiplab/ckipnlp

CKIP CoreNLP Toolkits

Language: Python - Size: 573 KB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 122 - Forks: 15

yaleimeng/NER_corpus_chinese

NER(命名实体识别)中文语料,一站式获取

Size: 18.5 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 121 - Forks: 34

saiwaiyanyu/bi-lstm-crf-ner-tf2.0

Named Entity Recognition (NER) task using Bi-LSTM-CRF model implemented in Tensorflow 2.0(tensorflow2.0 +)

Language: Python - Size: 3.34 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 119 - Forks: 44

CogComp/talen

A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities

Language: Java - Size: 5.28 MB - Last synced at: 29 days ago - Pushed at: about 3 years ago - Stars: 114 - Forks: 25

shibing624/nerpy

🌈 NERpy: Implementation of Named Entity Recognition using Python. 命名实体识别工具,支持BertSoftmax、BertSpan等模型,开箱即用。

Language: Python - Size: 6.13 MB - Last synced at: 20 days ago - Pushed at: over 1 year ago - Stars: 113 - Forks: 15

zjunlp/Generative_KG_Construction_Papers

[EMNLP 2022] Generative Knowledge Graph Construction: A Review

Size: 15.8 MB - Last synced at: 27 days ago - Pushed at: almost 2 years ago - Stars: 112 - Forks: 7

aymara/lima

The Libre Multilingual Analyzer, a Natural Language Processing (NLP) C++ toolkit.

Language: C++ - Size: 276 MB - Last synced at: 20 days ago - Pushed at: 12 months ago - Stars: 111 - Forks: 20

sina-al/pynlp 📦

A pythonic wrapper for Stanford CoreNLP.

Language: Python - Size: 79.1 KB - Last synced at: 23 days ago - Pushed at: almost 7 years ago - Stars: 108 - Forks: 11

kaisugi/entity-related-papers

Named Entity Recognition, Entity Linking, and more

Size: 143 KB - Last synced at: 1 day ago - Pushed at: 5 months ago - Stars: 107 - Forks: 9

DmitryRyumin/EMNLP-2023-Papers

EMNLP 2023 Papers: Explore cutting-edge research from EMNLP 2023, the premier conference for advancing empirical methods in natural language processing. Stay updated on the latest in machine learning, deep learning, and natural language processing with code included. :star: support NLP!

Language: Python - Size: 6.43 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 107 - Forks: 7

zliucr/CrossNER

CrossNER: Evaluating Cross-Domain Named Entity Recognition (AAAI-2021)

Language: Python - Size: 2.27 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 106 - Forks: 22

mit-ccc/TweebankNLP

[LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweebank-NER dataset

Language: Python - Size: 16.8 MB - Last synced at: 29 days ago - Pushed at: over 1 year ago - Stars: 104 - Forks: 8

opensemanticsearch/open-semantic-search-apps

Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations and named entities) and data import (ETL like text extraction, OCR and crawling filesystems or websites)

Language: CSS - Size: 1.37 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 97 - Forks: 38

FuxiaoLiu/VisualNews-Repository

[EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning

Language: Jupyter Notebook - Size: 6.94 MB - Last synced at: 7 days ago - Pushed at: 11 months ago - Stars: 96 - Forks: 9

fingeredman/teanaps

자연어 처리와 텍스트 분석을 위한 오픈소스 파이썬 라이브러리 입니다.

Language: Jupyter Notebook - Size: 62.5 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 92 - Forks: 11

explosion/healthsea

Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.

Language: Python - Size: 57 MB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 91 - Forks: 17

lyutyuh/ASP

PyTorch implementation and pre-trained models for ASP - Autoregressive Structured Prediction with Language Models, EMNLP 22. https://arxiv.org/pdf/2210.14698.pdf

Language: Python - Size: 5.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 90 - Forks: 15

howl-anderson/seq2annotation

基于 TensorFlow & PaddlePaddle 的通用序列标注算法库(目前包含 BiLSTM+CRF, Stacked-BiLSTM+CRF 和 IDCNN+CRF,更多算法正在持续添加中)实现中文分词(Tokenizer / segmentation)、词性标注(Part Of Speech, POS)和命名实体识别(Named Entity Recognition, NER)等序列标注任务。

Language: Python - Size: 8.81 MB - Last synced at: 16 days ago - Pushed at: over 2 years ago - Stars: 84 - Forks: 21

poteminr/instruct-ner

Instruct LLMs for flat and nested NER. Fine-tuning Llama and Mistral models for instruction named entity recognition. (Instruction NER)

Language: Python - Size: 297 KB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 83 - Forks: 8

microsoft/SkillsExtractorCognitiveSearch 📦

Azure Search Cognitive Skill to extract technical and business skills from text

Language: Python - Size: 760 KB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 81 - Forks: 46

mukhal/xlm-roberta-ner

Named Entity Recognition with Pretrained XLM-RoBERTa

Language: Python - Size: 2.86 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 81 - Forks: 25

tomasonjo/trinity-ie

Information extraction pipeline containing coreference resolution, named entity linking, and relationship extraction

Language: Python - Size: 4.88 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 81 - Forks: 17

benbusby/namebuster

A tool for enumerating usernames from text, files, or websites

Language: Go - Size: 45.9 KB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 78 - Forks: 11

ArneBinder/pytorch-ie

PyTorch-IE: State-of-the-art Information Extraction in PyTorch

Language: Python - Size: 1.71 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 77 - Forks: 7

LanguageMachines/frog

Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.

Language: C++ - Size: 70.2 MB - Last synced at: 1 day ago - Pushed at: 6 months ago - Stars: 77 - Forks: 10

Alibaba-NLP/CLNER

[ACL-IJCNLP 2021] Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning

Language: Python - Size: 1.76 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 77 - Forks: 13

napsternxg/DeepSequenceClassification

Deep neural network based model for sequence to sequence classification

Language: Python - Size: 34.2 KB - Last synced at: 2 months ago - Pushed at: over 7 years ago - Stars: 77 - Forks: 20

dmis-lab/GeNER

Simple Questions Generate Named Entity Recognition Datasets (EMNLP 2022)

Language: Python - Size: 82.5 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 76 - Forks: 9

zliucr/coach

Coach: A Coarse-to-Fine Approach for Cross-domain Slot Filling (ACL-2020)

Language: Python - Size: 264 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 76 - Forks: 19

Related Topics
nlp 526 natural-language-processing 431 ner 382 machine-learning 188 python 179 spacy 144 deep-learning 138 pytorch 130 relation-extraction 119 bert 118 sentiment-analysis 117 information-extraction 106 text-classification 70 tensorflow 68 transformers 66 nlp-machine-learning 65 crf 57 sequence-labeling 51 dataset 43 pos-tagging 42 knowledge-graph 41 lstm 40 python3 39 text-mining 38 spacy-nlp 37 keras 37 nltk 36 question-answering 34 transformer 34 topic-modeling 34 entity-linking 33 conditional-random-fields 30 llm 29 huggingface 28 artificial-intelligence 27 tokenization 27 ai 27 entity-extraction 27 large-language-models 26 neural-network 25 corpus 25 natural-language-understanding 24 data-science 23 lemmatization 23 java 22 text-summarization 22 huggingface-transformers 22 named-entities 22 bert-model 22 language-model 21 flask 21 bilstm-crf 21 word-embeddings 21 roberta 20 jupyter-notebook 20 neural-networks 20 part-of-speech-tagger 20 bilstm 20 classification 20 part-of-speech-tagging 19 annotation-tool 19 event-extraction 18 machine-translation 18 docker 17 dependency-parsing 17 transfer-learning 17 flair 16 token-classification 16 streamlit 16 tokenizer 16 coreference-resolution 16 fine-tuning 16 intent-classification 16 dependency-parser 15 conll-2003 15 chatbot 15 bert-fine-tuning 14 named-entity-disambiguation 14 biomedical 14 summarization 14 part-of-speech 13 text-analysis 13 lstm-crf 13 nlp-library 13 stemming 13 text-generation 13 api 13 text-processing 13 annotation 12 vietnamese-nlp 12 named-entity-linking 12 information-retrieval 12 llama 12 ocr 12 anonymization 12 deep-neural-networks 12 lstm-neural-networks 12 cnn 12 bioinformatics 12 bert-ner 11