Topic: "named-entity-recognition"
leduckhai/MultiMed
[LREC-COLING 2024 (Oral), Interspeech 2024 (Oral), NAACL 2025, ACL 2025] A Series of Multilingual Multitask Medical Speech Processing
Language: Python - Size: 22.4 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 293 - Forks: 31

sagorbrur/bnlp
BNLP is a natural language processing toolkit for Bengali Language.
Language: Jupyter Notebook - Size: 22.5 MB - Last synced at: 18 days ago - Pushed at: 6 months ago - Stars: 291 - Forks: 65

Joforde/Shukongdashi 📦
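A fault-diagnosis expert system for the CNC (numerical control) domain, built in Python using knowledge graphs, natural language processing, convolutional neural networks, and related techniques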
Language: HTML - Size: 33.1 MB - Last synced at: 6 months ago - Pushed at: almost 2 years ago - Stars: 288 - Forks: 96

Nealcly/BiLSTM-LAN
Hierarchically-Refined Label Attention Network for Sequence Labeling
Language: Python - Size: 5.17 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 285 - Forks: 51

boat-group/fancy-nlp
NLP for human. A fast and easy-to-use natural language processing (NLP) toolkit, satisfying your imagination about NLP.
Language: Python - Size: 769 KB - Last synced at: 15 days ago - Pushed at: over 2 years ago - Stars: 284 - Forks: 40

cliang1453/BOND
BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision
Language: Python - Size: 115 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 283 - Forks: 35

Kyubyong/bert_ner
NER with BERT
Language: Python - Size: 484 KB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 281 - Forks: 56

microsoft/vert-papers
This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguation Toolkit) project, by the Knowledge Computing group at Microsoft Research Asia (MSRA).
Language: Python - Size: 22 MB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 278 - Forks: 94

opensemanticsearch/open-semantic-etl
Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
Language: Python - Size: 615 KB - Last synced at: 21 days ago - Pushed at: over 2 years ago - Stars: 268 - Forks: 72

hooshvare/parsbert
🤗 ParsBERT: Transformer-based Model for Persian Language Understanding
Language: Jupyter Notebook - Size: 1.3 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 267 - Forks: 33

samueldobbie/markup
A web-based document annotation tool, powered by GPT-4 :rocket:
Language: TypeScript - Size: 79.7 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 260 - Forks: 32

vngrs-ai/vnlp
State-of-the-art, lightweight NLP tools for Turkish language. Developed by VNGRS.
Language: Python - Size: 392 MB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 259 - Forks: 17

tsujuifu/pytorch_graph-rel
A PyTorch implementation of GraphRel
Language: Python - Size: 60.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 255 - Forks: 53

cooscao/Bert-BiLSTM-CRF-pytorch
bert-bilstm-crf implemented in pytorch for named entity recognition.
Language: Jupyter Notebook - Size: 3.58 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 249 - Forks: 52

monpa-team/monpa
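MONPA (罔拍) is a multi-task model that provides Traditional Chinese word segmentation, part-of-speech tagging, and named entity recognition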
Language: Python - Size: 8.25 MB - Last synced at: 19 days ago - Pushed at: 4 months ago - Stars: 247 - Forks: 25

oroszgy/awesome-hungarian-nlp
A curated list of NLP resources for Hungarian
Size: 125 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 245 - Forks: 18

iesl/dilated-cnn-ner
Dilated CNNs for NER in TensorFlow
Language: Python - Size: 160 KB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 242 - Forks: 58

mpuig/spacy-lookup
Named Entity Recognition based on dictionaries
Language: Python - Size: 3.55 MB - Last synced at: 14 days ago - Pushed at: over 6 years ago - Stars: 242 - Forks: 38

janlukasschroeder/nlp-cheat-sheet-python
NLP Cheat Sheet, Python, spaCy, LexNLP, NLTK, tokenization, stemming, sentence detection, named entity recognition
Language: Jupyter Notebook - Size: 3.05 MB - Last synced at: 22 days ago - Pushed at: over 2 years ago - Stars: 239 - Forks: 74

26hzhang/neural_sequence_labeling
A TensorFlow implementation of a Neural Sequence Labeling model that can tackle sequence labeling tasks such as POS Tagging, Chunking, NER, Punctuation Restoration, etc.
Language: Python - Size: 136 MB - Last synced at: 7 days ago - Pushed at: over 6 years ago - Stars: 234 - Forks: 46

Text-Mining/Persian-NER
A large annotated corpus for Persian named entity recognition
Size: 211 MB - Last synced at: 13 days ago - Pushed at: almost 4 years ago - Stars: 233 - Forks: 22

csebuetnlp/banglabert
This repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla" accepted in Findings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics: NAACL-2022.
Language: Python - Size: 1.14 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 230 - Forks: 31

kirralabs/indonesian-NLP-resources
Data resources for Indonesian NLP
Size: 7.81 KB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 228 - Forks: 50

dice-group/gerbil
GERBIL - General Entity annotatoR Benchmark
Language: Java - Size: 120 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 226 - Forks: 58

baiyyang/medical-entity-recognition
Named entity recognition on medical data, covering both a traditional statistical model (CRF) and a deep learning model (Embedding-Bi-LSTM-CRF)
Language: Python - Size: 211 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 218 - Forks: 70

davidsbatista/NER-Evaluation
An implementation of full named-entity evaluation metrics based on SemEval'13 Task 9 - computed not at the tag/token level but over all the tokens that are part of each named entity
Language: Python - Size: 85.9 KB - Last synced at: 6 months ago - Pushed at: 11 months ago - Stars: 217 - Forks: 48

microsoft/presidio-research
This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.
Language: Jupyter Notebook - Size: 10.8 MB - Last synced at: 4 days ago - Pushed at: 6 days ago - Stars: 216 - Forks: 65

vunb/vntk
Vietnamese NLP Toolkit for Node
Language: JavaScript - Size: 3.56 MB - Last synced at: 15 days ago - Pushed at: over 1 year ago - Stars: 216 - Forks: 63

kamalkraj/BERT-NER-TF
Named Entity Recognition with BERT using TensorFlow 2.0
Language: Python - Size: 1.16 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 213 - Forks: 71

alexandrainst/danlp 📦
DaNLP is a repository for Natural Language Processing resources for the Danish Language.
Language: Python - Size: 49.4 MB - Last synced at: 10 days ago - Pushed at: 4 months ago - Stars: 205 - Forks: 33

createmomo/CRF-Layer-on-the-Top-of-BiLSTM
The CRF Layer was implemented by using Chainer 2.0. Please see more details here: https://createmomo.github.io/2017/09/12/CRF_Layer_on_the_Top_of_BiLSTM_1/
Language: Python - Size: 19.5 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 200 - Forks: 50

Nealcly/templateNER
Source code for template-based NER
Language: Python - Size: 1.57 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 199 - Forks: 38

milaan9/Python_Natural_Language_Processing
This repository consists of a complete guide on natural language processing (NLP) in Python where we'll learn various techniques for implementing NLP including parsing & text processing and understand how to use NLP for text feature engineering.
Language: Jupyter Notebook - Size: 182 KB - Last synced at: 21 days ago - Pushed at: almost 3 years ago - Stars: 198 - Forks: 175

opensemanticsearch/open-semantic-entity-search-api
Open Source REST API for named entity extraction, named entity linking, named entity disambiguation, recommendation & reconciliation of entities like persons, organizations and places for (semi)automatic semantic tagging & analysis of documents by linked data knowledge graph like SKOS thesaurus, RDF ontology, database(s) or list(s) of names
Language: Python - Size: 146 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 194 - Forks: 34

d5555/TagEditor
🏖TagEditor - Annotation tool for spaCy
Size: 488 MB - Last synced at: 29 days ago - Pushed at: over 2 years ago - Stars: 193 - Forks: 12

dice-group/FOX
Federated Knowledge Extraction Framework
Language: Java - Size: 757 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 191 - Forks: 53

MantisAI/nervaluate
Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13
Language: Python - Size: 425 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 180 - Forks: 22

dmis-lab/bern
A neural named entity recognition and multi-type normalization tool for biomedical text mining
Language: Python - Size: 1010 KB - Last synced at: 7 days ago - Pushed at: about 3 years ago - Stars: 178 - Forks: 43

OpenSextant/SolrTextTagger
A text tagger based on Lucene / Solr, using FST technology
Language: Java - Size: 394 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 172 - Forks: 37

EuropeanaNewspapers/ner-corpora
Named Entity Recognition data for Europeana Newspapers
Size: 17.1 MB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 172 - Forks: 31

AnthonyMRios/pymetamap
Python wrapper for MetaMap
Language: Python - Size: 45.9 KB - Last synced at: 21 days ago - Pushed at: almost 5 years ago - Stars: 172 - Forks: 63

INK-USC/TriggerNER
TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition (ACL 2020)
Language: Python - Size: 2.22 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 170 - Forks: 19

ZihanWangKi/CrossWeigh
CrossWeigh: Training Named Entity Tagger from Imperfect Annotations
Language: Python - Size: 1.77 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 170 - Forks: 21

taishan1994/pytorch_bert_intent_classification_and_slot_filling
Chinese intent recognition and slot filling based on PyTorch
Language: Python - Size: 158 KB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 169 - Forks: 27

huspacy/huspacy
HuSpaCy: industrial-strength Hungarian natural language processing
Language: Python - Size: 2.2 MB - Last synced at: 16 days ago - Pushed at: 7 months ago - Stars: 167 - Forks: 15

yanwii/ChineseNER
Chinese organization-name and person-name recognition based on Bi-GRU + CRF, with support for Google's BERT model
Language: Python - Size: 4.1 MB - Last synced at: 2 months ago - Pushed at: over 6 years ago - Stars: 167 - Forks: 41

chambliss/Multilingual_NER
Applying BERT to named entity recognition in English and Russian.
Language: Python - Size: 10.1 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 162 - Forks: 24

Alibaba-NLP/KB-NER
Winning system (DAMO-NLP) of the SemEval 2022 MultiCoNER shared task in 10 out of 13 tracks.
Language: Python - Size: 1.73 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 158 - Forks: 17

Anwarvic/Dan-Jurafsky--Chris-Manning--NLP
My solutions to the Natural Language Processing course taught by Dan Jurafsky and Chris Manning in Winter 2012.
Language: Java - Size: 49.7 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 157 - Forks: 55

lonePatient/TorchBlocks
A PyTorch-based toolkit for natural language processing
Language: Python - Size: 481 KB - Last synced at: 21 days ago - Pushed at: over 2 years ago - Stars: 156 - Forks: 26

supercoderhawk/DeepLearning_NLP
A deep-learning-based natural language processing library
Language: Python - Size: 12.2 MB - Last synced at: about 2 months ago - Pushed at: over 6 years ago - Stars: 156 - Forks: 40

taishan1994/BERT-Relation-Extraction
Relation triple extraction using BERT.
Language: Python - Size: 1.47 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 155 - Forks: 15

4AI/LS-LLaMA
A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning
Language: Python - Size: 3.54 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 155 - Forks: 24

savasy/Turkish-Bert-NLP-Pipeline
BERT-based NLP pipeline for Turkish: NER, Sentiment Analysis, Question Answering, etc.
Language: Jupyter Notebook - Size: 2.66 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 153 - Forks: 21

LiyuanLucasLiu/LD-Net
Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling
Language: Python - Size: 599 KB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 146 - Forks: 13

VinAIResearch/PhoNLP
PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)
Language: Python - Size: 588 KB - Last synced at: about 14 hours ago - Pushed at: 5 months ago - Stars: 143 - Forks: 19

ankane/mitie-ruby
Named-entity recognition for Ruby
Language: Ruby - Size: 85 KB - Last synced at: 17 days ago - Pushed at: about 1 month ago - Stars: 141 - Forks: 5

syuoni/eznlp
Easy Natural Language Processing
Language: Python - Size: 3.53 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 140 - Forks: 22

napsternxg/TwitterNER
Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html
Language: Jupyter Notebook - Size: 41.6 MB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 138 - Forks: 32

yahshibu/nested-ner-tacl2020-transformers
Implementation of Nested Named Entity Recognition using BERT
Language: Python - Size: 76.2 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 138 - Forks: 26

INK-USC/AlpacaTag
AlpacaTag: An Active Learning-based Crowd Annotation Framework for Sequence Tagging (ACL 2019 Demo)
Language: HTML - Size: 87.3 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 137 - Forks: 22

pkuserc/ChatGPT_for_IE
Evaluating ChatGPT’s Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness
Language: Python - Size: 5.78 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 136 - Forks: 7

rikeda71/TorchCRF
An Implementation of CRF (Conditional Random Fields) in PyTorch 1.0
Language: Python - Size: 63.5 KB - Last synced at: 22 days ago - Pushed at: almost 5 years ago - Stars: 136 - Forks: 11

FreedomIntelligence/Evaluation-of-ChatGPT-on-Information-Extraction
An Evaluation of ChatGPT on Information Extraction task, including Named Entity Recognition (NER), Relation Extraction (RE), Event Extraction (EE) and Aspect-based Sentiment Analysis (ABSA).
Language: Python - Size: 761 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 132 - Forks: 11

yuzhimanhua/Multi-BioNER
Cross-type Biomedical Named Entity Recognition with Deep Multi-task Learning (Bioinformatics'19)
Language: Python - Size: 157 KB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 129 - Forks: 28

MagedSaeed/farasapy
A Python implementation of Farasa toolkit
Language: Python - Size: 265 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 128 - Forks: 22

weizhepei/BERT-NER
Using pre-trained BERT models for Chinese and English NER with 🤗Transformers
Language: Python - Size: 4.51 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 127 - Forks: 26

jayavardhanr/End-to-end-Sequence-Labeling-via-Bi-directional-LSTM-CNNs-CRF-Tutorial
Tutorial for End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF
Language: Jupyter Notebook - Size: 25.4 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 124 - Forks: 72

taishan1994/PointerNet_Chinese_Information_Extraction
Information extraction with pointer networks, including named entity recognition, relation extraction, and event extraction.
Language: Python - Size: 5.2 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 123 - Forks: 18

NorskRegnesentral/weak-supervision-for-NER 📦
Framework to learn Named Entity Recognition models without labelled data using weak supervision.
Language: Jupyter Notebook - Size: 13.7 MB - Last synced at: 5 months ago - Pushed at: about 4 years ago - Stars: 123 - Forks: 30

SNUDerek/multiLSTM
keras attentional bi-LSTM-CRF for Joint NLU (slot-filling and intent detection) with ATIS
Language: Jupyter Notebook - Size: 18.6 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 123 - Forks: 42

ckiplab/ckipnlp
CKIP CoreNLP Toolkits
Language: Python - Size: 573 KB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 122 - Forks: 15

yaleimeng/NER_corpus_chinese
Chinese corpora for NER (named entity recognition), available in one place
Size: 18.5 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 121 - Forks: 34

saiwaiyanyu/bi-lstm-crf-ner-tf2.0
Named Entity Recognition (NER) task using a Bi-LSTM-CRF model implemented in TensorFlow 2.0 (tensorflow 2.0+)
Language: Python - Size: 3.34 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 119 - Forks: 44

CogComp/talen
A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities
Language: Java - Size: 5.28 MB - Last synced at: 29 days ago - Pushed at: about 3 years ago - Stars: 114 - Forks: 25

shibing624/nerpy
🌈 NERpy: Implementation of Named Entity Recognition using Python. A named entity recognition tool that supports models such as BertSoftmax and BertSpan and works out of the box.
Language: Python - Size: 6.13 MB - Last synced at: 20 days ago - Pushed at: over 1 year ago - Stars: 113 - Forks: 15

zjunlp/Generative_KG_Construction_Papers
[EMNLP 2022] Generative Knowledge Graph Construction: A Review
Size: 15.8 MB - Last synced at: 27 days ago - Pushed at: almost 2 years ago - Stars: 112 - Forks: 7

aymara/lima
The Libre Multilingual Analyzer, a Natural Language Processing (NLP) C++ toolkit.
Language: C++ - Size: 276 MB - Last synced at: 20 days ago - Pushed at: 12 months ago - Stars: 111 - Forks: 20

sina-al/pynlp 📦
A pythonic wrapper for Stanford CoreNLP.
Language: Python - Size: 79.1 KB - Last synced at: 23 days ago - Pushed at: almost 7 years ago - Stars: 108 - Forks: 11

kaisugi/entity-related-papers
Named Entity Recognition, Entity Linking, and more
Size: 143 KB - Last synced at: 1 day ago - Pushed at: 5 months ago - Stars: 107 - Forks: 9

DmitryRyumin/EMNLP-2023-Papers
EMNLP 2023 Papers: Explore cutting-edge research from EMNLP 2023, the premier conference for advancing empirical methods in natural language processing. Stay updated on the latest in machine learning, deep learning, and natural language processing with code included. :star: support NLP!
Language: Python - Size: 6.43 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 107 - Forks: 7

zliucr/CrossNER
CrossNER: Evaluating Cross-Domain Named Entity Recognition (AAAI-2021)
Language: Python - Size: 2.27 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 106 - Forks: 22

mit-ccc/TweebankNLP
[LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweebank-NER dataset
Language: Python - Size: 16.8 MB - Last synced at: 29 days ago - Pushed at: over 1 year ago - Stars: 104 - Forks: 8

opensemanticsearch/open-semantic-search-apps
Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations and named entities) and data import (ETL like text extraction, OCR and crawling filesystems or websites)
Language: CSS - Size: 1.37 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 97 - Forks: 38

FuxiaoLiu/VisualNews-Repository
[EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning
Language: Jupyter Notebook - Size: 6.94 MB - Last synced at: 7 days ago - Pushed at: 11 months ago - Stars: 96 - Forks: 9

fingeredman/teanaps
An open-source Python library for natural language processing and text analysis.
Language: Jupyter Notebook - Size: 62.5 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 92 - Forks: 11

explosion/healthsea
Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.
Language: Python - Size: 57 MB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 91 - Forks: 17

lyutyuh/ASP
PyTorch implementation and pre-trained models for ASP - Autoregressive Structured Prediction with Language Models, EMNLP 22. https://arxiv.org/pdf/2210.14698.pdf
Language: Python - Size: 5.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 90 - Forks: 15

howl-anderson/seq2annotation
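A general-purpose sequence labeling library based on TensorFlow & PaddlePaddle (currently includes BiLSTM+CRF, Stacked-BiLSTM+CRF, and IDCNN+CRF, with more algorithms being added) for sequence labeling tasks such as Chinese word segmentation (tokenizer / segmentation), part-of-speech (POS) tagging, and named entity recognition (NER).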
Language: Python - Size: 8.81 MB - Last synced at: 16 days ago - Pushed at: over 2 years ago - Stars: 84 - Forks: 21

poteminr/instruct-ner
Instruct LLMs for flat and nested NER. Fine-tuning Llama and Mistral models for instruction named entity recognition. (Instruction NER)
Language: Python - Size: 297 KB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 83 - Forks: 8

microsoft/SkillsExtractorCognitiveSearch 📦
Azure Search Cognitive Skill to extract technical and business skills from text
Language: Python - Size: 760 KB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 81 - Forks: 46

mukhal/xlm-roberta-ner
Named Entity Recognition with Pretrained XLM-RoBERTa
Language: Python - Size: 2.86 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 81 - Forks: 25

tomasonjo/trinity-ie
Information extraction pipeline containing coreference resolution, named entity linking, and relationship extraction
Language: Python - Size: 4.88 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 81 - Forks: 17

benbusby/namebuster
A tool for enumerating usernames from text, files, or websites
Language: Go - Size: 45.9 KB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 78 - Forks: 11

ArneBinder/pytorch-ie
PyTorch-IE: State-of-the-art Information Extraction in PyTorch
Language: Python - Size: 1.71 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 77 - Forks: 7

LanguageMachines/frog
Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.
Language: C++ - Size: 70.2 MB - Last synced at: 1 day ago - Pushed at: 6 months ago - Stars: 77 - Forks: 10

Alibaba-NLP/CLNER
[ACL-IJCNLP 2021] Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning
Language: Python - Size: 1.76 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 77 - Forks: 13

napsternxg/DeepSequenceClassification
Deep neural network based model for sequence to sequence classification
Language: Python - Size: 34.2 KB - Last synced at: 2 months ago - Pushed at: over 7 years ago - Stars: 77 - Forks: 20

dmis-lab/GeNER
Simple Questions Generate Named Entity Recognition Datasets (EMNLP 2022)
Language: Python - Size: 82.5 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 76 - Forks: 9

zliucr/coach
Coach: A Coarse-to-Fine Approach for Cross-domain Slot Filling (ACL-2020)
Language: Python - Size: 264 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 76 - Forks: 19
