An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: named-entity-recognition

ukairia777/pytorch-nlp-tutorial

pytorch를 사용하여 텍스트 전처리부터 RAG, 에이전트, LLM 파인튜닝을 정리한 Deep Learning NLP 저장소입니다.

Language: Jupyter Notebook - Size: 50.6 MB - Last synced at: about 1 hour ago - Pushed at: about 3 hours ago - Stars: 49 - Forks: 20

microsoft/presidio

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

Language: Python - Size: 234 MB - Last synced at: about 19 hours ago - Pushed at: 3 days ago - Stars: 5,097 - Forks: 699

urchade/GLiNER

Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024

Language: Python - Size: 30.9 MB - Last synced at: about 16 hours ago - Pushed at: about 21 hours ago - Stars: 2,174 - Forks: 193

stanfordnlp/stanza

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

Language: Python - Size: 82.5 MB - Last synced at: about 7 hours ago - Pushed at: about 8 hours ago - Stars: 7,537 - Forks: 909

explosion/spacy-llm

🦙 Integrating LLMs into structured NLP pipelines

Language: Python - Size: 1.79 MB - Last synced at: about 12 hours ago - Pushed at: 7 months ago - Stars: 1,288 - Forks: 102

SashaFlores/Crypto-Sentiment-Analysis

NLP to understand the sentiment in the latest news articles featuring Bitcoin and Ethereum, and to better understand the other factors involved with the coin prices change such as common words and phrases and organizations and entities mentioned in the articles.

Language: Jupyter Notebook - Size: 6.22 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

microsoft/presidio-research

This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.

Language: Jupyter Notebook - Size: 15.8 MB - Last synced at: about 19 hours ago - Pushed at: 2 days ago - Stars: 228 - Forks: 66

spencermountain/compromise

modest natural-language processing

Language: JavaScript - Size: 55.2 MB - Last synced at: 1 day ago - Pushed at: 10 days ago - Stars: 11,803 - Forks: 659

The-FinAI/PIXIU

This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).

Language: Jupyter Notebook - Size: 49.5 MB - Last synced at: about 14 hours ago - Pushed at: 5 months ago - Stars: 750 - Forks: 95

hitz-zentroa/GoLLIE

Guideline following Large Language Model for Information Extraction

Language: Python - Size: 10.8 MB - Last synced at: about 13 hours ago - Pushed at: 9 months ago - Stars: 389 - Forks: 26

drzippie/ner-service

Spanish Named Entity Recognition service with configurable backends (MITIE/spaCy). FastAPI web API, CLI interface, Docker support. Extracts PERSON, LOCATION, ORGANIZATION entities with confidence scoring.

Language: Python - Size: 35.2 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

ukairia777/tensorflow-nlp-tutorial

tensorflow를 사용하여 텍스트 전처리부터, Topic Models, BERT, GPT, LLM과 같은 최신 모델의 다운스트림 태스크들을 정리한 Deep Learning NLP 저장소입니다.

Language: Jupyter Notebook - Size: 126 MB - Last synced at: about 12 hours ago - Pushed at: 30 days ago - Stars: 551 - Forks: 286

keanteng/wqd7005-project

Harnessing AI and Language Models for Predictive Modeling of Patient Health Deterioration

Language: Jupyter Notebook - Size: 22.9 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1 - Forks: 0

pranshurastogi29/Named_entity_Relation_Extraction_SOMD_2025_ACL

This is the Official Submission to SOMD 2025 workshop at ACL2025 by psr123

Language: Jupyter Notebook - Size: 5.34 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

Tongjilibo/bert4torch

An elegent pytorch implement of transformers

Language: Python - Size: 11 MB - Last synced at: 1 day ago - Pushed at: 6 days ago - Stars: 1,310 - Forks: 164

CAMeL-Lab/camel_tools

A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.

Language: Python - Size: 11.5 MB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 471 - Forks: 77

mddunlap924/PII-Detection

Personal Identifiable Information (PII) entity detection and performance enhancement with synthetic data generation

Language: Python - Size: 548 KB - Last synced at: about 13 hours ago - Pushed at: 11 months ago - Stars: 30 - Forks: 5

JohnSnowLabs/spark-nlp

State of the Art Natural Language Processing

Language: Scala - Size: 3.42 GB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 4,014 - Forks: 729

ankane/informers

Fast transformer inference for Ruby

Language: Ruby - Size: 2.48 MB - Last synced at: 2 days ago - Pushed at: 6 months ago - Stars: 577 - Forks: 17

hankcs/HanLP

中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理

Language: Python - Size: 69.5 MB - Last synced at: 5 days ago - Pushed at: 2 months ago - Stars: 35,405 - Forks: 10,709

mhbashari/awesome-persian-nlp-ir

Curated List of Persian Natural Language Processing and Information Retrieval Tools and Resources

Size: 192 KB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 754 - Forks: 113

hankcs/pyhanlp

中文分词

Language: Python - Size: 280 KB - Last synced at: 5 days ago - Pushed at: 6 months ago - Stars: 3,191 - Forks: 803

esmailza/Llama2-vLLM-LangChain-knowledge-graph

Preserving entities through the integration of knowledge graphs, Llama 2, vLLM, and LangChain.

Language: Python - Size: 763 KB - Last synced at: about 13 hours ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 0

NemesLaszlo/Social-Media-Analysis-based-on-COVID-19-with-Sentiment-Analysis-NER-and-Information-Extraction

This repository contains the social media data scraper and the notebooks of this analysis. Where we analise the Social Media posts - tweets with Sentiment Analysis then we analyse this results with Named Entity Recognition (NER) and Information Extraction methods to get a more accurate and detailed picture of this sentiment results.

Language: Jupyter Notebook - Size: 33.1 MB - Last synced at: 2 days ago - Pushed at: almost 4 years ago - Stars: 7 - Forks: 2

mawiesne/DE-NERmed

DE-NERmed: An OpenNLP named entity recognition tool and model files trained for medical NLP use cases

Language: Java - Size: 367 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

zjunlp/OneKE

[WWW 2025] A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System.

Language: HTML - Size: 24.6 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 93 - Forks: 11

fastdatascience/country_named_entity_recognition

Code to find country names

Language: Python - Size: 69.3 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 4 - Forks: 2

freinold/GLiNER-API

Easily configurable container app providing standardized access to dynamic NER models.

Language: Python - Size: 2.43 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 2 - Forks: 1

JayYip/m3tl

BERT for Multitask Learning

Language: Jupyter Notebook - Size: 29.1 MB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 548 - Forks: 126

apache/ctakes

Apache cTAKES is a Natural Language Processing (NLP) platform for clinical text.

Language: Java - Size: 128 MB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 86 - Forks: 16

BrikerMan/Kashgari

Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.

Language: Python - Size: 14.3 MB - Last synced at: 6 days ago - Pushed at: 11 months ago - Stars: 2,387 - Forks: 434

mirpo/fastapi-gen

Build LLM-enabled FastAPI applications without build configuration.

Language: Python - Size: 473 KB - Last synced at: 2 days ago - Pushed at: 4 days ago - Stars: 7 - Forks: 1

deeppavlov/DeepPavlov

An open source library for deep learning end-to-end dialog systems and chatbots.

Language: Python - Size: 31.3 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 6,911 - Forks: 1,165

zjunlp/DeepKE

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

Language: Python - Size: 121 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 4,036 - Forks: 726

KhymNad/resume-matcher-api

Backend API for Resume Matcher Full-Stack Project

Language: C# - Size: 1.24 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1 - Forks: 0

ArneBinder/pytorch-ie

PyTorch-IE: State-of-the-art Information Extraction in PyTorch

Language: Python - Size: 1.84 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 79 - Forks: 7

nlpaueb/gr-nlp-toolkit

The Greek NLP toolkit for Python. Supports NER/DP/POS Tagging/Greeklish-to-Greek Transliteration. Visit the web demo here: https://huggingface.co/spaces/AUEB-NLP/greek-nlp-toolkit-demo (paper presented at COLING 2025)

Language: Python - Size: 39.3 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 69 - Forks: 6

sindhu213/Niamt-project Fork of Jayant915/Niamt-project

Multilingual NER system using BiLSTM and XLM-R for real-time entity extraction via CLI and API.

Language: Python - Size: 40.3 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

davidsinclar/Task-tracker

A task tracker using tkinter

Language: Python - Size: 7.81 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

Hironsan/anago

Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.

Language: Python - Size: 7.33 MB - Last synced at: 3 days ago - Pushed at: over 2 years ago - Stars: 1,484 - Forks: 365

aymara/lima

The Libre Multilingual Analyzer, a Natural Language Processing (NLP) C++ toolkit.

Language: C++ - Size: 276 MB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 113 - Forks: 20

rodrigopivi/Chatito

🎯🗯 Dataset generation for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL!

Language: TypeScript - Size: 6.42 MB - Last synced at: about 15 hours ago - Pushed at: almost 2 years ago - Stars: 882 - Forks: 153

Franck-Dernoncourt/NeuroNER

Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results.

Language: Python - Size: 121 MB - Last synced at: 8 days ago - Pushed at: over 2 years ago - Stars: 1,713 - Forks: 474

ICIJ/datashare

A self-hosted search engine for documents. Fill our user survey about structured content: : https://forms.gle/PYgusFsoBaMyzUec9

Language: Java - Size: 395 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 642 - Forks: 61

SWEATZONE/NER-for-New-Articles-trained-on-CoNLL03-Dataset

Discover a powerful NER web app for news articles, utilizing spaCy models trained on the CoNLL-2003 dataset. Try it now! 🐙📦

Language: Jupyter Notebook - Size: 20.5 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

stanfordnlp/CoreNLP

CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

Language: Java - Size: 380 MB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 9,934 - Forks: 2,715

Dibakar270/pdf-tools-browser

Manage and enhance PDF tools in the browser with pdf.js, pdflibjs, and more. Progress on wasm libraries ongoing. 🛠️📄

Size: 6.84 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

snipsco/snips-nlu

Snips Python library to extract meaning from text

Language: Python - Size: 19.3 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 3,935 - Forks: 513

zjunlp/OpenUE

[EMNLP 2020] OpenUE: An Open Toolkit of Universal Extraction from Text

Language: Python - Size: 78.8 MB - Last synced at: 10 days ago - Pushed at: almost 3 years ago - Stars: 327 - Forks: 59

baidu/lac

百度NLP:分词,词性标注,命名实体识别,词重要性

Language: C++ - Size: 63.6 MB - Last synced at: 8 days ago - Pushed at: about 4 years ago - Stars: 3,951 - Forks: 595

fastdatascience/drug_named_entity_recognition

Language: Jupyter Notebook - Size: 11.4 MB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 25 - Forks: 12

UniversalDataTool/universal-data-tool

Collaborate & label any type of data, images, text, or documents, in an easy web interface or desktop app.

Language: JavaScript - Size: 247 MB - Last synced at: 9 days ago - Pushed at: 4 months ago - Stars: 2,001 - Forks: 193

chakki-works/seqeval

A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)

Language: Python - Size: 180 KB - Last synced at: 7 days ago - Pushed at: 11 months ago - Stars: 1,145 - Forks: 133

viniciusfinger/NER-powered-semantic-search

Named Entity Recognition powered Semantic Search

Language: Jupyter Notebook - Size: 20.6 MB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

LHNCBC/metamaplite

A near real-time named-entity recognizer

Language: Java - Size: 1020 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 61 - Forks: 14

macanv/BERT-BiLSTM-CRF-NER

Tensorflow solution of NER task Using BiLSTM-CRF model with Google BERT Fine-tuning And private Server services

Language: Python - Size: 3.75 MB - Last synced at: 14 days ago - Pushed at: over 4 years ago - Stars: 4,829 - Forks: 1,252

crownpku/Information-Extraction-Chinese

Chinese Named Entity Recognition with IDCNN/biLSTM+CRF, and Relation Extraction with biGRU+2ATT 中文实体识别与关系提取

Language: Python - Size: 78.9 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 2,257 - Forks: 807

MagedSaeed/farasapy

A Python implementation of Farasa toolkit

Language: Python - Size: 265 MB - Last synced at: 12 days ago - Pushed at: about 2 months ago - Stars: 132 - Forks: 23

ckiplab/ckipnlp

CKIP CoreNLP Toolkits

Language: Python - Size: 573 KB - Last synced at: 14 days ago - Pushed at: over 2 years ago - Stars: 124 - Forks: 15

daviden1013/llm-ie

A comprehensive toolkit that provides building blocks for LLM-based named entity recognition, attribute extraction, and relation extraction pipelines.

Language: Python - Size: 10.9 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 26 - Forks: 4

Az-r-ow/TravelNER Fork of lucas066001/TravelOrderResolver

Travel Named Entity Recognition using probabilistic model vs Deep Learning and Transformers

Language: Jupyter Notebook - Size: 18.2 MB - Last synced at: 7 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

PavanYellathakota/Text-Analysis-using-NLP-LDA

Advanced Text Analysis using NLP: Sentiment Analysis, Named Entity Recognition (NER) & Topic Modeling (LDA)

Language: Jupyter Notebook - Size: 471 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

kuhumcst/texton

Text Tonsorium - a toolbox that automatically arranges NLP tools in workflows and enacts them with user's inputs

Language: PHP - Size: 9.76 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 5 - Forks: 0

AmirLayegh/kg-rag-book

Implementation of the book "Essential GraphRAG MEAP V04"

Language: Python - Size: 762 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 1 - Forks: 0

boyfriend120393/transformers-tutorial

Explore the "transformers-tutorial" to master transformer architecture and GPT models through interactive HTML lessons. Start your journey on GitHub today! 🐙✨

Language: HTML - Size: 35.2 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

yyDing1/GNER

[ACL-24 Findings] Code implementation of Paper "Rethinking Negative Instances for Generative Named Entity Recognition"

Language: Python - Size: 4.69 MB - Last synced at: about 13 hours ago - Pushed at: over 1 year ago - Stars: 54 - Forks: 2

abhinav4747/howmanyofme

Curious how many people share your name? With howmanyofme, you can instantly find out using AI-powered name search. Discover if you're truly one of a kind.

Language: HTML - Size: 1.11 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

ahmedbesbes/anonymization-api

How to build and deploy an anonymization API with FastAPI and SpaCy

Language: Python - Size: 32.1 MB - Last synced at: 13 days ago - Pushed at: about 4 years ago - Stars: 71 - Forks: 28

LanguageMachines/frog

Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.

Language: C++ - Size: 70.2 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 77 - Forks: 10

oroszgy/awesome-hungarian-nlp

A curated list of NLP resources for Hungarian

Size: 125 KB - Last synced at: 17 days ago - Pushed at: 4 months ago - Stars: 249 - Forks: 19

yongzhuo/Pytorch-NLU

中文文本分类、序列标注工具包(pytorch),支持中文长文本、短文本的多类、多标签分类任务,支持中文命名实体识别、词性标注、分词、抽取式文本摘要等序列标注任务。 Chinese text classification and sequence labeling toolkit, supports multi class and multi label classification, text similsrity, text summary and NER.

Language: Python - Size: 379 KB - Last synced at: 16 days ago - Pushed at: about 1 year ago - Stars: 346 - Forks: 50

Shannu3766/AgroAI

🌱 Smart Agriculture Assistant: NLP-powered system that extracts agricultural parameters (temperature, humidity, moisture, soil type, NPK levels) from natural language input to provide crop and fertilizer recommendations. Built with NER, DeepSeek, Flask, and Docker support.

Language: Jupyter Notebook - Size: 408 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 1

TrongNV2003/viNER-Bert

Recognize entity from a sentence

Language: Python - Size: 2.88 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

flairNLP/flair

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Language: Python - Size: 376 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 14,214 - Forks: 2,119

Knowledge-Graph-Hub/kg-microbe

Language: Jupyter Notebook - Size: 529 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 19 - Forks: 3

IBM/zshot

Zero and Few shot named entity & relationships recognition

Language: Python - Size: 1.47 MB - Last synced at: 17 days ago - Pushed at: 3 months ago - Stars: 381 - Forks: 24

Georgetown-IR-Lab/QuickUMLS

System for Medical Concept Extraction and Linking

Language: Python - Size: 89.8 KB - Last synced at: 16 days ago - Pushed at: 12 months ago - Stars: 408 - Forks: 98

JohnSnowLabs/nlu

1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.

Language: Python - Size: 474 MB - Last synced at: 13 days ago - Pushed at: 6 months ago - Stars: 929 - Forks: 139

ThilinaRajapakse/simpletransformers

Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI

Language: Python - Size: 20 MB - Last synced at: 21 days ago - Pushed at: 3 months ago - Stars: 4,189 - Forks: 727

StarlangSoftware/TurkishNamedEntityRecognition

NER Corpus Processing Library

Language: Java - Size: 2.21 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 5 - Forks: 0

StarlangSoftware/DataGenerator

Classification dataset generator library for high level Nlp tasks

Language: Java - Size: 17.9 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 1 - Forks: 1

cooscao/Bert-BiLSTM-CRF-pytorch

bert-bilstm-crf implemented in pytorch for named entity recognition.

Language: Jupyter Notebook - Size: 3.58 MB - Last synced at: 17 days ago - Pushed at: about 4 years ago - Stars: 280 - Forks: 57

hanine-bgt/ELNER-DZ

ELNER-DZ: A Dataset for Named Entity Recognition and Linking in Algerian Arabic Dialect (Darija)

Size: 22.7 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

VinAIResearch/PhoNLP

PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)

Language: Python - Size: 588 KB - Last synced at: 16 days ago - Pushed at: 7 months ago - Stars: 146 - Forks: 19

shibing624/nerpy

🌈 NERpy: Implementation of Named Entity Recognition using Python. 命名实体识别工具,支持BertSoftmax、BertSpan等模型,开箱即用。

Language: Python - Size: 6.13 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 115 - Forks: 15

basedavishkar/GeoMine-NER-Geolocation

🧠 NLP pipeline to extract mining project names & locations from PDFs using spaCy NER + GeoNames geolocation.

Language: Python - Size: 15.9 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 2 - Forks: 0

blmoistawinde/HarvestText

文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法

Language: Python - Size: 4.27 MB - Last synced at: 26 days ago - Pushed at: about 1 year ago - Stars: 2,530 - Forks: 337

alisonmitchell/Biomedical-Knowledge-Graph

Information extraction from unstructured text to build a knowledge graph using techniques from traditional NLP to pre-trained transformers and LLMs for NER and Linking, and Relation Extraction.

Language: Jupyter Notebook - Size: 26.3 MB - Last synced at: 17 days ago - Pushed at: 8 months ago - Stars: 12 - Forks: 0

jftuga/deidentification

Deidentify people's names and gender specific pronouns

Language: Python - Size: 284 KB - Last synced at: 9 days ago - Pushed at: 3 months ago - Stars: 37 - Forks: 2

nerel-ds/NEREL

NEREL: A Russian Dataset with Nested Named Entities, Relations and Events

Size: 4.64 MB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 32 - Forks: 4

arthur02100/laclaugpt-dashboard

Explore the LaclauGPT demo dashboard, a tool for visualizing multimodal data from social media, designed for political science research. 🚀📊

Language: Python - Size: 2.39 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 0

explosion/spacy-streamlit

👑 spaCy building blocks and visualizers for Streamlit apps

Language: Python - Size: 61.5 KB - Last synced at: 27 days ago - Pushed at: 12 months ago - Stars: 840 - Forks: 118

monarch-initiative/ontogpt

LLM-based ontological extraction tools, including SPIRES

Language: Jupyter Notebook - Size: 80.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 701 - Forks: 97

TomiToivio/laclaugpt-dashboard

LaclauGPT demo with dummy data.

Language: Python - Size: 2.39 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

MuhammadHelmyOmar/ArabicPIIRedaction

Developing a system that can identify and mask sensitive information in Arabic sentences.

Language: Jupyter Notebook - Size: 2.82 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

explosion/spaCy

💫 Industrial-strength Natural Language Processing (NLP) in Python

Language: Python - Size: 194 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 31,826 - Forks: 4,520

deeppavlov/ner

Named Entity Recognition

Language: Python - Size: 116 KB - Last synced at: 17 days ago - Pushed at: about 2 years ago - Stars: 333 - Forks: 64

shaheennabi/Natural-Language-Processing-Practices-and-Mini-Projects

🎇 NLP Experiments 🎆 A hands-on collection of NLP experiments 💬, featuring models like RNN, LSTM, and Attention Mechanism. 🚀 Explore applications like text classification, sentiment analysis, and language generation 🌍. Continuously updated with new algorithms and research implementations! 🔥

Language: Jupyter Notebook - Size: 47.9 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

MantisAI/nervaluate

Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13

Language: Python - Size: 389 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 182 - Forks: 23

myhhub/KnowledgeGraph

knowledge graph知识图谱,从零开始构建知识图谱

Language: Python - Size: 2.56 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 1,385 - Forks: 156

Related Keywords
named-entity-recognition 1,508 nlp 536 natural-language-processing 437 ner 390 machine-learning 192 python 183 spacy 146 deep-learning 140 pytorch 136 relation-extraction 122 sentiment-analysis 122 bert 120 information-extraction 105 text-classification 70 transformers 70 tensorflow 69 nlp-machine-learning 66 crf 59 sequence-labeling 51 dataset 43 knowledge-graph 42 pos-tagging 42 lstm 41 python3 40 text-mining 38 topic-modeling 37 spacy-nlp 37 keras 37 nltk 36 transformer 35 entity-linking 34 question-answering 34 llm 32 conditional-random-fields 30 huggingface 29 entity-extraction 28 ai 28 artificial-intelligence 27 large-language-models 27 tokenization 27 corpus 25 neural-network 25 huggingface-transformers 24 natural-language-understanding 24 text-summarization 23 lemmatization 23 data-science 23 java 22 classification 22 named-entities 22 bilstm-crf 22 bert-model 22 neural-networks 22 word-embeddings 21 language-model 21 jupyter-notebook 21 flask 21 part-of-speech-tagger 20 roberta 20 bilstm 20 part-of-speech-tagging 19 annotation-tool 19 machine-translation 18 streamlit 18 transfer-learning 18 event-extraction 18 docker 18 token-classification 17 dependency-parsing 17 fine-tuning 17 tokenizer 16 coreference-resolution 16 intent-classification 16 conll-2003 16 flair 16 named-entity-disambiguation 15 chatbot 15 dependency-parser 15 text-analysis 14 bert-fine-tuning 14 summarization 14 biomedical 14 lstm-crf 14 lstm-neural-networks 13 nlp-library 13 api 13 stemming 13 part-of-speech 13 annotation 13 bioinformatics 13 text-processing 13 text-generation 13 vietnamese-nlp 12 anonymization 12 information-retrieval 12 fastapi 12 deep-neural-networks 12 ocr 12 cnn 12 named-entity-linking 12