An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: disambiguation

arya-io/NER-EntityLinker

A Streamlit app that performs Named Entity Recognition (NER), links entities to Wikipedia, and handles disambiguation for ambiguous terms like "Apple," using NLP techniques.

Language: Python - Size: 34.5 MB - Last synced at: 4 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

sprytnyk/pylint-errors

A curated list of pylint errors with explanation and examples

Language: Python - Size: 304 KB - Last synced at: 3 days ago - Pushed at: 12 months ago - Stars: 74 - Forks: 22

zhongjingyu1/Partial-Multi-Label-Learning

[Summary] A curated list of resources for "Partial-Multi-Label-Learning"

Size: 184 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 33 - Forks: 6

OlivierBinette/er-evaluation

An End-to-End Evaluation Framework for Entity Resolution Systems

Language: Python - Size: 62.4 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 28 - Forks: 9

vanessaklee/akin

A collection of metrics and phonetic algorithms for fuzzy string matching in Elixir.

Language: Elixir - Size: 8.47 MB - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 36 - Forks: 4

kermitt2/biblio-glutton

A high performance bibliographic information service: https://biblio-glutton.readthedocs.io

Language: Java - Size: 7.31 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 137 - Forks: 17

Senzing/awesome

Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.

Language: Python - Size: 244 KB - Last synced at: 25 days ago - Pushed at: about 2 months ago - Stars: 57 - Forks: 2

PatentsView/PatentsView-Evaluation 📦

Evaluation and benchmarking of PatentsView disambiguation algorithms

Language: Python - Size: 156 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 13 - Forks: 8

earthstar-one/WordFX

Digital resources and tools for enhanced sense-making through natural language disambiguation

Size: 3.11 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

opensemanticsearch/open-semantic-entity-search-api

Open Source REST API for named entity extraction, named entity linking, named entity disambiguation, recommendation & reconciliation of entities like persons, organizations and places for (semi)automatic semantic tagging & analysis of documents by linked data knowledge graph like SKOS thesaurus, RDF ontology, database(s) or list(s) of names

Language: Python - Size: 146 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 194 - Forks: 34

flyingdoog/NDCC

A Collective Approach to Scholar Name Disambiguation

Language: C++ - Size: 40 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

reynoldsnlp/udar

UDAR Does Accented Russian: A finite-state morphological analyzer of Russian that handles stressed wordforms.

Language: Python - Size: 161 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 28 - Forks: 1

jonasengelmann/crossref-reconciliation-service

Crossref.org reconciliation service for OpenRefine.

Language: Python - Size: 9.77 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

wcmc-its/ReCiter-Scopus-Retrieval-Tool

A tool for retrieving articles from Scopus. Can work as a standalone application or in conjunction with the author disambiguation application, ReCiter.

Language: Java - Size: 5.62 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 4 - Forks: 5

clintval/cvbio 📦

Artisanal 🤣 bioinformatics tools and pipelines in Scala

Language: Scala - Size: 209 KB - Last synced at: 7 days ago - Pushed at: over 5 years ago - Stars: 20 - Forks: 3

verginer/disamby

Python package aiding in entity disambiguation based on string and location matching

Language: Python - Size: 716 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 18 - Forks: 0

hirmeos/entity-fishing-client-python

Repository hosting the common code for the entity-fishing clients

Language: Python - Size: 67.4 KB - Last synced at: 10 months ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 4

tommasosansone91/metaglossario_gestisco

Il metaglossario del progetto Interreg Italia-Svizzera Gestisco è una piattaforma di gestione e condivisione del linguaggio che raccoglie la terminologia normativa, tecnica e operativa utilizzata in ambito di protezione civile nel territorio transfrontaliero tra Italia e Svizzera, in particolare tra il Canton Ticino e le province di Como e Varese. La piattaforma è dotata di un query wizard che permette agli utenti di interrogare il database in modo puntuale, semplice e intuitivo. Il metaglossario è inoltre dotato di un form che permette agli utenti di suggerire l’inserimento di nuova terminologia nel database e di un servizio di API che permette l’esportazione della terminologia aggiornata in real-time in modalità linked data, in linguaggio JSON. La piattaforma dispone infine di un sistema che consente all’amministratore di gestire i dati lato backend.

Language: Python - Size: 12.4 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

kmdn/combining-linking-techniques

Combining Linking Techniques (CLiT) is an entity linking combination and execution framework, allowing for the seamless integration of EL systems and result exploitation for the sake of system reusability, result reproducibility, analysis and continuous improvement. (We hate waste. Especially wasting time. So let's reuse instead!)

Language: Python - Size: 2.17 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 1

projekt-opal/AGDISTIS Fork of dice-group/AGDISTIS

AGDISTIS - Agnostic Named Entity Disambiguation (D4.4)

Language: Java - Size: 159 MB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

habeanf/yap

Yet Another (natural language) Parser

Language: Go - Size: 47.4 MB - Last synced at: 12 months ago - Pushed at: about 6 years ago - Stars: 43 - Forks: 30

shyamupa/wikidump_preprocessing

Extracting useful metadata from Wikipedia dumps in any language.

Language: Python - Size: 82 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 25 - Forks: 5

Te-Chih/ARCC-AND

Implementation of our WWWW'24 paper: Author Name Disambiguation via Paper Association Refinement and Compositional Contrastive Embedding

Language: Python - Size: 55.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

ziqizhang/scholarlydata

Experimental code for author name and affiliation linking/disabmiguation

Language: Java - Size: 148 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 2

patonz/Hodor-Hold-the-Name

Hodor

Language: Java - Size: 66.9 MB - Last synced at: almost 2 years ago - Pushed at: over 9 years ago - Stars: 0 - Forks: 0

prohippo/pyellytoo

This is a conversion of the PyElly NLP tool from Python 2.7 code into Python 3.8 code.

Language: Python - Size: 15.5 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

mynlp/wikilex

Wikipedia Entities Lexicon Extractor

Language: Python - Size: 7.78 MB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 1

USC-NSL/sage

SAGE disambiguates protocol description in an IETF RFC document, then converts the disambiguated protocol description into executable protocol implementation.

Language: Python - Size: 1.05 MB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 6 - Forks: 2

joe817/Name-Disambiguation-Biendata-

2019 Biendata竞赛平台“OAG–WhoIsWho 同名消歧竞赛 赛道一”消歧比赛,第一名解决方案

Language: Jupyter Notebook - Size: 81.1 KB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 30 - Forks: 9

carmanzhang/LAGOS-AND

LAGOS-AND: A Large Gold Standard Dataset for Scholarly Author Name Disambiguation

Language: Python - Size: 229 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 1

Lupanoide/graph_disambiguator

Word sense disambiguation project that uses a wikipedia dump and discovers relationship with a graph. Cooked with python microservices, neo4j, elasticsearch

Language: Python - Size: 58.6 KB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 1

diging/black-goat

Platform for democratic non-hierarchical authority alignments

Language: Python - Size: 2.37 MB - Last synced at: 3 days ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

Innovation-Information-Initiative/Innovation-Data-Processing-Scripts

A shared repository for data cleaning scripts used for innovation data.

Size: 26.4 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 18 - Forks: 2

yanqingan/SfM_Disambiguation

Code for CVPR 2017 paper --- Distinguishing the Indistinguishable: Exploring Structural Ambiguities via Geodesic Context.

Language: C++ - Size: 23.4 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 23 - Forks: 5

DASHANANT/Wordsense

Language: Python - Size: 29.3 KB - Last synced at: 12 days ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

hanuor/amber

Power your app's data links with named entity recognition and disambiguation. A special library for android.

Language: Java - Size: 624 KB - Last synced at: over 2 years ago - Pushed at: almost 9 years ago - Stars: 5 - Forks: 3

joscarlossr/newspaper_entity_disambiguation

Named Entity Disambiguation on Atribuna (Brazil) newspaper as a College research project.

Size: 0 Bytes - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

realyanyang/disambiguation

OAG-WhoIsWho 赛道二 代码分享

Language: Jupyter Notebook - Size: 5.29 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 24 - Forks: 9

wss1996/Name-disambiguation

同名论文消歧的工程化方案(参考2019智源-aminer人名消歧竞赛第一名方案)

Language: Jupyter Notebook - Size: 521 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 17 - Forks: 3

elmiram/Yiddish

Disambiguation for Yiddish

Language: Python - Size: 152 KB - Last synced at: over 1 year ago - Pushed at: about 11 years ago - Stars: 0 - Forks: 1

hirmeos/entity-fishing-client-java

Java client for entity-fishing

Language: Java - Size: 310 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

subhalingamd/nlp-contextual-word-meaning-comparision

Evaluating context-sensitive word meaning understanding in pair of sentences for BERT and GloVe+BiLSTM using WiC dataset | A3 for COL772 course (Fall 21)

Language: Jupyter Notebook - Size: 1.46 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

sayarghoshroy/Acronym-Sense-Disambiguator

Identifies acronyms in a text file and disambiguates possible expansions

Language: Python - Size: 37.1 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

simran-pandey/Disambiguation-keyboard-for-Visually-Challenged

The project proposes a novel design for Disambiguation based chording keyboard for blinds. Results also state that there is an optimum number of words to be included in the corpus which would benefit the users by not increasing his cognitive toll. Empirically, we have found that for the Swarachakra Hindi corpus of 10,000 words, the prediction is best and most effective for the user when the corpus size consist of about 200 high frequency words. This also opens a room for further research where a general method could be implemented to find out the optimum size of corpus in any language model

Language: Python - Size: 17.1 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 1

gabeorlanski/ACL-Author-Disambiguation

Author Entity disambiguation for the new ACL Anthology

Language: Python - Size: 40.2 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 0

Kensuke-Mitsuzawa/word2vec-wikification-py

Disambiguation of wikipedia article name

Language: Python - Size: 31.3 KB - Last synced at: 25 days ago - Pushed at: about 8 years ago - Stars: 16 - Forks: 1

avnomad/dop-parser

An operator precedence parser variation that adds a disambiguation step in order to support overloaded fixities, juxtaposition and other features.

Language: D - Size: 434 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

martinthenext/eth_ml

Projects in Machine Learning ETH team trying to use mechanical turk and active learning for solving word-sense disambiguation task

Language: Python - Size: 1.69 MB - Last synced at: over 1 year ago - Pushed at: almost 11 years ago - Stars: 7 - Forks: 3

tommasosansone91/Metaglossario-v2

Metaglossario is a linguistic database that collects and organizes terminological resources available on the web, without altering their specificity, knowledge heritage and semantic depth. It is a web platform which poses strong attention in a glossary of civil protection terms, released in the formats of the semantic web.

Language: CSS - Size: 3.81 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

BCDH/lemator

A brute-force lemmatizer and disambiguator.

Language: PowerShell - Size: 17.6 KB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

heroxbd/dedup

Deduplication of author names in the literature.

Language: Python - Size: 141 KB - Last synced at: about 1 month ago - Pushed at: almost 6 years ago - Stars: 4 - Forks: 3

DajeRoma/Things_and_Strings

A machine learning method for improving Place Name Disambiguation

Language: Python - Size: 125 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 0

uds-lsv/disambiguation-of-verbal-shifters

This repository contains the dataset created for the LREC 2018 paper "Disambiguation of Verbal Shifters" by Michael Wiegand, Sylvette Loda and Josef Ruppenhofer

Size: 133 KB - Last synced at: almost 2 years ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

mark-lvl/twitter-disambiguation

Disambiguating terms in social media using the Naive-Bayes algorithm.

Language: Python - Size: 22.5 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

ldrahnik/ldrahnik

Personal website.

Language: JavaScript - Size: 325 KB - Last synced at: 2 months ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

generall/SkNN-NEL

Implementation of named entity disambiguation and linking with usage of SkNN algorithm

Language: Scala - Size: 5.45 MB - Last synced at: 10 days ago - Pushed at: about 8 years ago - Stars: 2 - Forks: 1

Related Keywords
disambiguation 56 nlp 14 natural-language-processing 8 python 8 entity-resolution 4 named-entity-recognition 4 entity-linking 4 python3 4 record-linkage 4 linked-data 3 machine-learning 3 knowledge-graph 3 wikipedia 3 wsd 3 deduplication 3 entity-extraction 3 interreg 2 uspto 2 glossary 2 gestisco 2 sklearn 2 civil-protection 2 civil-defence 2 ner 2 string-matching 2 rest-api 2 doi 2 nltk 2 pubmed 2 named-entities 2 wikipedia-database 2 sense 2 lemmatization 2 machinelearning 2 morphological-analysis 2 spacy 2 dependency-parser 2 wikipedia-dump 2 computational-linguistics 2 morphological-disambiguator 2 named-entity-linking 2 named-entity-disambiguation 2 disambiguate 2 author-name-disambiguation 2 dataset 2 terminology 2 fuzzy-matching 2 switzerland 2 semantic-web 2 natural-hazard 2 metaglossary 2 language-processing 2 italy 2 neo4j 1 authority-control 1 universal-dependencies 1 concept 1 django 1 end-of-life 1 hierarchical-alignments 1 identities 1 data-cleaning 1 datasets 1 innovation 1 acronym 1 metadata-extraction 1 multilingual 1 redirects 1 wikiextractor 1 www24 1 link-discovery 1 linking 1 rdf 1 grammar-parser 1 high-school-project 1 natural-language 1 procedural-semanticcs 1 rule-based 1 stemming 1 unicode 1 lexicon 1 wikipedia-scraper 1 protocol-specification 1 aminer 1 network-representation-learning 1 unsupervised-learning 1 bibliographic-datasets 1 gold-standard 1 orcid 1 elasticsearch 1 graph 1 microservice 1 text-processing 1 corpus 1 indian-language 1 keyboard 1 acl-anthology 1 grobid 1 python-3 1 command-line 1