An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: multiword-expressions

nert-nlp/streusle

STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)

Language: Python - Size: 42.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 66 - Forks: 17

meghdadFar/wordview

A Python package for Exploratory Data Analysis (EDA) for text-based data.

Language: Python - Size: 40.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 10 - Forks: 1

kavgan/phrase-at-scale

Detect common phrases in large amounts of text using a data-driven approach. Size of discovered phrases can be arbitrary. Can be used in languages other than English

Language: Python - Size: 80.6 MB - Last synced at: 5 months ago - Pushed at: about 6 years ago - Stars: 128 - Forks: 45

empiriker/mwe-detector

A SpaCy MWE identification pipeline component

Language: Python - Size: 14.9 MB - Last synced at: 5 days ago - Pushed at: 10 months ago - Stars: 3 - Forks: 0

Mindful/MWEasWSD

Repo for the paper "MWE as WSD: Solving Multi-Word Expression Identification with Word Sense Disambiguation"

Language: Python - Size: 1.12 MB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 1

agneknie/com4520DarwinProject

Adjacent code related to the paper prepared for Joint Workshop on Multiword Expressions and Universal Dependencies (MWE-UD 2024), 25th May, 2024.

Language: Jupyter Notebook - Size: 111 MB - Last synced at: 10 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

gwaps4nlp/rigor-mortis

Rigor-Mortis is an online GWAP where players have to find multiword expressions in French sentences

Language: PHP - Size: 5.79 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 2

dcavar/fomaMWT

Foma-based multi-word tagger and morphological analyzer

Language: C++ - Size: 614 KB - Last synced at: 5 months ago - Pushed at: about 7 years ago - Stars: 7 - Forks: 1

meghdadFar/SDMA

Python implementation of Substitution-driven Measures of Association

Language: Python - Size: 44.9 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Babelscape/ner4id

Data and code for the paper "NER4ID at SemEval-2022 Task 2: Named Entity Recognition for Idiomaticity Detection".

Language: Jupyter Notebook - Size: 1.55 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

deep-bgt/Deep-BGT

Language: Python - Size: 24.4 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 0

vered1986/NC_embeddings

Comparison between various noun compound embeddings

Language: Python - Size: 2.32 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 9 - Forks: 2

Babelscape/ID10M

Data and code for the paper "ID10M: Idiom Identification in 10 Languages" (NAACL 2022).

Language: Python - Size: 17.9 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

M4t1ss/MWE-Tools

A set of useful tools for use with multiword expression extraction from parallel corpora for Moses statistical machine translation system

Language: C++ - Size: 744 KB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 8 - Forks: 3

meghdadFar/mwes_m2

Java implementation of substitution driven measures of association that can be used to identify MWEs.

Language: Java - Size: 153 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

omidrohanian/gappy-mwes

Code for NAACL 2019 paper: "Bridging the Gap: Attending to Discontinuity in Identification of Multiword Expressions"

Language: Python - Size: 1010 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 12 - Forks: 4

isVy08/mwe-demo

Learning English expressions has never been so easy

Language: JavaScript - Size: 575 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

dimsum16/dimsum-data

Data for the DiMSUM shared task at SEMEVAL 2016

Language: Python - Size: 2.46 MB - Last synced at: over 2 years ago - Pushed at: over 9 years ago - Stars: 14 - Forks: 5

OFAI/German_SupportVerbConstructions_FigurativExpressions

Size: 255 KB - Last synced at: 22 days ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0