An open API service providing repository metadata for many open source software ecosystems.

framagit.org topics: token

nlp/iamtokenizing

Tokenizer classes for several NLP tasks: splitting a text on white space, using a REGEX expression, ... This package is based on the tokenspan package, see https://framagit.org/nlp/tokenspan

Last synced at: 18 days ago - Stars: 0 - Forks: 0

nlp/extractionstring

Extract part of a string in a versatile way, and without destroying information from the parent string. Allows discontinuous part of a string to be collected as an ExtractionString. Allows several strategies of string-splitting at the same time, for a given string.

Last synced at: 26 days ago - Stars: 0 - Forks: 0

nlp/tokenspan

Deprecated from sept. 2022. See https://framagit.org/nlp/extractionstring for improved tools to extract any sub-string from a parent one without losing information from the parent string.

Last synced at: 6 days ago - Stars: 0 - Forks: 0