Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub / SpydazWebAI-NLP / Basic_Tokenizer2023
The Tokenizer is a versatile text processing library written in Visual Basic (VB.NET). It provides functionalities for tokenizing text into words, sentences, characters, and n-grams. The library is designed to be flexible, customizable, and easy to integrate into your VB.NET projects.
JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SpydazWebAI-NLP%2FBasic_Tokenizer2023
Stars: 0
Forks: 1
Open Issues: 0
License: mit
Language: Visual Basic .NET
Repo Size: 1.06 MB
Dependencies:
0
Created: 11 months ago
Updated: 9 months ago
Last pushed: 9 months ago
Last synced: 9 months ago
Topics: bpe, frequent-pattern-mining, ngrams, pmi, text-preprocessing, tokenization, tokenizer, vocabulary-builder
Files
No dependencies found