Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub / SpydazWebAI-NLP / Basic_Tokenizer2023

Tokenizer is a text processing library written in Visual Basic (VB.NET). It tokenizes text into words, sentences, characters, and n-grams, and is designed to be customizable and easy to integrate into VB.NET projects.
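To make the four tokenization levels named in the description concrete, here is a minimal sketch in Python. It is not the library's VB.NET API: the function names, the regular expressions, and the sentence-splitting rule are all assumptions chosen for illustration, and the repository's own implementation may behave differently.

```python
import re
from typing import List, Sequence, Tuple


def tokenize_words(text: str) -> List[str]:
    """Split text into word tokens (runs of letters, digits, underscores)."""
    return re.findall(r"\w+", text)


def tokenize_sentences(text: str) -> List[str]:
    """Split text after '.', '!' or '?' followed by whitespace (a naive rule)."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def tokenize_chars(text: str) -> List[str]:
    """Split text into single characters, skipping whitespace."""
    return [c for c in text if not c.isspace()]


def ngrams(tokens: Sequence[str], n: int) -> List[Tuple[str, ...]]:
    """Return all contiguous n-token windows over a token sequence."""
    if n <= 0:
        raise ValueError("n must be positive")
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


if __name__ == "__main__":
    sample = "Hello world. How are you?"
    words = tokenize_words(sample)
    print(words)
    print(tokenize_sentences(sample))
    print(ngrams(words, 2))
```

The n-gram helper works on any token sequence, so the same function can produce word bigrams or character trigrams depending on which tokenizer feeds it.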

JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SpydazWebAI-NLP%2FBasic_Tokenizer2023
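The URL above carries the repository's full name as a single path segment, so the slash between owner and repository is percent-encoded as `%2F`. A small sketch of building such a URL follows; the helper name `repo_api_url` and the `API_ROOT` constant are hypothetical, and the path layout is inferred only from the one URL shown here, not from the service's documentation.

```python
from urllib.parse import quote

# Assumed from the URL on this page; not taken from official API docs.
API_ROOT = "https://repos.ecosyste.ms/api/v1"


def repo_api_url(host: str, full_name: str) -> str:
    """Build a repository URL, percent-encoding '/' in the full name."""
    # safe="" forces '/' to be encoded as %2F instead of left as a separator.
    return (
        f"{API_ROOT}/hosts/{quote(host, safe='')}"
        f"/repositories/{quote(full_name, safe='')}"
    )


if __name__ == "__main__":
    print(repo_api_url("GitHub", "SpydazWebAI-NLP/Basic_Tokenizer2023"))
```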

Stars: 0
Forks: 1
Open Issues: 0

License: MIT
Language: Visual Basic .NET
Repo Size: 1.06 MB
Dependencies: 0

Created: 11 months ago
Updated: 9 months ago
Last pushed: 9 months ago
Last synced: 9 months ago

Topics: bpe, frequent-pattern-mining, ngrams, pmi, text-preprocessing, tokenization, tokenizer, vocabulary-builder
