Topic: "language-recognition"
antlr/antlr4
ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.
Language: Java - Size: 67.3 MB - Last synced at: 1 day ago - Pushed at: 20 days ago - Stars: 17,841 - Forks: 3,344

pemistahl/lingua-py
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
Language: Python - Size: 287 MB - Last synced at: 10 days ago - Pushed at: about 1 month ago - Stars: 1,323 - Forks: 45

pemistahl/lingua-go
The most accurate natural language detection library for Go, suitable for short text and mixed-language text
Language: Go - Size: 226 MB - Last synced at: 13 days ago - Pushed at: 3 months ago - Stars: 1,232 - Forks: 68

greyblake/whatlang-rs
Natural language detection library for Rust. Try demo online: https://whatlang.org/
Language: Rust - Size: 2.05 MB - Last synced at: 33 minutes ago - Pushed at: about 1 month ago - Stars: 1,010 - Forks: 112

pemistahl/lingua-rs
The most accurate natural language detection library for Rust, suitable for short text and mixed-language text
Language: Rust - Size: 241 MB - Last synced at: 24 minutes ago - Pushed at: 1 day ago - Stars: 953 - Forks: 45

pemistahl/lingua
The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Language: Kotlin - Size: 424 MB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 660 - Forks: 60

cisnlp/GlotLID
Language Identification with Support for More Than 2000 Labels -- EMNLP 2023
Language: Python - Size: 409 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 92 - Forks: 7

KrishnaDN/x-vector-pytorch
Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch
Language: Python - Size: 86.4 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 78 - Forks: 23

adbar/py3langid Fork of saffsd/langid.py
Faster, modernized fork of the language identification tool langid.py
Language: Python - Size: 12.3 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 55 - Forks: 9

py-lidbox/lidbox
End-to-end spoken language identification out of the box.
Language: Python - Size: 6.52 MB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 48 - Forks: 13

SpeechFlow-io/Spoken_language_identification
A TensorFlow-based spoken language identification
Language: Python - Size: 64.6 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 28 - Forks: 2

igorsitdikov/lid_kaldi
Language: C++ - Size: 32.3 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 22 - Forks: 6

theolepage/ssl-for-slr
Collection of self-supervised models for speaker and language recognition tasks.
Language: Jupyter Notebook - Size: 4.67 MB - Last synced at: 24 days ago - Pushed at: over 3 years ago - Stars: 19 - Forks: 2

lord-alfred/dnlp
📚 Сборник полезных штук из Natural Language Processing: Определение языка текста, Разделение текста на предложения, Получение основного содержимого из html документа
Language: Python - Size: 43 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 17 - Forks: 5

swshon/dialectID_siam
Dialect identification using Siamese network
Language: Jupyter Notebook - Size: 116 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 15 - Forks: 4

ZeroBone/Knife
Knife is a Java top-down parser generator for building parsers from grammars in BNF format.
Language: Java - Size: 251 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 10 - Forks: 1

Vanova/mfom_attribute_detection
Multi-label MFoM Framework for Speech Articulatory Attributes Detection
Language: HTML - Size: 995 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 7 - Forks: 4

swshon/lre15_siam
Language identification using Siamese network based on i-vector
Language: Jupyter Notebook - Size: 8.79 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 7 - Forks: 3

A-LPG/LPG2
The LALR parser generator (LPG) is a tool for developing scanners and parsers. Supports multi-language . Input is specified by BNF rules. LPG supports backtracking (to resolve ambiguity), automatic AST generation and grammar inheritance.
Language: C++ - Size: 3.7 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 2

sigrlami/lanhunch
Language Detection Library
Language: Haskell - Size: 7.81 KB - Last synced at: 5 days ago - Pushed at: almost 6 years ago - Stars: 6 - Forks: 0

Marcono1234/tiny-lingua Fork of pemistahl/lingua
👄 Fork of the language detector Lingua, with the intention to increase detection speed and reduce memory consumption
Language: Kotlin - Size: 1.89 GB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 1

osadj/kaldi_nnet3
Simple, yet fast, Python scripts to read Kaldi NNet3 models and compute bottleneck features
Language: Python - Size: 12.7 KB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 5 - Forks: 5

Lhx94As/JSTSP_w2v_for_LID
Source code for: Efficient Self-supervised Learning Representations for Spoken Language Identification
Language: Python - Size: 78.1 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

4security/Select20
The todo app Select20 leverages language recognition to manage tasks more efficiently. The distraction-free and blazing fast app supports offline usage and compatibility to CalDav.
Language: PHP - Size: 4.85 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

takoshiobi/detect-language
🎏🎌 language recognition script implemented using basic algorithms and spaghetti code
Language: Python - Size: 117 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 0

azagniotov/language-detection
This is a refined and re-implemented version of the archived plugin for ElasticSearch elasticsearch-langdetect, which itself builds upon the original work by Nakatani Shuyo, found at https://github.com/shuyo/language-detection. The aforementioned implementation by Nakatani Shuyo serves as the default language detection component within Apache Solr.
Language: Java - Size: 18.2 MB - Last synced at: 4 days ago - Pushed at: 7 days ago - Stars: 2 - Forks: 0

ZeroBone/Grammax
Grammax is a Java & C++ bottom-up SLR/CLR parser generator that builds parsers from grammars in Backus-Naur-Form.
Language: Java - Size: 478 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 1

mc-cat-tty/Language-Classification
Suite of Python modules to recognise the language of a file
Language: Python - Size: 12.9 MB - Last synced at: 21 days ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

golecalicja/language-recognition-neural-network
A single-layer neural network written from scratch that predicts the language of the text.
Language: Python - Size: 1.82 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

kuafuwang/LPG2 📦
The LALR parser generator (LPG) is a tool for developing scanners and parsers written in TypeScript ,C#, Java, C++ or C. Input is specified by BNF rules. LPG supports backtracking (to resolve ambiguity), automatic AST generation and grammar inheritance.
Language: C++ - Size: 1.75 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

Boriszn/NASAbot
The NASABot integrated with NASA API and LUIS (Language Recognition Service). It provides access to the latest NASA API (like Space Weather Database Of Notifications and other NASA services) using plain English and Natural User Flow.
Language: C# - Size: 2.74 MB - Last synced at: 4 days ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 1

guidsdo/ima-parse
Simple and convenient yet powerful parsing lib. No Regexes, tree walkers, (E)BNF or books necessary! No separate lexer required.
Language: TypeScript - Size: 1.8 MB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

isbainemohamed/Youtube-language-recognition-Piepline
This project is about creating an automated youtube videos scraper using Airflow, Selenium, ytb-dlp library.
Language: Jupyter Notebook - Size: 24.6 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

brunneis/linguakit-streaming Fork of citiususc/Linguakit
Streaming version of Linguakit, a multilingual toolkit for NLP
Language: Perl - Size: 298 MB - Last synced at: 12 months ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

wpdevelopment11/codeblocks
Modify Markdown fenced code blocks to contain the language name by detecting it from the block contents.
Language: Python - Size: 7.81 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

OneOffTech/laravel-language-recognizer
Recognize languages in text in a Laravel application
Language: PHP - Size: 49.8 KB - Last synced at: 8 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 1

jackr276/Context-Free-Language-Recognition-with-a-PDA
Implementation of a Pushdown Automaton that recognizes strings belonging to a language valid arithmetic expressions over floating point numbers
Language: C++ - Size: 138 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

sdingcn/ecc
an easy-to-use parser generator (compiler-compiler)
Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

LucaSpadoni/Compiler-and-Interpreter-based-on-ANTLR
Implementation of a parser, a compiler and an interpreter for a programming language called “SimplanPlus” which is based on ANTLR.
Language: Java - Size: 2.9 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 1

PejperO/NeuralNetwork-Language
NeuralNetwork Language is a console application that uses a single-layer neural network to identify the language of a given text. It trains on data in the trainData folder and predicts the language of texts in the testData folder or manually inputted text. The application supports multiple languages as long as they are correctly labeled.
Language: Java - Size: 36.1 KB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Viggy1357/selfmadeOCR
This project focuses on language translation of images to texts using Pytesseract. This program successfully translates 4 different images in terms of languages and sources into english. This program is capable to translate more than 50 languages using Pytesseract and google translate.
Language: Jupyter Notebook - Size: 3.28 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

jathu5/compiler
A compiler in Racket for a subset of C/C++. Applied language recognition theories.
Language: Racket - Size: 18 MB - Last synced at: 9 months ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

MHoubre/dialectID_e2e Fork of swshon/dialectID_e2e
End to End Dialect Identification using Convolutional Neural Network
Language: Python - Size: 113 MB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

tiagohpf/tai-2017-trab2
Recognition of language inserted
Language: Java - Size: 1.13 MB - Last synced at: almost 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

tsotne95/tooSimpleLanguageRecognition
too Simple Language Recognition
Language: Python - Size: 19.5 KB - Last synced at: about 1 year ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 0
