An open API service providing repository metadata for many open source software ecosystems.

Topic: "language-recognition"

antlr/antlr4

ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.

Language: Java - Size: 67.3 MB - Last synced at: 1 day ago - Pushed at: 20 days ago - Stars: 17,841 - Forks: 3,344

pemistahl/lingua-py

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

Language: Python - Size: 287 MB - Last synced at: 10 days ago - Pushed at: about 1 month ago - Stars: 1,323 - Forks: 45

pemistahl/lingua-go

The most accurate natural language detection library for Go, suitable for short text and mixed-language text

Language: Go - Size: 226 MB - Last synced at: 13 days ago - Pushed at: 3 months ago - Stars: 1,232 - Forks: 68

greyblake/whatlang-rs

Natural language detection library for Rust. Try demo online: https://whatlang.org/

Language: Rust - Size: 2.05 MB - Last synced at: 33 minutes ago - Pushed at: about 1 month ago - Stars: 1,010 - Forks: 112

pemistahl/lingua-rs

The most accurate natural language detection library for Rust, suitable for short text and mixed-language text

Language: Rust - Size: 241 MB - Last synced at: 24 minutes ago - Pushed at: 1 day ago - Stars: 953 - Forks: 45

pemistahl/lingua

The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike

Language: Kotlin - Size: 424 MB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 660 - Forks: 60

cisnlp/GlotLID

Language Identification with Support for More Than 2000 Labels -- EMNLP 2023

Language: Python - Size: 409 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 92 - Forks: 7

KrishnaDN/x-vector-pytorch

Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch

Language: Python - Size: 86.4 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 78 - Forks: 23

adbar/py3langid Fork of saffsd/langid.py

Faster, modernized fork of the language identification tool langid.py

Language: Python - Size: 12.3 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 55 - Forks: 9

py-lidbox/lidbox

End-to-end spoken language identification out of the box.

Language: Python - Size: 6.52 MB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 48 - Forks: 13

SpeechFlow-io/Spoken_language_identification

A TensorFlow-based spoken language identification

Language: Python - Size: 64.6 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 28 - Forks: 2

igorsitdikov/lid_kaldi

Language: C++ - Size: 32.3 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 22 - Forks: 6

theolepage/ssl-for-slr

Collection of self-supervised models for speaker and language recognition tasks.

Language: Jupyter Notebook - Size: 4.67 MB - Last synced at: 24 days ago - Pushed at: over 3 years ago - Stars: 19 - Forks: 2

lord-alfred/dnlp

📚 Сборник полезных штук из Natural Language Processing: Определение языка текста, Разделение текста на предложения, Получение основного содержимого из html документа

Language: Python - Size: 43 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 17 - Forks: 5

swshon/dialectID_siam

Dialect identification using Siamese network

Language: Jupyter Notebook - Size: 116 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 15 - Forks: 4

ZeroBone/Knife

Knife is a Java top-down parser generator for building parsers from grammars in BNF format.

Language: Java - Size: 251 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 10 - Forks: 1

Vanova/mfom_attribute_detection

Multi-label MFoM Framework for Speech Articulatory Attributes Detection

Language: HTML - Size: 995 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 7 - Forks: 4

swshon/lre15_siam

Language identification using Siamese network based on i-vector

Language: Jupyter Notebook - Size: 8.79 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 7 - Forks: 3

A-LPG/LPG2

The LALR parser generator (LPG) is a tool for developing scanners and parsers. Supports multi-language . Input is specified by BNF rules. LPG supports backtracking (to resolve ambiguity), automatic AST generation and grammar inheritance.

Language: C++ - Size: 3.7 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 2

sigrlami/lanhunch

Language Detection Library

Language: Haskell - Size: 7.81 KB - Last synced at: 5 days ago - Pushed at: almost 6 years ago - Stars: 6 - Forks: 0

Marcono1234/tiny-lingua Fork of pemistahl/lingua

👄 Fork of the language detector Lingua, with the intention to increase detection speed and reduce memory consumption

Language: Kotlin - Size: 1.89 GB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 1

osadj/kaldi_nnet3

Simple, yet fast, Python scripts to read Kaldi NNet3 models and compute bottleneck features

Language: Python - Size: 12.7 KB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 5 - Forks: 5

Lhx94As/JSTSP_w2v_for_LID

Source code for: Efficient Self-supervised Learning Representations for Spoken Language Identification

Language: Python - Size: 78.1 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

4security/Select20

The todo app Select20 leverages language recognition to manage tasks more efficiently. The distraction-free and blazing fast app supports offline usage and compatibility to CalDav.

Language: PHP - Size: 4.85 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

takoshiobi/detect-language

🎏🎌 language recognition script implemented using basic algorithms and spaghetti code

Language: Python - Size: 117 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 0

azagniotov/language-detection

This is a refined and re-implemented version of the archived plugin for ElasticSearch elasticsearch-langdetect, which itself builds upon the original work by Nakatani Shuyo, found at https://github.com/shuyo/language-detection. The aforementioned implementation by Nakatani Shuyo serves as the default language detection component within Apache Solr.

Language: Java - Size: 18.2 MB - Last synced at: 4 days ago - Pushed at: 7 days ago - Stars: 2 - Forks: 0

ZeroBone/Grammax

Grammax is a Java & C++ bottom-up SLR/CLR parser generator that builds parsers from grammars in Backus-Naur-Form.

Language: Java - Size: 478 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 1

mc-cat-tty/Language-Classification

Suite of Python modules to recognise the language of a file

Language: Python - Size: 12.9 MB - Last synced at: 21 days ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

golecalicja/language-recognition-neural-network

A single-layer neural network written from scratch that predicts the language of the text.

Language: Python - Size: 1.82 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

kuafuwang/LPG2 📦

The LALR parser generator (LPG) is a tool for developing scanners and parsers written in TypeScript ,C#, Java, C++ or C. Input is specified by BNF rules. LPG supports backtracking (to resolve ambiguity), automatic AST generation and grammar inheritance.

Language: C++ - Size: 1.75 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

Boriszn/NASAbot

The NASABot integrated with NASA API and LUIS (Language Recognition Service). It provides access to the latest NASA API (like Space Weather Database Of Notifications and other NASA services) using plain English and Natural User Flow.

Language: C# - Size: 2.74 MB - Last synced at: 4 days ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 1

guidsdo/ima-parse

Simple and convenient yet powerful parsing lib. No Regexes, tree walkers, (E)BNF or books necessary! No separate lexer required.

Language: TypeScript - Size: 1.8 MB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

isbainemohamed/Youtube-language-recognition-Piepline

This project is about creating an automated youtube videos scraper using Airflow, Selenium, ytb-dlp library.

Language: Jupyter Notebook - Size: 24.6 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

brunneis/linguakit-streaming Fork of citiususc/Linguakit

Streaming version of Linguakit, a multilingual toolkit for NLP

Language: Perl - Size: 298 MB - Last synced at: 12 months ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

wpdevelopment11/codeblocks

Modify Markdown fenced code blocks to contain the language name by detecting it from the block contents.

Language: Python - Size: 7.81 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

OneOffTech/laravel-language-recognizer

Recognize languages in text in a Laravel application

Language: PHP - Size: 49.8 KB - Last synced at: 8 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 1

jackr276/Context-Free-Language-Recognition-with-a-PDA

Implementation of a Pushdown Automaton that recognizes strings belonging to a language valid arithmetic expressions over floating point numbers

Language: C++ - Size: 138 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

sdingcn/ecc

an easy-to-use parser generator (compiler-compiler)

Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

LucaSpadoni/Compiler-and-Interpreter-based-on-ANTLR

Implementation of a parser, a compiler and an interpreter for a programming language called “SimplanPlus” which is based on ANTLR.

Language: Java - Size: 2.9 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 1

PejperO/NeuralNetwork-Language

NeuralNetwork Language is a console application that uses a single-layer neural network to identify the language of a given text. It trains on data in the trainData folder and predicts the language of texts in the testData folder or manually inputted text. The application supports multiple languages as long as they are correctly labeled.

Language: Java - Size: 36.1 KB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Viggy1357/selfmadeOCR

This project focuses on language translation of images to texts using Pytesseract. This program successfully translates 4 different images in terms of languages and sources into english. This program is capable to translate more than 50 languages using Pytesseract and google translate.

Language: Jupyter Notebook - Size: 3.28 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

jathu5/compiler

A compiler in Racket for a subset of C/C++. Applied language recognition theories.

Language: Racket - Size: 18 MB - Last synced at: 9 months ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

MHoubre/dialectID_e2e Fork of swshon/dialectID_e2e

End to End Dialect Identification using Convolutional Neural Network

Language: Python - Size: 113 MB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

tiagohpf/tai-2017-trab2

Recognition of language inserted

Language: Java - Size: 1.13 MB - Last synced at: almost 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

tsotne95/tooSimpleLanguageRecognition

too Simple Language Recognition

Language: Python - Size: 19.5 KB - Last synced at: about 1 year ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 0

Related Topics
nlp 13 language-identification 12 language-detection 11 python 8 language-classification 8 parser-generator 7 natural-language-processing 6 grammar 6 nlp-machine-learning 5 java 4 language-processing 4 text-classification 4 detect-language 4 parse 4 parsing 4 parser 3 dfa 3 tensorflow 3 language-detector 3 spoken-language-recognition 3 speech 3 grammar-parser 3 antlr 3 language 3 ai 3 machine-learning 3 text-processing 2 no-dependencies 2 grammars 2 text-analysis 2 spoken-language-identification 2 language-detection-library 2 language-detection-lib 2 nlp-library 2 rust 2 laravel 2 self-supervised-learning 2 antlr4 2 siamese-network 2 automation 2 siamese 2 i-vector 2 neural-network 2 text-classifier 2 speaker-recognition 2 langid 2 whatlang 2 kaldi 2 classifier 2 language-classifier 2 cnn 2 deep-learning 2 bnf 2 yacc-lex 2 scannerless 2 lalr-parser-generator 2 natural-language 2 lalr 2 hacktoberfest 1 standalone 1 single-layer-perceptron 1 compiler-compiler 1 haskell 1 console-application 1 data-efficient 1 simplelanguagerecognition 1 german 1 simple 1 language-learning 1 compiler 1 interpreter 1 artificial-intelligence 1 nasa-bot 1 nasa-services 1 space-weather-database 1 nasa-api 1 context-free-grammar 1 pushdown-automaton 1 luis 1 azure-bot-service 1 azure-bot-framework 1 theoretical-computer-science 1 assembly 1 binary 1 context-sensitive-analysis 1 mips 1 azure 1 scanner 1 testing 1 training-dataset 1 low-resource-languages 1 low-resource-nlp 1 multlingual 1 python-library 1 algorithm 1 rustlang 1 cpp 1 csharp 1 dart 1 golang 1