An open API service providing repository metadata for many open source software ecosystems.

Topic: "language-identification"

googlesamples/mlkit

A collection of sample apps to demonstrate how to use Google's ML Kit APIs on Android and iOS

Language: Java - Size: 45.4 MB - Last synced at: 12 days ago - Pushed at: 26 days ago - Stars: 3,754 - Forks: 2,982

modelscope/3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Language: Python - Size: 3.18 MB - Last synced at: 8 days ago - Pushed at: about 1 month ago - Stars: 1,905 - Forks: 162

pemistahl/lingua-py

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

Language: Python - Size: 287 MB - Last synced at: 8 days ago - Pushed at: about 1 month ago - Stars: 1,323 - Forks: 45

pemistahl/lingua-go

The most accurate natural language detection library for Go, suitable for short text and mixed-language text

Language: Go - Size: 226 MB - Last synced at: 10 days ago - Pushed at: 2 months ago - Stars: 1,232 - Forks: 68

pemistahl/lingua-rs

The most accurate natural language detection library for Rust, suitable for short text and mixed-language text

Language: Rust - Size: 241 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 947 - Forks: 45

pemistahl/lingua

The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike

Language: Kotlin - Size: 424 MB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 660 - Forks: 60

echogarden-project/echogarden

Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, speech recognition, forced alignment, speech translation, voice isolation, language detection and more.

Language: TypeScript - Size: 1.54 MB - Last synced at: 3 days ago - Pushed at: 21 days ago - Stars: 354 - Forks: 38

apcode/tensorflow_fasttext

Simple embedding-based text classifier inspired by fastText, implemented in TensorFlow

Language: Python - Size: 58.6 KB - Last synced at: 25 days ago - Pushed at: almost 7 years ago - Stars: 303 - Forks: 91

textpipe/textpipe 📦

Textpipe: clean and extract metadata from text

Language: Python - Size: 340 KB - Last synced at: 17 days ago - Pushed at: almost 4 years ago - Stars: 302 - Forks: 26

vunb/vntk

Vietnamese NLP Toolkit for Node

Language: JavaScript - Size: 3.56 MB - Last synced at: 12 days ago - Pushed at: about 1 year ago - Stars: 216 - Forks: 63

LlmKira/fast-langdetect

⚡️ 80x faster Fasttext language detection out of the box | Split text by language

Language: Python - Size: 941 KB - Last synced at: 16 days ago - Pushed at: 23 days ago - Stars: 182 - Forks: 8

adbar/simplemma

Simple multilingual lemmatizer for Python, especially useful for speed and efficiency

Language: Python - Size: 729 MB - Last synced at: 10 days ago - Pushed at: 5 months ago - Stars: 154 - Forks: 12

HPI-DeepLearning/crnn-lid

Code for the paper Language Identification Using Deep Convolutional Recurrent Neural Networks

Language: Python - Size: 510 MB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 100 - Forks: 48

cisnlp/GlotLID

Language Identification with Support for More Than 2000 Labels -- EMNLP 2023

Language: Python - Size: 409 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 92 - Forks: 7

KrishnaDN/x-vector-pytorch

Implementation of the paper "Spoken Language Recognition using X-vectors" in PyTorch

Language: Python - Size: 86.4 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 78 - Forks: 23

nitotm/efficient-language-detector-js

Fast and accurate natural language detection. Detector written in JavaScript. Nito-ELD, ELD.

Language: JavaScript - Size: 13.5 MB - Last synced at: 16 days ago - Pushed at: 6 months ago - Stars: 61 - Forks: 10

adbar/py3langid (fork of saffsd/langid.py)

Faster, modernized fork of the language identification tool langid.py

Language: Python - Size: 12.3 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 55 - Forks: 9

microsoft/LID-tool

A word-level language identification tool that identifies the language of individual words in code-mixed text, i.e. text that includes words from two languages, such as Hindi written in Roman script mixed with English.

Language: Python - Size: 2.16 MB - Last synced at: 1 day ago - Pushed at: over 4 years ago - Stars: 54 - Forks: 9

DoodleBears/split-lang

✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and budoux

Language: Jupyter Notebook - Size: 295 KB - Last synced at: 29 days ago - Pushed at: about 2 months ago - Stars: 50 - Forks: 4

nitotm/efficient-language-detector

Fast and accurate natural language detection. Detector written in PHP. Nito-ELD, ELD.

Language: PHP - Size: 44.6 MB - Last synced at: 16 days ago - Pushed at: 2 months ago - Stars: 49 - Forks: 6

swshon/dialectID_e2e

End to End Dialect Identification using Convolutional Neural Network

Language: Python - Size: 113 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 49 - Forks: 22

py-lidbox/lidbox

End-to-end spoken language identification out of the box.

Language: Python - Size: 6.52 MB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 48 - Forks: 13

currentslab/fastlangid

fastlangid, the only language identification package that supports Cantonese (zh-yue), Simplified Chinese (zh-hans) and Traditional Chinese (zh-hant)

Language: Python - Size: 1.73 MB - Last synced at: 10 days ago - Pushed at: over 2 years ago - Stars: 39 - Forks: 8

rosette-api/python

Babel Street Analytics Client Library for Python

Language: Python - Size: 1.63 MB - Last synced at: 17 days ago - Pushed at: about 1 month ago - Stars: 38 - Forks: 38

mbanon/fastspell

Targeted language identifier, based on FastText and Hunspell.

Language: Python - Size: 314 KB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 34 - Forks: 4

sagorbrur/codeswitch

CodeSwitch is an NLP tool that can be used for language identification, POS tagging, named entity recognition, and sentiment analysis of code-mixed data.

Language: Jupyter Notebook - Size: 23.4 KB - Last synced at: 17 days ago - Pushed at: over 4 years ago - Stars: 34 - Forks: 6

UBC-NLP/afrolid

AfroLID, a powerful neural toolkit for African language identification that covers 517 African languages.

Language: Python - Size: 12.3 MB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 31 - Forks: 9

zkmkarlsruhe/language-identification

Spoken Language Identification on Common Voice and AudioSet using Deep Learning

Language: Python - Size: 5.44 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 30 - Forks: 6

hiredscorelabs/seqtolang

Multi-Language Identification

Language: Python - Size: 54.4 MB - Last synced at: 6 days ago - Pushed at: 9 months ago - Stars: 28 - Forks: 3

SpeechFlow-io/Spoken_language_identification

TensorFlow-based spoken language identification

Language: Python - Size: 64.6 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 28 - Forks: 2

nipunmanral/Spoken-Language-Identification

Implement a GRU/LSTM model using Keras, and train it to classify the languages using MFCC features

Language: Python - Size: 6.84 KB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 25 - Forks: 16

dataiku/dss-plugin-nlp-preparation

Dataiku DSS plugin to detect languages, correct misspellings, and clean text data 🧼

Language: Python - Size: 17.9 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 23 - Forks: 6

smola/language-dataset

Dataset for programming language identification.

Language: Python - Size: 11.7 MB - Last synced at: 21 days ago - Pushed at: about 2 years ago - Stars: 22 - Forks: 5

igorsitdikov/lid_kaldi

Language: C++ - Size: 32.3 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 22 - Forks: 6

floydhub/language-identification-template

Detect the languages from short pieces of text

Language: Jupyter Notebook - Size: 511 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 21 - Forks: 7

joshdevins/demo-es-lang-ident

Demo: Elasticsearch Language Identification

Language: Python - Size: 43.9 KB - Last synced at: 21 days ago - Pushed at: about 5 years ago - Stars: 19 - Forks: 8

cisnlp/GlotCC

🕸 GlotCC Dataset and Pipeline -- NeurIPS 2024

Language: Jupyter Notebook - Size: 2.31 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 18 - Forks: 0

MartinThoma/lidtk

Language Identification Toolkit

Language: Python - Size: 457 KB - Last synced at: 12 days ago - Pushed at: over 3 years ago - Stars: 18 - Forks: 7

nitotm/efficient-language-detector-py

Fast and accurate natural language detection. Detector written in Python. Nito-ELD, ELD.

Language: Python - Size: 10.6 MB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 17 - Forks: 4

jonsafari/witch-language

Easy language identification of 380 languages

Language: Python - Size: 93.8 KB - Last synced at: 12 days ago - Pushed at: over 5 years ago - Stars: 17 - Forks: 1

skit-ai/Map-Mix

The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix (accepted at ICASSP 2023)

Size: 18.1 MB - Last synced at: 27 days ago - Pushed at: about 2 years ago - Stars: 16 - Forks: 1

KrishnaDN/BERTphone

Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"

Language: Python - Size: 923 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 16 - Forks: 5

commoncrawl/language-detection-cld2

Natural language detection, Java bindings for CLD2

Language: Java - Size: 138 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 14 - Forks: 2

searxng/fasttext-predict (fork of facebookresearch/fastText)

fasttext with wheels and no external dependency, but only the predict method (<1MB)

Language: C++ - Size: 4.36 MB - Last synced at: 22 days ago - Pushed at: 5 months ago - Stars: 13 - Forks: 5

cisnlp/GlotScript

🖋 Resource and Tool for Writing System Identification -- LREC 2024

Language: Python - Size: 128 KB - Last synced at: 5 days ago - Pushed at: 11 months ago - Stars: 13 - Forks: 2

ltkk/language-identifier 📦

A program that predicts the language of a text (like Google Translate ^^)

Language: Python - Size: 774 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 10 - Forks: 0

gluschenko/panlingo

Collection of language detection libraries for .NET: FastText, CLD2, CLD3, MediaPipe, Lingua, Whatlang

Language: C# - Size: 1.36 MB - Last synced at: about 22 hours ago - Pushed at: about 22 hours ago - Stars: 9 - Forks: 0

sinaahmadi/CORDI

Language and Speech Technology for Central Kurdish Varieties (LREC-COLING 2024)

Language: Python - Size: 25.9 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 8 - Forks: 1

yunsii/fasttext.wasm.js

WebAssembly version of fastText with Node and browser support. fastText is a library for efficient text classification and representation learning.

Language: TypeScript - Size: 5.71 MB - Last synced at: 3 days ago - Pushed at: 7 months ago - Stars: 8 - Forks: 0

hb20007/greek-dialect-classifier

Classifier that identifies Greek text as Cypriot Greek or Standard Modern Greek

Language: Jupyter Notebook - Size: 1.05 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 7 - Forks: 3

sinaahmadi/PersoArabicLID

PALI: Language identification for Perso-Arabic Scripts

Language: Python - Size: 139 MB - Last synced at: 12 months ago - Pushed at: almost 2 years ago - Stars: 7 - Forks: 0

swshon/lre15_siam

Language identification using Siamese network based on i-vector

Language: Jupyter Notebook - Size: 8.79 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 7 - Forks: 3

rosette-api/csharp

Babel Street Analytics Client Library for C#

Language: C# - Size: 12.6 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 6 - Forks: 16

rosette-api/php

Babel Street Analytics Client Library for PHP

Language: PHP - Size: 2.06 MB - Last synced at: 11 days ago - Pushed at: 5 months ago - Stars: 6 - Forks: 12

jonathandunn/geoLid

Geographically-informed language identification

Language: Python - Size: 42 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 0

fievelk/pylade

PyLaDe - Language Detection tool.

Language: Python - Size: 469 KB - Last synced at: 15 days ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 1

aparnadutta/code-mixed-lid

Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.

Language: Python - Size: 190 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 0

nikhil-iyer-97/Language-Identifier

Language identification toolkit for identifying what language a document is written in

Language: Python - Size: 7.65 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 5 - Forks: 1

ZJaume/heliport

Fast and accurate language identifier

Language: Rust - Size: 112 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 4 - Forks: 0

rosette-api/ruby

Babel Street Analytics Client Library for Ruby

Language: Ruby - Size: 959 KB - Last synced at: 14 days ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 11

sinaahmadi/KurdishLID

Language identification of Kurdish and Zaza-Gorani languages (& variants)

Language: Shell - Size: 6.2 MB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

Lhx94As/JSTSP_w2v_for_LID

Source code for: Efficient Self-supervised Learning Representations for Spoken Language Identification

Language: Python - Size: 78.1 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

loretoparisi/fasttext.py

FastText Pytorch version

Language: Python - Size: 11.4 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 0

alvations/bayesline-DSL

A Multinomial Bayesian Classification for Language Identification

Language: Python - Size: 152 MB - Last synced at: 22 days ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 4

jonathandunn/idNet

Neural net language identification for many languages on short texts plus construction-based dialectometry

Language: Python - Size: 455 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 2

bung87/whatlangid

This project is built on top of whatthelang and langid

Language: Python - Size: 769 KB - Last synced at: 10 days ago - Pushed at: about 5 years ago - Stars: 4 - Forks: 0

stratosblue/LanguageIdentification

.NET port of the langid-java language identification library.

Language: C# - Size: 2.47 MB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

hemangsk/capacitor-mlkit-language

Capacitor Plugin implementing Language Identification on Android & iOS using Google's on-device ML library - ML Kit

Language: Swift - Size: 242 KB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 0

wahabjawed/language-detection-mobile

Language Identification from Very Short Strings on Mobile Devices - Using Deep Neural Network

Language: Jupyter Notebook - Size: 25.4 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 0

alvayliu/Language-Identification-with-CNN

Spoken language identification (LID) systems allow for automatic language detection given speech data. Among the many available methods that can be applied to this classification task, modern machine learning and deep learning approaches have been reported as effective. A previous study approached the problem of spoken language identification in the image domain by transforming speech samples to spectrograms and classifying them using convolutional neural networks (CNNs). We have implemented two similar types of CNNs and trained them on data for five languages from the SpeechDat database. Then, we investigated how well their performance generalised to speech samples from a source other than SpeechDat. The results indicated that even though the models could achieve over 80 % test accuracy on SpeechDat data, they did not perform well on speech samples not originating from the SpeechDat database, with the best model achieving 37.5 % accuracy.

Size: 5.93 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 0

dhpollack/spokenlanguages 📦

Language: Jupyter Notebook - Size: 2.51 MB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 3 - Forks: 5

sinaahmadi/teshi

An atlas of Central Kurdish dialects + a simple game to detect dialects

Language: HTML - Size: 1.83 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

proycon/colibri-utils

NLP utilities that rely on Colibri Core: currently only language identification

Language: TeX - Size: 19.8 MB - Last synced at: 11 days ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

cisnlp/MaskLID

MaskLID: Code-Switching Language Identification through Iterative Masking -- ACL 2024

Language: Python - Size: 12.7 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

guo-yong-zhi/LanguageIdentification.jl

A Julia package for language identification.

Language: Julia - Size: 16.2 MB - Last synced at: 9 days ago - Pushed at: 12 months ago - Stars: 2 - Forks: 1

mbanon/benchmarks

Several benchmarks on sentence splitting and language identification

Language: Mathematica - Size: 35.7 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

tsimafeip/mt5-lang-detect

Fine-Tuning the Multilingual Text-To-Text Transfer Transformer (MT5) for Predicting The Language Of The Given Text

Language: Jupyter Notebook - Size: 7.37 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 2

xzhren/PreferenceAwareLID

Unsupervised Preference-Aware Language Identification

Language: Python - Size: 7 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

imdiptanu/language-identification

Detect the text language automatically using a bigram model, Support Vector Machines, and Artificial Neural Networks. The model is trained using the WiLI-2018 benchmark dataset, and the highest accuracy achieved on the test dataset is 99.7% with paragraph text.

Language: Python - Size: 51.9 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

alianoroozi/Language-Identification

A Python implementation of language identification using Cavnar and Trenkle's approach and the WiLI-2018 database

Language: Jupyter Notebook - Size: 58.5 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0

shakiz/SpeechRecognition

Speech recognition on Android is in high demand now that artificial intelligence is everywhere. This project extends the work from recognizing text from speech to converting it into the user's native language and finally performing some tasks.

Language: Java - Size: 125 KB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 1

brijmohan/lid-convex-comb

Convex combination of phonotactics for large-scale spoken language identification

Language: Python - Size: 16.3 MB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 2

jkaloger/langid

Language Identification of Short Text Documents

Language: Python - Size: 143 KB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 0

vhidvz/language-identification

Language identification microservice powered by the FastText language detection model

Language: Python - Size: 62.5 KB - Last synced at: 2 days ago - Pushed at: 7 months ago - Stars: 1 - Forks: 1

andrianllmm/tagLID

A word-level Language Identification (LID) tool for Tagalog-English (Taglish) text.

Language: Python - Size: 610 KB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

bhagat-sahil/Multilingual-Text-Translation-App

Created an Android app with Google ML Kit for translation among 59 languages. Integrated image text extraction with ML Text Recognition v2, language auto-detection for unidentified text using Language Identification. Added a text-to-speech feature for reading translated text aloud.

Language: Kotlin - Size: 103 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

DoodleBears/langdetect (fork of veelion/langdetect)

Port of Google's language-detection library to Python.

Language: Python - Size: 1.07 MB - Last synced at: 1 day ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

caiselvas/language-identification

An NLP project leveraging character trigrams and smoothing techniques (Lidstone, Linear Discounting, Absolute Discounting) for language identification. Trained on Spanish, Italian, English, French, Dutch, and German, achieving 99.8932% accuracy. Includes datasets, model parameters, and comprehensive documentation.

Language: Jupyter Notebook - Size: 75.2 MB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

AndyHe021112/math125a

Math125A_LanguageIdentification

Language: Jupyter Notebook - Size: 11.9 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

dimits-ts/practical_data_science

Projects concerning LLMs, prompting, NLP, web scraping, data acquisition and dataset analysis.

Language: Jupyter Notebook - Size: 125 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

bimarakajati/Nusacular

This project aims to develop a system capable of detecting regional languages or dialects from text. We will collect a dataset containing text in various regional languages or dialects and train machine learning models to recognize and classify them.

Language: Python - Size: 54 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 2

Dahouabdelhalim/Language_Identificaiton

Language identification Python script

Language: Jupyter Notebook - Size: 6.84 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

MathurUtkarsh/Language-Identification-in-Audio-Using-Deep-CRNN

Code for the paper Language Identification Using Deep Convolutional Recurrent Neural Networks

Language: Python - Size: 178 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 1

todd-gavin/DSCI550-PixstoryMediaExtractionAndAnalysis

Extraction and analysis of the PixStory social media dataset using language detection, language translation, the Tika GeoTopic parser, Tika image object recognition/image caption generation, and PyTorch Detoxify.

Language: Jupyter Notebook - Size: 349 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

ir-nlp-csui/id-en-code-mixed

Indonesian-English code-mixed Twitter dataset

Size: 288 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

zkmkarlsruhe/museum-label

Auto-adaptive, AI-supported museum label with language identification

Language: JavaScript - Size: 5.48 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

mmaguero/lang_detection

Language detection (majority voting)

Language: Python - Size: 25.4 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

jjlee0802cu/open-set-lid

Open-set speech language identification https://arxiv.org/abs/2205.10397

Language: Shell - Size: 684 MB - Last synced at: about 2 months ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

ralf-koenig/se4ai-pr01-gr07

Lecture on "Software Engineering for AI-enabled systems", summer term 2022, Project 01, Group 07

Language: Jupyter Notebook - Size: 11.3 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

sachinraghult/Language-Identification-ML

Language identification using NLP. The dataset used is Europarl, which consists of over 21 European languages and is extracted from the proceedings of the European Parliament. Both a Logistic Regression model and a Multinomial Naive Bayes model are trained on it, and the trained model is deployed with a Flask front end as the user interface.

Language: Jupyter Notebook - Size: 10.6 MB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 1

Related Topics
nlp (46), language-detection (34), natural-language-processing (24), python (19), machine-learning (19), fasttext (13), language-recognition (12), deep-learning (12), language-classification (8), pytorch (7), nlp-machine-learning (7), tensorflow (7), language-identifier (6), natural-language (6), code-mixing (6), code-switching (6), language-model (5), keras (5), language-detector (5), speech (5), speech-processing (5), android (5), langid (5), speech-recognition (5), lid (5), jupyter-notebook (4), text-to-speech (4), translation (4), lstm (4), language-processing (4), language-identification-toolkit (4), kurdish (4), text-analysis (4), spoken-language-identification (4), langdetect (4), entity-extraction (4), morphology (4), named-entity-recognition (4), language (4), neural-network (4), spoken-language-recognition (4), tokenization (4), arabic (3), intelligent-museum (3), nltk (3), dialect-identification (3), python3 (3), glot (3), cnn (3), text-recognition (3), speech-to-text (3), n-grams (3), sorani (3), languagedetector (3), text-analytics (3), convolutional-neural-networks (3), text-embedding (3), text-processing (3), kurdish-language-processing (3), sentiment-analysis (3), dialects (2), corpus-linguistics (2), dialect (2), kurmanji (2), tf-idf (2), lre (2), persian (2), lemmatization (2), computational-linguistics (2), english (2), transformer (2), code-switch (2), php (2), naive-bayes (2), nodejs (2), ruby (2), cld2 (2), mlkit (2), text-classification (2), huggingface (2), gorani (2), ai (2), dnn (2), i-vector (2), identification (2), linguistics (2), ios (2), landmark-detection (2), json (2), java (2), currency-converter (2), cloud-vision-api (2), multlingual (2), neural-networks (2), gru (2), glotlid (2), glotcc (2), dataset (2), ngrams (2), computer-vision (2)