An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: linguistic-analysis

sillsdev/FieldWorks

FieldWorks is a suite of software tools for language and cultural data, with support for complex scripts.

Language: C# - Size: 1010 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 93 - Forks: 33

hoangsonww/Amazon-Reviews-Analysis

🧐 This project analyzes Amazon Fine Food Reviews to investigate whether negative reviews are more emotionally intense and lexically repetitive than positive ones. Using R, we apply sentiment analysis and lexical diversity metrics to uncover patterns in consumer review language.

Language: R - Size: 279 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3 - Forks: 0

DmitryRyumin/INTERSPEECH-2023-24-Papers

INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

Size: 11.4 MB - Last synced at: 4 days ago - Pushed at: 4 months ago - Stars: 666 - Forks: 42

matthias-stemmler/annimate

Your Friendly ANNIS Match Exporter

Language: TypeScript - Size: 5.73 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 10 - Forks: 0

NEU-DSG/dailp-encoding

Digital Archive of American Indian Languages Preservation and Perseverance

Language: TypeScript - Size: 7.19 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 21 - Forks: 3

mmmaurer/elfen

A Python package to efficiently extract linguistic features for text/NLP datasets

Language: Python - Size: 5.65 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 2 - Forks: 1

AlaaAlzahrani/Jiwar

Jiwar: A calculator for orthographic, phonological and phonographic neighborhood measures. Supports 40+ languages.

Language: Python - Size: 120 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

jcvasquezc/phonet

Keras-based Python framework to compute phonological posterior probabilities from audio files

Language: Python - Size: 23 MB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 42 - Forks: 18

THU-KEG/ChatLog

⏳ ChatLog: Recording and Analysing ChatGPT Across Time

Language: Jupyter Notebook - Size: 6.17 MB - Last synced at: 2 days ago - Pushed at: 11 months ago - Stars: 97 - Forks: 3

audreycs/ImpScore

A repository for the paper "ImpScore: A Learnable Metric For Quantifying The Implicitness Level of Sentences", accepted to ICLR 2025.

Language: Python - Size: 5.02 MB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 6 - Forks: 0

livingtongues/living-dictionaries

Speeding the availability of language resources for endangered languages. Tools such as this have the power to shift how we think about endangered languages. Rather than perceiving them as being antiquated, difficult to learn and on the brink of vanishing, we see them as modern, easily accessible for learning online in text and audio formats.

Language: TypeScript - Size: 18.6 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 37 - Forks: 2

azagniotov/language-detection

This is a refined and re-implemented version of the archived plugin for ElasticSearch elasticsearch-langdetect, which itself builds upon the original work by Nakatani Shuyo, found at https://github.com/shuyo/language-detection. The aforementioned implementation by Nakatani Shuyo serves as the default language detection component within Apache Solr.

Language: Java - Size: 18.2 MB - Last synced at: 4 days ago - Pushed at: 7 days ago - Stars: 2 - Forks: 0

julienijs/Predictability-of-language-change

How predictable is language change?

Language: R - Size: 8.62 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

LSYS/LexicalRichness

😸 💬 A module to compute textual lexical richness (aka lexical diversity).

Language: Python - Size: 3.46 MB - Last synced at: 11 days ago - Pushed at: over 1 year ago - Stars: 105 - Forks: 20

korpling/graphANNIS

This is a new backend implementation of the ANNIS linguistic search and visualization system.

Language: Rust - Size: 15.2 MB - Last synced at: 11 days ago - Pushed at: about 1 month ago - Stars: 17 - Forks: 1

brucewlee/lingfeat

[EMNLP 2021] LingFeat - A Comprehensive Linguistic Features Extraction ToolKit for Readability Assessment

Language: Python - Size: 56.9 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 127 - Forks: 16

michal-owsiak/swps-university-research-part-II

Python-based linguistic analysis project including natural language processing (NLP) techniques.

Language: Jupyter Notebook - Size: 12.1 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

arjo129/LangCluster

A visualization of cognates in various languages and how they spread

Language: Python - Size: 363 KB - Last synced at: about 22 hours ago - Pushed at: 10 months ago - Stars: 6 - Forks: 2

fidelisrafael/esperanto-analyzer

Morphological and syntactic analysis of Esperanto sentences

Language: Python - Size: 209 KB - Last synced at: 19 days ago - Pushed at: almost 4 years ago - Stars: 32 - Forks: 1

Halvani/Constituent-Treelib

A lightweight Python library for constructing, processing, and visualizing constituent trees.

Language: Jupyter Notebook - Size: 2.67 MB - Last synced at: 11 days ago - Pushed at: 3 months ago - Stars: 67 - Forks: 12

nykolai-d/concreteness-score-of-word

This code takes an English word as input and returns its concreteness score and part of speech, using as reference the Brysbaert et al. (2014) concreteness ratings for 40 thousand generally known English word lemmas.

Language: Jupyter Notebook - Size: 1.36 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Abe-Alefew/LexiLink

The aim of this mini-project is to analyze the textual and phonemic similarities between the Afan Oromo and Somali languages by examining word frequency, overlap, and phonemic distribution.

Language: Python - Size: 75.2 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

public-law/readability

How readable is your text? Provide a text input and get its grade level. Validated against the source data.

Language: Python - Size: 87.9 KB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 11 - Forks: 1

nikisetti01/Hadoop-MapReduce-LetterFrequency-Analysis

Simple example of a Hadoop application that counts letters, with an interesting Romance language analysis

Language: Jupyter Notebook - Size: 2.71 MB - Last synced at: about 2 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 2

kivanc57/RQuests

The RQuest project uses R to analyze textual data, focusing on tasks like calculating word lengths, comparing languages, and extracting linguistic features with udpipe. It includes statistical methods, visualizations, and stochastic simulations, showcasing diverse approaches to text modeling.

Language: R - Size: 6.57 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

ashithapallath/Name-Nationality-Classifier-Using-DeepLearning

This project implements a deep learning-based classifier to identify whether a name is Indian or Non-Indian. By leveraging advanced neural networks to analyze name patterns, the classifier offers accurate predictions, with applications in demographic studies, personalized services, and more.

Language: Jupyter Notebook - Size: 15.6 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

sociocom/limco

limco: a linguistic measure collection

Language: Python - Size: 1.03 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 3

LanguageMachines/foliatest

Test suite for libfolia

Language: C++ - Size: 847 KB - Last synced at: 1 day ago - Pushed at: 4 months ago - Stars: 0 - Forks: 2

unrealtecellp/life

Linguistic Field Data Management and Analysis System [LiFE]

Language: Python - Size: 295 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 5 - Forks: 1

radu-macphee96/Star-Trek-Coding-Script

Star Trek: Exolinguistic Comprehensive Translation Matrix

Size: 5.86 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

Sl1mb0/tran-scraper

Project aimed at scraping, cleaning, and analyzing question type and frequency in linguistic transcripts.

Language: Shell - Size: 12.4 MB - Last synced at: 8 months ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

Deeptiman/php-dom-parser-translation-tool

A simple DOM parser and translation tool using PHP, HTML, and MySQL. The translation model supports English to Odia. There is a built-in dictionary to support the translation.

Language: PHP - Size: 4.62 MB - Last synced at: 17 days ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 1

TALP-UPC/saga

SAGA - Phonetic transcription software for all Spanish variants.

Language: C - Size: 466 KB - Last synced at: 4 months ago - Pushed at: over 4 years ago - Stars: 12 - Forks: 6

AndreasBlombach/Possessiver_Dativ

Data and analyses on the possessive dative

Language: HTML - Size: 18.5 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

SondreWold/lexical_complexity_estimation

Code related to the LREC-COLING 2024 paper "Estimating Lexical Complexity from Document-Level Distributions"

Language: Python - Size: 970 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

Vivek-Tate/Language-Model

Language Model project is a Java-based language and N-Gram model developed for the COM6516 module. It predicts up to two words based on a single word input and provides detailed text analysis statistics. Demonstrating advanced object-oriented programming and design principles, it is a valuable tool for predictive text input and linguistic analysis.

Language: Java - Size: 6.48 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

nickduran/align-linguistic-alignment

Python library for extracting quantitative, reproducible metrics of multi-level alignment between two speakers in naturalistic language corpora.

Language: Python - Size: 54.8 MB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 38 - Forks: 11

0ldriku/CAF-Annotator

Audio annotation tool designed for second language acquisition (SLA) researchers

Language: Jupyter Notebook - Size: 55.4 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

sadielbartholomew/cf-standard-names-linguistics

Lexical & semantic analysis of the CF Conventions Standard Names

Language: Python - Size: 51.2 MB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

Anastassssiia/Surname-analysis

Our project is aimed at studying surnames of Bulgarian-Gagauz origin. Users will be able to analyze their surnames and learn more about their identity. In addition, the tool will allow researchers to study a whole pool of surnames at once.

Language: Jupyter Notebook - Size: 26.4 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 1

Itabashi-don/Shiina

I'm Shii-chan, a high school girl living in Itabashi! ( ˙꒳˙ )

Language: JavaScript - Size: 4.25 MB - Last synced at: 7 days ago - Pushed at: over 6 years ago - Stars: 6 - Forks: 0

julienijs/Linguistic-complexity

Measuring linguistic complexity through information theory

Language: Python - Size: 5.31 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

AtharvaKatre/Numbers-Prophecy

An experiment to demonstrate the biases and predictability of our world.

Language: Python - Size: 5.06 MB - Last synced at: 12 months ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 2

spottolaq/corpus-spotted-2020

This repository houses a comprehensive collection of 14,701 Instagram posts authored by Italian university students between January 2020 and December 2020. These posts offer invaluable insights into the experiences and reflections of students during the challenging period of the COVID-19 lockdown in Italy.

Size: 16.6 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

EvgeniaViskovatykh/Quantitative-analysis-of-semantic-shift

Language: Jupyter Notebook - Size: 14.9 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

sorinmarti/textanalyzer

Java Software to analyze text files.

Language: Java - Size: 268 KB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

ggeraldina/nominative_field_v2.0

Construction of the nominative field of a concept (2017-2018)

Language: HTML - Size: 16.8 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

morganlee123/2GPTEmpathicDialogues

Code, analyses, and data for 'A Linguistic Comparison between Human and ChatGPT-Generated Conversations'

Language: Python - Size: 9.84 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

milosen/arc

ARC: A tool for creating artificial languages with rhythmicity control

Language: Python - Size: 8.39 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

katreparitosh/Discourse-Analytics-of-Political-Speech-Transcripts

Political Discourse Analysis (PDA) of Political Speech Transcripts using Natural Language Processing (NLP)

Language: Jupyter Notebook - Size: 22.7 MB - Last synced at: 12 months ago - Pushed at: almost 4 years ago - Stars: 15 - Forks: 1

kingsdigitallab/dral-django

Distant Reading across Languages

Language: HTML - Size: 21.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

alschmut/code2semantics

Parse software-code for semantic identifier names

Language: Python - Size: 742 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

Ghozayel/Lextale

The LexTALE-package calculates the % correctAv score for the LexTALE-test, English, German and Dutch versions.

Language: R - Size: 204 KB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 4

CoCoLabErica/LIWC2015

A program built on LIWCalike and quanteda to produce LIWC2015 results

Language: R - Size: 12.7 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

amadeusferro/English-language-reading-assistant

Introducing an English reading assistant—a web application using NLP to enhance understanding of English documents. It allows users to upload English PDFs, employing NLP to highlight recurring words and their definitions.

Language: Python - Size: 327 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

jtanwk/nytcrossword

An exploration of New York Times crossword answers from 1994-2017, i.e. the Will Shortz era.

Language: HTML - Size: 7.43 MB - Last synced at: 5 months ago - Pushed at: about 6 years ago - Stars: 122 - Forks: 8

Halvani/TextUnitLib

A Python library that allows easy extraction of a variety of text units within texts...

Language: Python - Size: 31.3 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

phughesmcr/LIWCjs-Dictionary

Parse and manipulate multiple LIWC dictionary files.

Language: TypeScript - Size: 172 KB - Last synced at: 9 days ago - Pushed at: 9 months ago - Stars: 1 - Forks: 1

kthomas4031/Author-Detector

Detects the author based on linguistic signatures

Language: Java - Size: 1.24 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

robert1ridley/linguisticBias

This is the code and data used to produce the results from the EMNLP 2023 paper Addressing Linguistic Bias through a Contrastive Analysis of Academic Writing in the NLP Domain.

Language: Python - Size: 23 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

zyocum/phoible-notebook

Exploratory notebook for inspecting the PHOIBLE data set.

Language: Jupyter Notebook - Size: 322 KB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

parthNJ/Research---Teenage-development-observed-via-their-twitter-tweets

A research paper testing the hypothesis that teenage development can be observed through tweets. Using a dataset of teenagers from Twitter, I was able to confirm that as we develop as humans, our development also shows up on social media through linguistic aspects such as spelling, maturity, bad word usage, acronym usage, and more. Please see the finalpaper for a more detailed explanation. The paper is also soon to be published.

Language: Python - Size: 755 KB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 3 - Forks: 0

alichtman/text-language-identifier

Identify written English, French or Italian text with up to 99% accuracy.

Language: Python - Size: 5.88 MB - Last synced at: 16 days ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

armankazmi/Computational-Linguistics

Projects related to Computational Linguistics

Language: Jupyter Notebook - Size: 232 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Frobeniusnorm/AcademicTextEstimator

Language: Scala - Size: 12.3 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

GiellaLT-Archive/giella-shared 📦

Shared linguistic resources, like names, digits, fst filtering and dependency parsing.

Language: Rich Text Format - Size: 6.44 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 0

STRZGR/Natural-Language-Processing-with-Python-Analyzing-Text-with-the-Natural-Language-Toolkit

My solutions to selected exercises from "Natural Language Processing with Python – Analyzing Text with the Natural Language Toolkit" by Steven Bird, Ewan Klein, and Edward Loper.

Language: Jupyter Notebook - Size: 9.75 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 43 - Forks: 34

AhmedHani/textstyle

Lightweight package for extracting the stylometric features from text corpus

Language: Python - Size: 14.6 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

chepalgsh/lexopedia

Lexopedia: Free Linguistic Search

Language: HTML - Size: 3.23 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

i-amritpal/Feature-based-fake-review-detection

This project relates to my B.Tech final-year project, which investigates the influence of linguistic and sentiment analysis features on detecting fake reviews in e-commerce (Amazon).

Language: Jupyter Notebook - Size: 33.4 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

dativebase/dailp-ingest-clj

DAILP Ingest (of Cherokee language data from Google Sheets)

Language: Clojure - Size: 197 KB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 0

Rutatu/experimental_methods_III

Portfolio assignments in data cleaning, analysis and diagnostic science field employing multilevel and logistic regression models. 2019 fall semester at Aarhus University

Size: 2.02 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

EPgg92/pam

Programme d'Analyse Métrique (Metrical Analysis Program)

Language: Python - Size: 1.04 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 1

alekswael/PoS_NER_tagger

This repo contains a Python script called get_linguistic_features.py - an information extraction script which performs part-of-speech (PoS) tagging and named-entity recognition (NER).

Language: Python - Size: 2.99 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

morioka/alphamalig

ALPHAMALIG(ALPHAbet Multiple ALIGnment)

Language: C - Size: 283 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Text-Mining/Ferdowsi-Annotated-Academic-Linguistic-Corpus

Two linguistic corpora drawn from the collection of articles of Ferdowsi University of Mashhad

Size: 57.6 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 1

Jungmin-YUN-0/Readability_linguistic_feature

iipl project

Language: Jupyter Notebook - Size: 5.55 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

zelewskap/BA_heuristics

Heuristics and cognitive biases in public discourse on climate change - linguistic data analysis

Language: Jupyter Notebook - Size: 3.18 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

nikopetr/LinCFNA

A large dataset which consists of linguistic characteristics of fake and real articles

Language: TeX - Size: 26.4 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

macbre/faroese-corpus

Some Faroese language statistics taken from fo.wikipedia.org content dump

Language: Python - Size: 8.79 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

JessevanLier/feature_based_fake_review_detection

This project relates to my MSc thesis, which investigates the influence of linguistic and sentiment analysis features on detecting fake reviews in e-commerce (Amazon).

Language: Jupyter Notebook - Size: 37.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

sapomaro/repeated-words

An application for finding repetitions of words that share the same root (vanilla JavaScript ES5)

Language: JavaScript - Size: 281 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 1

SkywingsWang/Linguistic-Analysis

Language: R - Size: 8.79 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

julienijs/Speed-of-language-change-and-population

Comparing the speed of language change in more and less densely populated areas

Language: R - Size: 7.96 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

dylan-profiler/tangled-up-in-unicode

Access to the Unicode Character Database (UCD)

Language: Python - Size: 7.2 MB - Last synced at: 8 days ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 6

GiovanniMerici/Big-Data-in-Linguistics

Supporting code for big-data analysis in linguistics

Language: Jupyter Notebook - Size: 1.99 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

jklu-jaipur/Political-Biasness-Detection

Our ML model estimates the bias of a political article based on linguistic features and classifies the article as biased towards the ruling government, biased towards the opposition, or neutral.

Language: Jupyter Notebook - Size: 1.8 MB - Last synced at: 6 months ago - Pushed at: over 4 years ago - Stars: 8 - Forks: 3

phHartl/memeAnalyze

Analyze memes with knowyourmeme

Language: R - Size: 3.28 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

mars-aria/aave_data_analysis

For a human-centered data science assignment, I analyzed how Google's Perspective API tool detects and categorizes Black language data, also known as AAVE (African-American Vernacular English), from Twitter.

Language: Python - Size: 15.6 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

IParraMartin/APA-Automatic-Praat-Analyzer

Script to analyze audio files in Praat using Parselmouth

Language: Jupyter Notebook - Size: 5.86 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

devSuchit/nlp-cky-PCFG

This repository contains an implementation of the CKY parsing for English. (NLP)

Language: Python - Size: 154 KB - Last synced at: almost 2 years ago - Pushed at: about 6 years ago - Stars: 5 - Forks: 1

wordification/front

Front-end for the Wordification project, written with Next and TypeScript.

Language: TypeScript - Size: 170 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

raghav-arora-1998/HipHop-Linguistics-Analysis

Language: HTML - Size: 1.12 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

brianSalk/sql_or_sequal

A Reddit scraper investigating how redditors pronounce SQL

Language: Python - Size: 81.1 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

mzhukovaucsb/emoji_gestures

Research project “Gesture Emoji Twitter Corpus”. Project description, data collection pipeline (tweepy), data preprocessing functions (regex, nltk), 2 datasets for Russian and English published in open access.

Language: Jupyter Notebook - Size: 125 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

sleepyrob/cl-project

Contains the files of the project for the "Computational linguistics" course (A.Y. 2020-21, University of Pisa).

Language: Python - Size: 56.6 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

PeerChristensen/PossessivePronounsTwitter

Mining tweets to explore the linguistic context of gender pronouns

Language: R - Size: 15.6 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

LSYS/lexicaldiversity-example

Hosting MyBinder example for the LexicalRichness package.

Language: Jupyter Notebook - Size: 2.25 MB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Remusqs1/Esperanto-Al-Arkaikam

Instant translator from Esperanto to Arkaikam Esperantom (Archaic Esperanto)

Language: Python - Size: 55.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

mintaka5/bitpusher

store some information. learn it. react to it.

Language: Python - Size: 18.6 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Related Keywords
linguistic-analysis (124), linguistics (43), nlp (22), python (20), natural-language-processing (12), machine-learning (9), linguistic-corpora (8), data-science (7), text-analysis (7), python3 (6), language (6), language-learning (5), data-analysis (5), nltk (5), natural-language (5), lexical-analysis (5), phonetics (4), word2vec (4), corpus-linguistics (4), text-mining (4), ai (4), twitter (4), computational-linguistics (4), language-model (4), sentiment-analysis (4), r (4), visualization (3), liwc (3), linguistics-databases (3), languages (3), linguistics-field (3), text-analytics (3), spacy (3), corpus (3), nlp-parsing (3), text-classification (3), esperanto (3), javascript (3), phonology (3), grammar (3), feature-extraction (3), word-embeddings (2), corpus-processing (2), minority-language (2), second-language-acquisition (2), search-engine (2), corpus-analysis (2), audio (2), grammar-parser (2), data-mining (2), language-change (2), grammar-learning (2), natural-language-understanding (2), data-visualization (2), classification (2), zipfs-law (2), data (2), parsing (2), second-language-research (2), grammar-checker (2), fake-review-detection (2), amazon (2), corpus-data (2), quora (2), language-detection (2), artificial-intelligence (2), sociolinguistics (2), research (2), constituency-parser (2), statistics (2), lexical-analyzer (2), annotation-tool (2), speech-analysis (2), semantic-analysis (2), speech-recognition (2), parser (2), information-retrieval (2), twitter-api (2), readability-scores (2), gestures (2), t-test (2), typescript (2), cherokee-language (2), annotations (2), nlp-datasets (2), text-processing (2), natural-language-interface (2), deep-learning (2), java (2), english (2), chatgpt (2), readability (2), nlp-machine-learning (2), knowledge (2), bigram-model (1), bioinformatics (1), sequence-alignment (1), multiple-sequence-alignment (1), linguistic-relativity (1), matplotlib (1)