An open API service providing repository metadata for many open source software ecosystems.

Topic: "language-modeling"

rkapur102/language_phylogeny_feature_simulations

Simulations of language evolution and phylogenetic/feature dynamics, as described in: "Kapur, Rhea and Phillip Rogers. 2020. Modeling language evolution and feature dynamics in a realistic geographic environment. 28th International Conference on Computational Linguistics (COLING 2020), Barcelona, Spain."

Language: R - Size: 11.3 MB - Last synced at: 8 months ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 0

arianhosseini/MemArchs-in-RNNLM

attempt at implementing "Memory Architectures in Recurrent Neural Network Language Models" as a part of the ICLR 2018 reproducibility challenge

Language: Python - Size: 43.9 KB - Last synced at: 3 months ago - Pushed at: over 7 years ago - Stars: 5 - Forks: 2

Ingenious-c0der/Beluga

An esoteric programming language based on Turing Machines

Language: C++ - Size: 163 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

Mawazoni/KiswahiliModuleNooJ

he Kiswahili module is designed for NooJ linguistic development environment software and corpus processor. It goes with a 45000 words Kiswahili-English dictionary and morphological grammars that detect all tenses of Kiswahili verbs. Mathieu ROY et al. 2017. 2018. CC BY NC SA 3.0

Size: 12.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

zhaoyanpeng/syntax-induction-in-deep-learning-era

Unsupervised Learning of Syntax

Size: 6.84 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 1

kazuki-irie/dct-fast-weights

PyTorch implementation of DCT fast weight RNNs

Language: Python - Size: 11.7 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0

senderle/lexpart

Companion code for "Toward a Thermodynamics of Meaning," CHR 2020

Language: Python - Size: 26.7 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 0

dlsucelt/Transformer

Language Modeling + Transfer Learning demo using the GPT-2 Transformer architecture.

Language: Jupyter Notebook - Size: 18.6 KB - Last synced at: almost 2 years ago - Pushed at: about 6 years ago - Stars: 4 - Forks: 3

jtejido/basset-ir

Basset IR - An Information Retrieval library.

Language: PHP - Size: 2.19 MB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 4 - Forks: 1

anunayarunav/MCMC-Deciphering

A markov chain monte carlo deciphering method based on http://www-users.york.ac.uk/~sbc502/decode.pdf

Language: Jupyter Notebook - Size: 3.46 MB - Last synced at: almost 2 years ago - Pushed at: over 8 years ago - Stars: 4 - Forks: 6

ishan00/translation-for-code-switching-acl

Official repository for the paper titled "From Machine Translation to Code-Switching: Generating High-Quality Code-Switched Text" accepted at ACL 2021

Language: Python - Size: 8.59 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 2

fgaim/Tigrinya-PLMs

Resources for the paper: Monolingual Pre-trained Language Models for Tigrinya

Language: Python - Size: 55.7 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

nkcr/overlap-ml πŸ“¦

Reference implementation of the paper "Alleviating Sequence Information Loss with Data Overlapping and Prime Batch Sizes" - CoNLL 2019

Language: Python - Size: 2.44 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

duanyiqun/ICOBase-whitepaper

A simple dataset of ICO whitepapers with annotation of the quality of associated projects. The annotation is sparse with cross validation of annotators

Language: Io - Size: 14.3 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 6

Hritikbansal/RNNs_SVA_OOD

This is an official pytorch implementation for the experiments described in the paper - Can RNNs trained on harder subject-verb agreement instances still perform well on easier ones?

Language: Python - Size: 53.6 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

sedrickkeh/PINEAPPLE

Dataset and code for PINEAPPLE: Personifying INanimate Entities by Acquiring Parallel Personification data for Learning Enhanced generation (COLING 22)

Language: Python - Size: 37.1 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0

Atenrev/comics-dialogue-generation

PyTorch code for Automatic generation of comic dialogues. The purpose of this project is to generate subsequent dialogues given a multimodal context.

Language: Python - Size: 727 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0

asgaardlab/21-markos-test_case_improvement_framework-code

Repository with the source code of our experiments for an automated NLP-based framework to improve test cases written in natural language

Language: Jupyter Notebook - Size: 65.4 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0

donfaq/legal-space-research

Russian law meets language modelling

Language: Jupyter Notebook - Size: 96.2 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 1

ArmanBehnam/NLP

Natural language processing including Datasets,Farsi NLP, Automated Essay Scoring, Automatic Speech Recognition and etc.

Language: Jupyter Notebook - Size: 512 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

nytud/emLam

Preprocessing scripts for Hungarian Language Modeling

Language: Python - Size: 239 KB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 2

samridhishree/Deeplearning-Models

Deep learning models in Python

Language: Python - Size: 266 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

Websail-NU/seqmodel

Sequence models implementation in Tensorflow.

Language: Python - Size: 1.31 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 3

namanrajpal/Java-Based-Compiler-Using-ASM

Made using Visitor Design Pattern, JVM compiler comprising of Scanner,Parser, Type Checker and JVM Byte Code Generator using ASM.

Language: Java - Size: 49.8 KB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 3 - Forks: 1

Madjakul/HALvesting

Harvests open research papers from HAL (Hyper Articles en Ligne).

Language: Python - Size: 490 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

nikitas-theo/BERTtimeStories

Code implementation for our paper "BERTtime Stories: Investigating the Role of Synthetic Story Data in Language Pre-training" as part of the 2024 BabyLM Challenge

Language: Python - Size: 1000 Bytes - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

KIST-CSRC/Text-to-BatteryRecipe

Official source codes for implementing "Text-to-Battery Recipe: A language modeling-based protocol for automatic battery recipe extraction and retrieval"

Language: Python - Size: 27.9 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

BitcoinChatGPT/DeserializeSignature-Vulnerability-Algorithm

Learn about the DeserializeSignature vulnerability in Bitcoin's ECDSA signature algorithm and its potential impact on the security of Bitcoin transactions. Discover how the vulnerability can be exploited and what steps are being taken to mitigate the risk. Stay informed on the latest developments in Bitcoin security.

Language: Jupyter Notebook - Size: 1.72 MB - Last synced at: 16 days ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

NLP2CT/TempoSum

Can LMs Generalize to Future Data? An Empirical Analysis on Text Summarization

Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

Shashi456/Deep-Learning-Research

Summaries of papers on Deep Learning, Natural Language Processing, Computer vision

Language: Jupyter Notebook - Size: 2.06 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

FilipCore034/Machine-learning-Datasets

Discover the Machine learning datasets! Diverse content for πŸŽ“ education, πŸ“Š research, πŸ‘₯ non-profit use and experimenting. Download, merge files for πŸ“ convenience. Contribute to enhance language modeling, πŸ€– machine learning, πŸŽ“ education, data analysis, and πŸ§ͺ software development. Note: Content sourced for non-profit, educational use. Enjoy! ;)

Language: PowerShell - Size: 138 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

AndyCheang/TempoSum

TempoSum: Evaluating the Temporal Generalization of Abstractive Summarization

Size: 6.84 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

pharo-ai/ngram πŸ“¦

N-gram functionality for Pharo

Language: Smalltalk - Size: 40 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 2

DarshanAdiga/idiom-principle-on-magpie-corpus

Idiom Principle on MAGPIE dataset

Language: Python - Size: 246 MB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 1

maxine-red/moo_ebooks Fork of mispy-archive/twitter_ebooks

A minimalistic ebook library

Language: Ruby - Size: 2.22 MB - Last synced at: 12 days ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

fededevi/DeScript

A simple interpreted programming language and expression evaluator.

Language: Java - Size: 1.02 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

SuddenlyPineapple/graphLanguageAnalyzer

Graph Natural (or Custom) Language Modeling App

Language: JavaScript - Size: 556 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 8

Coder1400/LanguageIdentification

Identify between English, French, and Italian with 99% accuracy. Uses language modeling techniques including LaPlace and Good-Turing smoothing.

Language: Python - Size: 470 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 1

prakruti-joshi/Natural-Language-Processing

Language modeling, LSTM, Attention models, Transformers, Parsing and Tagging in NLP, EM algorithm, Auto-encoders implemented in Python using PyTorch. The assignments are part of the course Natural Language Processing.

Language: Python - Size: 2.8 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 1

Subangkar/Sequence-Models-Deeplearning.ai-Coursera-Assignments

Notebooks of programming assignments of Sequence Models course of deeplearning.ai on coursera in May-2020

Language: Jupyter Notebook - Size: 9.08 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 4

dellison/WikiText.jl

Julia interface to the WikiText dataset.

Language: Julia - Size: 14.6 KB - Last synced at: 23 days ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 1

kplachkov/Deep-Learning

Essential deep learning algorithms, concepts, examples and visualizations with TensorFlow. Popular and custom neural network architectures. Applications of neural networks.

Language: Jupyter Notebook - Size: 55.5 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 3

aquatiko/Language-Model-Shakespere-generator

Character level LSTM based language generator based on Shakespere corpus.

Language: Jupyter Notebook - Size: 58.6 KB - Last synced at: 3 months ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 0

dhwajraj/lm-finetune-NER

Language Model Fine-tuning for Named Entity Recognition (and other sequence tagging tasks)

Language: Python - Size: 29.3 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 0

SNUDerek/lm_perplexity_bootstrapping

demo of domain corpus bootstrapping using language model perplexity

Language: Jupyter Notebook - Size: 61.5 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 3

Websail-NU/adaptive_lm

Implementation of recurrent neural network language model (deprecated)

Language: Python - Size: 163 KB - Last synced at: over 2 years ago - Pushed at: about 8 years ago - Stars: 2 - Forks: 4

bjam24/agh-natural-language-processing

This respository contains projects made for the NLP course at the AGH UST in 2024 / 2025. They received maximum grade 5.0.

Language: Julia - Size: 25 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Helsinki-NLP/lm-vs-mt

A Comparison of Language Modeling and Translation as Multilingual Pretraining Objectives

Language: Python - Size: 1.15 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

jwchoi95/Text-to-BatteryRecipe

Official source codes for implementing "Text-to-Battery Recipe: A language modeling-based protocol for automatic battery recipe extraction and retrieval"

Language: Python - Size: 5.39 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 1

CODING-Enthusiast9857/Gemini_LLM_Application

It is an innovative repository housing a sophisticated Large Language Model (LLM) project, showcasing the intersection of advanced natural language processing and cutting-edge artificial intelligence. This repository serves as a comprehensive platform for the development, experimentation, and application of state-of-the-art language models.

Language: Python - Size: 8.79 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

albinjm/FinSpeech

A Speech Recognition Framework for Banking Interactions using Convolutional Recurrent Dense Neural Networks and Language Models

Language: Jupyter Notebook - Size: 188 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

BitcoinChatGPT/Gauss-Jacobi-Method-Algorithm

To use a pre-trained Bitcoin ChatGPT AI model to learn this method, you would first need to provide the model with a clear and concise description of the algorithm, including its purpose, prerequisites, and the mathematical principles behind it. How To Get PrivateKey of Bitcoin Wallet Address.

Language: Jupyter Notebook - Size: 1.7 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 3

ohmthanap/CS584_Natural-Language-Processing

Learned knowledge and techniques in Natural Language Processing and also related tools: Python, Pytorch, Jupyter Notebook, Google Colab, RNN, CNN, Reinforcement Learning, LSTM, Language Modeling

Language: Jupyter Notebook - Size: 185 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

phueb/EntropicStartTheory

Research code used to study learning dynamics of RNN language models

Language: Python - Size: 70 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

ipekseyitoglu/Morse_Translate_Project

Language: Python - Size: 6.84 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

chao-ji/tf-transformerxl-language-model

Tensorflow 2 implementation of TransformerXL for language modeling

Language: Python - Size: 113 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 1

spydaz/ClassTokenizer

Basic Tokenizer - Creates tokens - enabling for creation of personal syntax; removal of unwanted characters etc

Language: Visual Basic .NET - Size: 16.6 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 2

saeeddhqan/tiny-transformer

Tiny transformer models implemented in pytorch.

Language: Python - Size: 11.8 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

ivkos/markovski

Markov chains for Node.js

Language: JavaScript - Size: 25.4 KB - Last synced at: 10 days ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

vaddhiparthy/GPT

Fine-tuning GPT-2 models with custom text corpora, utilizing Hugging Face's Transformers library and advanced training techniques for sophisticated text generation applications.

Language: Python - Size: 15.6 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

paule32/Kurt_Goedel_Experiment

This is just a fun Project that is faced from the "Kurt Goedel" Therom of incomplete sentences. Produced with SBCL - a Common Lisp implementation. Free for non-profit usage.

Language: Common Lisp - Size: 44.9 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

filippogiruzzi/nlp_chatbot

Realistic Chatbot based on NLP & TensorFlow

Language: Python - Size: 92.8 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

davidvos/prefix-tuning-for-data-management

Parameter-efficient automation of data wrangling tasks with prefix-tuning and the T5 language model.

Language: Python - Size: 8.2 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

SantoshPenugurthi/Topic-Modelling-on-Telugu-English-Code-mixed-data-using-LDA-

This is Natural Language Processing project utilized the LDA model to analyze Telugu-English code-mixed data, enabling effective topic modeling and gaining valuable insights from the text

Language: Jupyter Notebook - Size: 1.99 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

daskol/lsp-lm

Language Model as a Language Server

Language: Python - Size: 205 KB - Last synced at: 4 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

LucHayward/low_resource_lm Fork of StuartMesham/low_resource_lm

Repo containing only my code for Honours 2020 project at UCT on Low Resource language Modelling for African Languages with RNN/LSTM models

Language: Jupyter Notebook - Size: 4.51 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

AnantShankhdhar/AI-Rap-Lyric-Generator

Generating Rap Lyrics from Artist Name and Song Title using GPT2

Language: Python - Size: 914 KB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

fcakyon/gpt2-shakespeare

A tutorial on GPT2 language model training with texts from Shakespeare

Language: Jupyter Notebook - Size: 25.4 KB - Last synced at: 3 months ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 2

Jeevesh8/AutoRegressive-MLM Fork of deterministic-algorithms-lab/NLP-Journey

This repository extends a basic MLM implementation to allow for efficiently conditioning on chained previous texts, in a tree; for e.g., a Reddit thread.

Language: Python - Size: 1.91 MB - Last synced at: 4 months ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

freha-mezzoudj/PhD_works1

My PhD topic is about Automatic Speech Recognition and Language Modeling. My works are presented here to help the Reserach community, thanks !

Size: 12.5 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

mrm8488/electricidad-base

Spanish Electra

Size: 3.91 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

pranavajitnair/DeFINE-AWD-LSTM

PyTorch implementation of DeFINE word embeddings with AWD-LSTM for language modeling. The input and output embeddings for AWD-LSTMM are tied

Language: Python - Size: 16.6 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

aaronbae/AnaQA

Multi-Hop Question Answering system based on DecompRC and Dr.QA

Language: Python - Size: 837 KB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

askintution/simpletransformers Fork of ThilinaRajapakse/simpletransformers

Transformers for Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI

Size: 17.6 MB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

LenaMullerFrommeyer/beyond-consistency

Code "Beyond consistency: Contextual dependency of language style in monologue and conversation" (MΓΌller-Frommeyer, Kauffeld and Paxton, 2020, Cognitive Science)

Language: R - Size: 58 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

shuwang127/NLP-FFNN

Feed Forward Neural Network for Sentiment Classification and Language Modeling

Language: Python - Size: 1.04 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

ajpar94/flair-extra

A collection of NLP related scripts and notebooks for using the framework flair (https://github.com/flairNLP/flair)

Language: Jupyter Notebook - Size: 1.1 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

racinmat/gpt-2 Fork of nshepperd/gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

Language: Python - Size: 12.6 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

afiliot/Text-Generation

NLP project on Language Modelling - ENSAE ParisTech

Language: Jupyter Notebook - Size: 24.7 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

MirunaPislar/Do-LSTMs-learn-Syntax

Evaluate language models on syntactic tasks.

Language: Python - Size: 1.58 MB - Last synced at: almost 2 years ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

young-zonglin/yangzl-deep-lm-keras

Language modeling using several deep models.

Language: Python - Size: 4.19 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

markab4/Language-Modeling-in-Python

Python program to train several language models and evaluate them on two test corpora.

Language: Python - Size: 1.95 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

ys1998/handwriting-synthesis

Handwriting synthesis and text prediction using RNNs

Language: Python - Size: 162 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

robertah/nlu_project

Project for Natural Language Understanding course - ETH Zurich, Spring 2018

Language: Python - Size: 4.34 MB - Last synced at: 7 months ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 1

kalifou/ri_tme1

Information retrieval - assignments for course at UPMC - Paris 6

Language: Jupyter Notebook - Size: 23.1 MB - Last synced at: 3 months ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

jacerong/normalesp

An open-source spell checker for texts written in Spanish, with a focus on tweets.

Language: Python - Size: 4.85 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 2

jrdodson/unigram-lm

Simple language model for computing unigram frequencies.

Language: Java - Size: 4.43 MB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 1

chen0040/java-plsa

Package provides the java implementation of probabilistic latent semantic analysis (pLSA)

Language: Java - Size: 1.16 MB - Last synced at: 3 months ago - Pushed at: about 8 years ago - Stars: 1 - Forks: 1

jancarauma/nanoGPT

nanoGPT - A simple GPT-Style Transformer from Scratch in PyTorch

Language: Python - Size: 9.77 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

MyDarapy/gpt-1-from-scratch

Rewriting and pretraining GPT-1 from scratch. Implementing Multihead Attention (MHA) in pyTorch from the original paper Improving Language Understanding by Generative Pre-Training (https://cdn.openai.com/research-covers/language-unsupervised/language_understanding_paper.pdf)

Language: Python - Size: 44.9 KB - Last synced at: 10 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

BitcoinChatGPT/Jacobian-Curve-Vulnerability-Algorithm

Discover the implications of the Jacobian Curve vulnerability in elliptic curve cryptography, particularly its impact on the Elliptic Curve Digital Signature Algorithm (ECDSA). This article explores how attackers can exploit this flaw to generate fraudulent transactions, create fake signatures, and compromise the integrity of blockchain systems.

Language: Jupyter Notebook - Size: 1.72 MB - Last synced at: 2 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1

PraveenKumar-Rajendran/Udacity-Natural-Language-Processing-Engineer-Nanodegree

Projects Implemented for the Udacity Natural Language Processing Engineer Nanodegree Program

Size: 1.42 MB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

sayhitosandy/Transformer-Speech-Classifier-LM

Implementation and exploration of transformer models for speech segment classification and language modeling.

Language: Jupyter Notebook - Size: 1.83 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

ShiningLab/PromptSub

This repository is for the paper Lexical Substitution as Causal Language Modeling. In Proceedings of the 13th Joint Conference on Lexical and Computational Semantics (*SEM 2024), Mexico City, Mexico. Association for Computational Linguistics.

Language: Python - Size: 4.32 MB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

LarsHill/pointer-guided-pre-training

Code for the ECML 2024 paper "Pointer-Guided Pre-Training: Infusing Large Language Models with Paragraph-Level Contextual Awareness"

Language: Python - Size: 63.5 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

BitcoinChatGPT/Fuzzing-Vulnerability-Algorithm

Learn about the Fuzzing vulnerability in Bitcoin's ECDSA signature algorithm and its potential impact on the security of Bitcoin transactions. Discover how the vulnerability can be exploited and what steps are being taken to mitigate the risk. Stay informed on the latest developments in Bitcoin security.

Language: Jupyter Notebook - Size: 1.7 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

ohmthanap/CS583_Deep-Learning

Learned knowledge and techniques in Deep Learning and also related tools: Python, Pytorch, Jupyter Notebook, RNN, CNN, Reinforcement Learning, LSTM, BERT, Language Modeling

Language: Jupyter Notebook - Size: 127 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

khlaifiabilel/Awesome-Machine-Learning-On-Source-Code-V2

A curated list of awesome research papers, datasets and software projects devoted to machine learning and source code.

Size: 316 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 2

dimits-ts/text_analytics

Language Modelling (text generation, spell correction) and Sentiment Analysis / POS Tagging with MLP, RNN, CNN and BERT models and LLM prompting

Language: Jupyter Notebook - Size: 69.1 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

harsha-desaraju/HMM-Model-for-POS-tagging

Parts of Speech (POS) tagging for English using Hidden Markov Model.

Language: Python - Size: 6.5 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Related Topics
nlp 70 natural-language-processing 62 deep-learning 47 pytorch 37 machine-learning 33 python 29 language-model 24 tensorflow 21 text-generation 20 transformers 20 lstm 18 recurrent-neural-networks 16 rnn 15 dataset 14 artificial-intelligence 13 transformer 11 neural-networks 11 speech-recognition 10 ai 10 word-embeddings 10 language 9 bert 8 machine-translation 8 llm 7 natural-language-generation 7 transfer-learning 7 attention-mechanism 7 deep-neural-networks 6 gpt-2 6 chatgpt 6 language-processing 6 text-classification 6 nlp-machine-learning 6 gpt2 6 classification 5 sentiment-analysis 5 datasets 5 sequence-to-sequence 5 text-processing 5 protein-sequences 5 question-answering 5 linguistics 5 named-entity-recognition 5 language-generation 5 generative-models 5 colab-notebook 5 openai 5 ngrams 5 python3 5 n-grams 5 data-analysis 4 gru 4 huggingface 4 shakespeare 4 computational-linguistics 4 natural-language-understanding 4 reinforcement-learning 4 paper 4 transformer-xl 4 nlg 4 lstm-model 4 bitcoin-wallet 4 automatic-speech-recognition 4 asr 4 benchmark 4 text-summarization 4 word2vec 4 large-language-models 4 t5 4 bitcoin 4 chatbot 4 topic-modeling 3 transformer-architecture 3 information-retrieval 3 code-switching 3 pretraining 3 bert-model 3 self-attention 3 llms 3 gpt 3 text-to-image 3 languages 3 pos-tagging 3 neural-network 3 keras 3 deeplearning 3 jupyter-notebook 3 seq2seq 3 attention-model 3 rnn-model 3 few-shot-learning 3 language-detection 3 protein-structure 3 programming 3 research 3 generalization 3 cnn 3 gpt-3 3 in-context-learning 3 java 3