Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: korean-text-processing

alexlaurence/DoReMi

🌊 DoReMi uses WaveNet to synthesise speech from online job listings for blind Korean speakers

Language: Python - Size: 11.9 MB - Last synced: 4 days ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

shineware/KOMORAN

Korean Morphological Analyzer by shineware

Language: Java - Size: 111 MB - Last synced: 2 days ago - Pushed: about 1 year ago - Stars: 274 - Forks: 61

mkpoli/koconv

JS library to convert Korean Hangul text

Language: TypeScript - Size: 35.2 KB - Last synced: 20 days ago - Pushed: 21 days ago - Stars: 0 - Forks: 0

lovit/soynlp

A Python library for Korean natural language processing. Provides word extraction, tokenization, part-of-speech tagging, and preprocessing.

Language: Python - Size: 34 MB - Last synced: 17 days ago - Pushed: about 1 month ago - Stars: 904 - Forks: 181

bab2min/Kiwi

Kiwi (intelligent Korean morphological analyzer)

Language: C++ - Size: 390 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 360 - Forks: 42

open-korean-text/open-korean-text

Open Korean Text Processor - An Open-source Korean Text Processor

Language: Scala - Size: 32.7 MB - Last synced: 25 days ago - Pushed: 2 months ago - Stars: 597 - Forks: 94

rasoio/daon

Hangul morphological analyzer

Language: Java - Size: 68.7 MB - Last synced: 17 days ago - Pushed: over 5 years ago - Stars: 20 - Forks: 5

okikirmui/handic

HanDic: a morphological analysis dictionary for contemporary Korean

Language: Perl - Size: 134 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

hmmhmmhm/hangul-search-js

🇰🇷 Simple Korean text search module

Language: TypeScript - Size: 1.49 MB - Last synced: 15 days ago - Pushed: over 2 years ago - Stars: 25 - Forks: 1

tenebo/g2pk2 Fork of harmlessman/g2pkk

Updated fork of g2pk

Language: Python - Size: 66.4 KB - Last synced: 13 days ago - Pushed: 9 months ago - Stars: 6 - Forks: 1

shineware/PyKOMORAN

(Beta) PyKOMORAN wraps KOMORAN for Python using Py4J.

Language: Python - Size: 34.8 MB - Last synced: 11 days ago - Pushed: about 3 years ago - Stars: 41 - Forks: 5

ychoi-kr/ko-prfrdr

Utils for Korean proofreaders

Language: Python - Size: 23.9 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 52 - Forks: 7

lovit/customized_konlpy

Customized KoNLPy - Korean Natural Language Processing Toolkit KoNLPy wrapping code

Language: Python - Size: 929 KB - Last synced: 7 days ago - Pushed: over 5 years ago - Stars: 126 - Forks: 25

lovit/KR-WordRank

A library that automatically extracts words/keywords from Korean text using unsupervised learning

Language: Python - Size: 4.55 MB - Last synced: 4 months ago - Pushed: about 2 years ago - Stars: 341 - Forks: 57

storidient/KoBookNLP

Natural Language Processing Library for Korean Literary Text. (Will be open in February, 2024)

Language: Python - Size: 727 KB - Last synced: 2 months ago - Pushed: 4 months ago - Stars: 4 - Forks: 0

selfcontrol7/Korean_Voice_Phishing_Detection

All code implemented for the Korean voice phishing detection papers

Language: Jupyter Notebook - Size: 146 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 5 - Forks: 3

kimcore/Josa.kt

A Kotlin library that automatically corrects josa (Korean postpositional particles).

Language: Kotlin - Size: 92.8 KB - Last synced: 23 days ago - Pushed: almost 2 years ago - Stars: 12 - Forks: 1

jeongukjae/korean-spacing-model

A Korean sentence spacing (deletion/insertion) model. Written so that you can train it yourself after preparing the data.

Language: Python - Size: 2.21 MB - Last synced: 3 months ago - Pushed: almost 2 years ago - Stars: 54 - Forks: 3

open-korean-text/elasticsearch-analysis-openkoreantext

Korean analysis plugin that integrates open-korean-text module into elasticsearch.

Language: Java - Size: 13.1 MB - Last synced: 26 days ago - Pushed: 12 months ago - Stars: 126 - Forks: 22

kimcore/inko.kt

🇰🇷 Open-source Kotlin library that converts text typed on an English keyboard layout to Hangul, and text typed on a Korean layout to English (Implementation of inko.js)

Language: Kotlin - Size: 89.8 KB - Last synced: 23 days ago - Pushed: almost 2 years ago - Stars: 13 - Forks: 0

coarchive/hangul-unicode 📦

A library to process and standardize hangul characters

Language: JavaScript - Size: 2.34 MB - Last synced: 6 months ago - Pushed: over 5 years ago - Stars: 1 - Forks: 0

NLP-kr/tensorflow-ml-nlp-tf2

Practice materials for "Natural Language Processing Starting with TensorFlow 2 and Machine Learning" (from logistic regression to BERT and GPT3)

Language: Jupyter Notebook - Size: 200 MB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 267 - Forks: 135

oneonlee/KR-Emotional-Analysis

2023 National Institute of Korean Language AI Language Proficiency Evaluation: emotion analysis task

Language: Jupyter Notebook - Size: 13.1 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 1

SOMJANG/Mecab-ko-for-Google-Colab

Use Mecab Library(NLP Library) in Google Colab

Language: Shell - Size: 1.68 MB - Last synced: 7 months ago - Pushed: 11 months ago - Stars: 61 - Forks: 29

bytecell/slotminer

Tool for slot extraction from text

Language: Python - Size: 132 KB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 16 - Forks: 2

Keracorn/geulstagram

📷 Building a Geulstagram dataset

Language: Python - Size: 16.4 MB - Last synced: 9 days ago - Pushed: over 1 year ago - Stars: 14 - Forks: 9

minseok0809/korean-sentence-segementation

AIHub Korean data preprocessing: Korean sentence segmentation

Language: Jupyter Notebook - Size: 2.61 MB - Last synced: 4 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0

affjljoo3581/langumo-ko

Collection of langumo parsers for Korean corpora

Language: Python - Size: 27.3 KB - Last synced: 29 days ago - Pushed: over 3 years ago - Stars: 7 - Forks: 2

steamb23/Naramal

This library is designed for flexible Korean processing in C#.

Language: C# - Size: 187 KB - Last synced: 28 days ago - Pushed: almost 4 years ago - Stars: 11 - Forks: 0

SohyeonKim-dev/Textinit

Korean text generator using GPT-3 and MLKit

Language: Swift - Size: 272 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 3 - Forks: 0

fingeredman/teanaps

An open-source Python library for natural language processing and text analysis.

Language: Jupyter Notebook - Size: 62.5 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 92 - Forks: 11

passing2961/KMRE

Korean Movie Review Emotion (KMRE) Dataset

Size: 23.1 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 17 - Forks: 0

yc9701/pansori-tedxkr-corpus

Korean ASR Corpus generated from TEDx talks

Size: 163 MB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 26 - Forks: 4

yonkmanjl/hangul-convert

Converts English words to the Korean alphabet

Language: Java - Size: 863 KB - Last synced: about 1 year ago - Pushed: over 6 years ago - Stars: 1 - Forks: 0

ttop32/coqui_tts_korea

Korean TTS using coqui TTS (glowtts and multiband melgan)

Language: Jupyter Notebook - Size: 2.79 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 25 - Forks: 8

ttop32/KoGPT2novel

Generate novel text - novel finetuned from skt KoGPT2 base v2 - Korean

Language: Jupyter Notebook - Size: 138 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 8 - Forks: 2

this-is-my-life/KoreanScript 📦

Coding? Let's start in Korean! "HangulScript"

Language: JavaScript - Size: 866 KB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 3 - Forks: 0

ni-inha/topic-modeling-of-mom-community

Topic modeling analysis of the breastfeeding Q&A board of the Naver Cafe "Momsholic Baby"

Language: Jupyter Notebook - Size: 532 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

ni-inha/De-identification-of-Korean-names-in-clinical-notes

Rule-based de-identification of Korean names in EMR clinical notes

Language: Python - Size: 5.86 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

Astro36/kotka

Korean Obfuscation ToolKit Advanced

Language: Python - Size: 36.1 KB - Last synced: 22 days ago - Pushed: almost 4 years ago - Stars: 5 - Forks: 1

hmmhmmhm/tetrapod

😊 Improved swear word detection module

Language: JavaScript - Size: 130 KB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 25 - Forks: 5

bab2min/kiwi-gui

C# API for Kiwi

Language: C# - Size: 167 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 10 - Forks: 4

tgisaturday/CNN-text-classification

multi-class text classification using text-CNN and Konlpy

Language: Python - Size: 7.81 KB - Last synced: about 1 year ago - Pushed: about 6 years ago - Stars: 6 - Forks: 2

lovit/crf_postagger

Korean Part-of-Speech Tagger using Conditional Random Field (CRF)

Language: Python - Size: 68.7 MB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 11 - Forks: 4

usik/usik_nlp

basic framework for NLP tasks.

Language: Python - Size: 45.9 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

shineware/RKOMORAN

RKOMORAN is a KOMORAN wrapper for R users

Language: R - Size: 15 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 15 - Forks: 0

abdalimran/pykotokenizer

PyKoTokenizer is a Korean text tokenizer for Korean Natural Language Processing tasks.

Language: Python - Size: 10.6 MB - Last synced: 5 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 1

mohenjo/Hangul.Net

.NET Framework class library for Hangul processing

Language: C# - Size: 21.5 KB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 1 - Forks: 0

Kiminjo/Extract-company-preference-factors Fork of UnstructuredDataProject/Unstructured-Data

Based on company review data, company preference factors are derived. This project was conducted as a part of the "Unstructured Data Analysis" class at the Department of Data Science, Seoul National University of Science and Technology

Language: Jupyter Notebook - Size: 4.74 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

ossteam8/LDA-TextRank-keyword

Keyword extractor using LDA and TextRank combined

Language: Jupyter Notebook - Size: 44.1 MB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 4 - Forks: 0

jaeyung1001/NLP Fork of ahroobe/NLP

Natural Language Processing for Korean.

Language: Jupyter Notebook - Size: 11.3 MB - Last synced: about 1 year ago - Pushed: almost 6 years ago - Stars: 1 - Forks: 2

codebasic/pyko

Korean Text Processing using Python

Language: Python - Size: 6.49 MB - Last synced: 4 days ago - Pushed: over 3 years ago - Stars: 3 - Forks: 0

mohenjo/PyHangulUtils

Python module for processing Hangul characters/strings

Language: Python - Size: 15.6 KB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 2

woodongk/geulstagram Fork of Keracorn/geulstagram

Building a Geulstagram dataset

Language: Python - Size: 6.96 MB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 0 - Forks: 0