An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: document-classification

Neeraj652/AI-_Document_Extraction

Document Intelligence: Demonstrating AI and OCR capabilities in the the system

Language: Python - Size: 101 MB - Last synced at: about 18 hours ago - Pushed at: about 19 hours ago - Stars: 0 - Forks: 0

kk7nc/Text_Classification

Text Classification Algorithms: A Survey

Language: Python - Size: 13.8 MB - Last synced at: about 5 hours ago - Pushed at: about 1 month ago - Stars: 1,811 - Forks: 543

devrahulbanjara/Document-Verification-System

The system that is designed to assist in verifying the authenticity of Nepali documents (Citizenship, Driving License, Passport) against the government database.

Language: Jupyter Notebook - Size: 4.01 GB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 2 - Forks: 1

kk7nc/HDLTex

HDLTex: Hierarchical Deep Learning for Text Classification

Language: Python - Size: 32 MB - Last synced at: about 18 hours ago - Pushed at: 7 months ago - Stars: 271 - Forks: 65

sergioburdisso/pyss3

A Python package implementing a new interpretable machine learning model for text classification (with visualization tools for Explainable AI :octocat:)

Language: Python - Size: 102 MB - Last synced at: 4 days ago - Pushed at: 4 months ago - Stars: 341 - Forks: 44

sgrvinod/a-PyTorch-Tutorial-to-Text-Classification

Hierarchical Attention Networks | a PyTorch Tutorial to Text Classification

Language: Python - Size: 712 KB - Last synced at: 6 days ago - Pushed at: almost 5 years ago - Stars: 247 - Forks: 55

JPLeoRX/detectron2-publaynet

Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset

Language: Python - Size: 7.76 MB - Last synced at: 4 days ago - Pushed at: about 2 years ago - Stars: 48 - Forks: 7

sraashis/diseaseprediction

Undergrad final year project to predict diseases given any text symptoms.

Language: Java - Size: 1.4 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 26 - Forks: 7

vietnh1009/Hierarchical-attention-networks-pytorch

Hierarchical Attention Networks for document classification

Language: Python - Size: 48.5 MB - Last synced at: 10 days ago - Pushed at: over 3 years ago - Stars: 394 - Forks: 104

vietnh1009/Character-level-cnn-pytorch

Character-level CNN for text classification

Language: Python - Size: 53.3 MB - Last synced at: 10 days ago - Pushed at: over 3 years ago - Stars: 54 - Forks: 17

vietnh1009/Character-level-cnn-tensorflow

Character-level CNN for text classification

Language: Python - Size: 84.3 MB - Last synced at: 10 days ago - Pushed at: over 3 years ago - Stars: 28 - Forks: 11

vietnh1009/Very-deep-cnn-pytorch

Very deep CNN for text classification

Language: Python - Size: 120 MB - Last synced at: 10 days ago - Pushed at: over 3 years ago - Stars: 36 - Forks: 14

vietnh1009/Very-deep-cnn-tensorflow

Very deep CNN for text classification

Language: Python - Size: 17.3 MB - Last synced at: 10 days ago - Pushed at: over 3 years ago - Stars: 20 - Forks: 8

brightmart/bert_language_understanding

Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN

Language: Python - Size: 16 MB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 964 - Forks: 211

danfmaia/hybrid-legal-doc-classifier

Production-ready zero-shot legal document classifier using Mistral-7B LLM and FAISS validation, built with FastAPI for high-performance document classification.

Language: Python - Size: 301 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

vickshan001/Friends-Character-Classifier-Vector-Semantics-NLP

NLP coursework using vector space semantics to classify Friends character dialogue. Includes TF-IDF, POS, sentiment, and context-aware features.

Language: Jupyter Notebook - Size: 339 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Willgnner-Santos/DPE-Legal-Doc-Classification-Pipeline

The results are drawn from experiments on the classification of legal documents using LLMs in a real-world institutional setting

Language: Jupyter Notebook - Size: 42 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

luopeixiang/textclf

TextClf :基于Pytorch/Sklearn的文本分类框架,包括逻辑回归、SVM、TextCNN、TextRNN、TextRCNN、DRNN、DPCNN、Bert等多种模型,通过简单配置即可完成数据处理、模型训练、测试等过程。

Language: Python - Size: 281 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 239 - Forks: 39

Renovamen/Text-Classification

PyTorch implementation of some text classification models (HAN, fastText, BiLSTM-Attention, TextCNN, Transformer) | 文本分类

Language: Python - Size: 266 KB - Last synced at: 20 days ago - Pushed at: about 4 years ago - Stars: 148 - Forks: 29

jankstar/pydocu

fastapi server for classification of documents and extraction of data

Language: Python - Size: 152 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

MiniAiLive/ID-DocumentRecognition-Docker

MiniAiLive Intelligent ID OCR for Reliable Identity Verification From document verification to data entry, our MiniAiLive OCR solution can help transform your identity verification process.

Language: Python - Size: 1.65 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 5 - Forks: 4

MiniAiLive/ID-DocumentRecognition-Windows

MiniAiLive Intelligent ID OCR for Reliable Identity Verification From document verification to data entry, our MiniAiLive OCR solution can help transform your identity verification process.

Language: Python - Size: 1.72 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 83 - Forks: 76

MiniAiLive/ID-DocumentRecognition-Linux

MiniAiLive Intelligent ID OCR for Reliable Identity Verification From document verification to data entry, our MiniAiLive OCR solution can help transform your identity verification process.

Language: Python - Size: 1.73 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 83 - Forks: 72

MiniAiLive/.github

Size: 47.9 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 15 - Forks: 2

castorini/hedwig

PyTorch deep learning models for document classification

Language: Python - Size: 23.3 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 595 - Forks: 125

SDpDas/YOLOv5-DocAnalyser

This tool extracts images from a PDF, annotates them using the YOLOv5 model, and converts the annotated images back into a single PDF.. https://github.com/ultralytics/yolov5 https://github.com/HumanSignal/labelImg https://www.kaggle.com/code/sagardeepdas/yolov5-model1

Language: Python - Size: 179 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

justinbt1/Multimodal-Document-Classification

MSc project investigating multi-modal fusion approaches to combining textual and visual features for multi-page classification of documents within the OGA National Data Repository (NDR).

Language: Jupyter Notebook - Size: 11.9 MB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 0

ThatSINEWAVE/Project-Reminder

DocTrack is a web app designed to help Romanian citizens track official documents (ID card, driver's license, etc.) and receive expiration reminders.

Language: JavaScript - Size: 278 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

ematvey/hierarchical-attention-networks

Document classification with Hierarchical Attention Networks in TensorFlow. WARNING: project is currently unmaintained, issues will probably not be addressed.

Language: Python - Size: 40 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 465 - Forks: 148

olekli/MrDocument

Automatic PDF transcription and classification via OpenAI

Language: Rust - Size: 163 KB - Last synced at: about 21 hours ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

RituYadav92/Lightweighted-CNN-for-Document-Classification

Optimized Text Document Classification

Language: Python - Size: 4.93 MB - Last synced at: 7 days ago - Pushed at: almost 5 years ago - Stars: 6 - Forks: 4

gdmatrix/gdmatrix

GDMatrix is a modular document management system that provides an integrated set of services and applications designed to meet the requirements of public administrations.

Language: Java - Size: 25.9 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 5 - Forks: 5

sr-murthy/doc_classifier

Experimental document classification tool based on a domain-dependent, keywords-based document class map and a keyword frequency score

Language: Python - Size: 74.2 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

MiteshPuthran/Document_Classification

Python code for classification of documents into different classes using machine learning

Language: Jupyter Notebook - Size: 89.1 MB - Last synced at: 30 days ago - Pushed at: about 6 years ago - Stars: 28 - Forks: 8

hank110/bagofconcepts

Python implementation of bag-of-concepts

Language: Python - Size: 5.34 MB - Last synced at: 21 days ago - Pushed at: almost 3 years ago - Stars: 20 - Forks: 1

raviqqe/tensorflow-font2char2word2sent2doc

TensorFlow implementation of Hierarchical Attention Networks for Document Classification and some extension

Language: Python - Size: 78.1 KB - Last synced at: 21 days ago - Pushed at: about 8 years ago - Stars: 94 - Forks: 31

Hellisotherpeople/Active-Explainable-Classification

A set of tools for leveraging pre-trained embeddings, active learning and model explainability for effecient document classification

Language: HTML - Size: 2.78 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 29 - Forks: 1

acsenrafilho/cucaracha

A bureaucratic cockroach (cucaracha) assistent to help in document processing and analysis

Language: Python - Size: 6.44 MB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

GerHobbelt/qiqqa-open-source Fork of jimmejardine/qiqqa-open-source

The open-sourced version of the award-winning Qiqqa research management tool for Windows (a bleeding edge dev fork) ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ・ ☞☞☞ File any issues you find in the main repo issue tracker at https://github.com/jimmejardine/qiqqa-open-source/issues

Language: TeX - Size: 1.43 GB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 47 - Forks: 5

DocsaidLab/DocClassifier

A zero-shot document classifier.

Language: Python - Size: 46 MB - Last synced at: 25 days ago - Pushed at: 4 months ago - Stars: 4 - Forks: 1

digiparser/digiparser-docs

DigiParser Documentation and API reference

Language: MDX - Size: 149 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

qkrdmsghk/TextSSL

[AAAI 2022] Sparse Structure Learning via Graph Neural Networks for Inductive Document Classification

Language: Python - Size: 142 MB - Last synced at: 14 days ago - Pushed at: 6 months ago - Stars: 32 - Forks: 10

IBM/twitter-customer-care-document-prediction 📦

Twitter dataset for Conversational Document Prediction to Assist Customer Care Agents (Ganhotra et al. 2020, EMNLP)

Size: 6.84 MB - Last synced at: 7 days ago - Pushed at: over 4 years ago - Stars: 15 - Forks: 6

pandeykartikey/Hierarchical-Attention-Network

Implementation of Hierarchical Attention Networks in PyTorch

Language: Jupyter Notebook - Size: 98.2 MB - Last synced at: 22 days ago - Pushed at: over 6 years ago - Stars: 129 - Forks: 27

DataTurks/DataTurks

ML data annotations made super easy for teams. Just upload data, add your team and build training/evaluation dataset in hours.

Language: JavaScript - Size: 3.95 MB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 266 - Forks: 125

wri-dssg-omdena/policy-data-analyzer

Building a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringing NLP to the world of policy analysis through an extensible framework that includes scraping, preprocessing, active learning and text analysis pipelines.

Language: Jupyter Notebook - Size: 226 MB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 35 - Forks: 8

eahlys/EdPaper

Helps you organizing your paperwork

Language: PHP - Size: 354 KB - Last synced at: 14 days ago - Pushed at: about 5 years ago - Stars: 57 - Forks: 10

tqtg/hierarchical-attention-networks

TensorFlow implementation of the paper "Hierarchical Attention Networks for Document Classification"

Language: Python - Size: 1.07 MB - Last synced at: about 2 months ago - Pushed at: almost 6 years ago - Stars: 87 - Forks: 25

PKPDAI/PKDocClassifier

Binary classifier to identify scientific publications reporting pharmacokinetic parameters estimated in vivo

Language: Python - Size: 8.42 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 18 - Forks: 8

SDpDas/Document-layout-generator-and-segmentation-tool

Lists all parts of a document PDF and is a highly scalable with robust code.

Language: Python - Size: 58.7 MB - Last synced at: 29 days ago - Pushed at: 11 months ago - Stars: 4 - Forks: 0

bitrao/Job-Advertisement-Classification

Classification Models for Job Advertisements

Language: Jupyter Notebook - Size: 2.19 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

lozingaro/multimodal-side-tuning

Classification using deep-learning additive technique and multimodal inputs.

Language: Python - Size: 4.05 MB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 11 - Forks: 0

caltechlibrary/documentarist

Process Caltech Archives' digital documents and photos, and annotate each page or image with information about its contents

Language: Python - Size: 519 KB - Last synced at: 30 days ago - Pushed at: about 3 years ago - Stars: 12 - Forks: 4

IgorAugust0/info-org-retrieval

📙 Arquivos e materiais utilizados na disciplina GSI024 - Organização e Recuperação da Informação da UFU.

Language: Jupyter Notebook - Size: 5.61 MB - Last synced at: 2 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

gaurav104/TextClassification

Repository of state of the art text/documentation classification algorithms in Pytorch.

Language: Jupyter Notebook - Size: 1.82 MB - Last synced at: 10 months ago - Pushed at: about 6 years ago - Stars: 11 - Forks: 2

csebuetnlp/banglabert

This repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla" accpeted in Findings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics: NAACL-2022.

Language: Python - Size: 1.14 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 230 - Forks: 31

ExtrieveTechnologies/QuickCapture_IOS

QuickCapture Mobile Scanning SDK Specially designed for native IOS

Language: Objective-C - Size: 80.1 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

prnvpwr2612/MS-AI-900-Practice

Completing the exercises & exploring service offered by Azure AI.

Language: Python - Size: 33.5 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

MS1034/document-classification-using-KNN

Documents classification using KNN Algorithm a graph based approach along with scrapped data

Language: Python - Size: 13.4 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

mdh266/TextClassificationApp

Building and Deploying A Serverless Text Classification Web App

Language: Jupyter Notebook - Size: 8.61 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 18 - Forks: 10

ali7haider/Classification_of_Documents_Using_Graph-Based-Features_and_KNN_GT

Classification of Documents Using Graph-Based Features and KNN This project offers hands-on experience with graph theory and machine learning, fostering skills in data representation, algorithm implementation, and analytical thinking in the context of document classification.

Language: Python - Size: 1.79 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Yahya123-hub/Classification-of-Documents-Using-Graph-Based-Features-and-KNN

An innovative project that integrates graph theory and machine learning techniques to classify documents into predefined topics. By leveraging graph representations of documents and employing the K-Nearest Neighbors (KNN) algorithm, this project aims to provide a robust system for document classification

Language: Python - Size: 49 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 2

hautran7201/skip_gram_for_document_classification

Language: Python - Size: 79.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

gangeshwark/hierarchical_document_modeling

Hierarchical Attention Networks for Document Classification

Language: Python - Size: 63.9 MB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 5

pfalcon/papersman

Minimalist electronic documents/papers/publications manager/indexer/categorizer

Language: Python - Size: 21.5 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 14 - Forks: 2

jhj0517/document_classification

finetune text classification model

Language: Jupyter Notebook - Size: 4.9 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 1

stipefrkovic/identity-document-classification-trainer

[Software Engineering]

Language: Python - Size: 327 MB - Last synced at: about 12 hours ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

11harini04/Feedly

Sentiment analysis of Reuter news articles..

Language: Python - Size: 25.4 KB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 1

FranzTscharf/JavaScript-D3-Object-Detection-Example

JavaScript D3 Object Detection Bounding Box Draw Example

Language: HTML - Size: 598 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 1

EdGENetworks/attention-networks-for-classification

Hierarchical Attention Networks for Document Classification in PyTorch

Language: Jupyter Notebook - Size: 233 KB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 595 - Forks: 135

elahehaghaarabi/language_model_grant_classifier

A language model is fine-tuned using domain data to identify pre-defined groups of documents

Language: Jupyter Notebook - Size: 43.9 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

jfilter/text-classification-keras 📦

📚 Text classification library with Keras

Language: Python - Size: 11.8 MB - Last synced at: 2 days ago - Pushed at: almost 5 years ago - Stars: 52 - Forks: 11

sbischoff-ai/basic-document-classifier

A simple CNN for n-class classification of document images

Language: Python - Size: 27.3 KB - Last synced at: 13 days ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0

GuillaumeDD/gowpy

A very simple library for exploiting graph-of-words in NLP

Language: Python - Size: 1.09 MB - Last synced at: 3 days ago - Pushed at: about 4 years ago - Stars: 12 - Forks: 2

aditya00kumar/document-classification

This project is an attempt to provide a generic pipeline for document classification using different machine learning models.

Language: Jupyter Notebook - Size: 12 MB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 7 - Forks: 3

Emvista/Gnn4DependencyDocumentClassification

Language: Python - Size: 35.2 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

malteos/semantic-document-relations

Implementation, trained models and result data for the paper "Pairwise Multi-Class Document Classification for Semantic Relations between Wikipedia Articles"

Language: Python - Size: 174 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 31 - Forks: 2

ibtihelgharsalah/Document-Summarization-and-Information-Retrieval

A web app that uses NLP tasks for document manipulation and QA.

Language: HTML - Size: 35.2 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

ankitdipto/ImageInfoExtractor

An end-to-end pipeline to filter scanned documents from arbitrary images with subsequent classification of the extracted documents.

Language: Jupyter Notebook - Size: 12.5 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

FantacherJOY/Arabic-text-classification

Arabic text documents classified using SVM, k-nn and Naive bayes classifers.

Language: Python - Size: 542 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 10 - Forks: 9

wolny/complement-naive-bayes

Implementation of Complement Naive Bayes text classifier used for automatic categorisation of DaWanda products

Language: Java - Size: 110 KB - Last synced at: 6 days ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 1

smit-1999/NaiveBayes

Implementing a multinomial naive bayes classifier for a bag of words model from scratch in Python3

Language: Jupyter Notebook - Size: 2.47 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

yesdeepakmittal/document-classification-using-bert

Document Classification using Bert | 97 % accuracy | FastAPI, Docker, Shell

Language: Jupyter Notebook - Size: 1000 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

dunneja/IFDT

Intelligent File Delivery Tool

Language: Python - Size: 543 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

RadwaKotait/TweetClassification

This repo is for a Supervised Classification Machine Learning project that attempts to classify tweets made by Donald Trump and Justin Trudeau

Language: Jupyter Notebook - Size: 462 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

javier-marti-isasi/OCR-free-Document-Understanding-with-Donut-Transformer

This project tackles a real-world challenge of automating client document processing, with a focus on enhancing document classification, error detection, data extraction, and validation.

Language: Jupyter Notebook - Size: 9.48 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

tanmaybinaykiya/CS-7650-Natural-Language

GaTech CS 7650 Assignment Solutions

Language: TeX - Size: 75.6 MB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 4 - Forks: 2

CaoHaiNam/Document_Processing

Preprocessing for document classification

Language: Python - Size: 117 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

ToineSayan/sequence-classification-with-bert

A useful repository for calculating classification baselines using Bert

Language: Python - Size: 76.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

hiarindam/document-image-classification-TL-SG

Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks

Language: Python - Size: 179 KB - Last synced at: 11 months ago - Pushed at: over 5 years ago - Stars: 42 - Forks: 16

AI-Bhagat/Document-Classification-using-CNN-Architectures

Document Classification Using Deep Learning

Language: Jupyter Notebook - Size: 12.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

kryvokhyzha/how-to-understand-bert

In this repository, I have collected different sources, visualizations, and code examples of BERT

Language: Jupyter Notebook - Size: 6.87 MB - Last synced at: 24 days ago - Pushed at: over 4 years ago - Stars: 8 - Forks: 2

Jijunjiang/Document-classification-algorithms-design

leangit

Language: Python - Size: 18 MB - Last synced at: almost 2 years ago - Pushed at: about 8 years ago - Stars: 1 - Forks: 0

dakopen/diyisio

diyisio is the first document classifier that is meant to be used by laymen to classify documents with custom labels. As diyisio achieves a very high level of accuracy, it is also interesting for people from the field.

Language: Python - Size: 854 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

mustaffa-hussain/DONAR-CHOOSE-Document-classfication

It contains application of naive bayes model on a big textual data set. The problems is an example of NLP based solution on 2 different kind of vetorization.

Language: Jupyter Notebook - Size: 117 KB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

melanietosik/bow-sentiment-classifier

Bag-of-words (BOW) sentiment classifier

Language: Python - Size: 322 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 3 - Forks: 0

dimits-ts/Large-Text-NLP-Survey

A survey paper exploring the use of state-of-the-art deep neural network architectures in NLP problems featuring very large documents.

Language: TeX - Size: 1.21 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

DamiFass/document-clustering-and-classification

This repo contains notebooks performing clustering and classification on documents from the FUNSD dataset

Language: Jupyter Notebook - Size: 13.7 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

scotthlee/document-classification

Simple command-line scripts for document classification

Language: Python - Size: 117 KB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 21 - Forks: 6

therealexpertai/nlapi-java

Java Client for the expert.ai Natural Language API

Language: Java - Size: 163 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 9

Related Keywords
document-classification 164 nlp 48 text-classification 40 machine-learning 33 deep-learning 30 natural-language-processing 26 pytorch 20 hierarchical-attention-networks 16 python 15 nlp-machine-learning 14 tensorflow 13 classification 13 bert 13 python3 10 sentiment-analysis 10 ocr 8 deep-neural-networks 7 document 7 word2vec 7 image-classification 7 naive-bayes-classifier 7 java 5 sentence-classification 5 lstm 5 deeplearning 5 convolutional-neural-networks 5 han 5 document-management 5 id-document-reader 4 transformer 4 text-mining 4 neural-network 4 transfer-learning 4 ekyc-verification 4 transformers 4 lda 4 data-science 4 fastapi 4 biometrics 4 onboarding 4 keras 4 sentiment-classification 4 attention-mechanism 4 mrz-scanner 4 nltk 4 cnn-text-classification 4 document-analysis 3 on-premise 3 ocr-recognition 3 naive-bayes 3 textcnn 3 document-ocr 3 object-detection 3 docker 3 passport-reader 3 cnn 3 knn-classification 3 idcard-ocr 3 bert-model 3 fasttext 3 active-learning 3 topic-modeling 3 clustering 3 tf-idf 3 image-processing 3 pdf 3 jupyter-notebook 3 text-processing 3 vector-space-model 3 information-retrieval 3 artificial-intelligence 3 data-mining 3 data-extraction 3 tesseract 3 multinomial-naive-bayes 3 authentication 3 pytorch-implementation 3 computer-vision 3 id-card-recognition 2 attention 2 torchtext 2 torch 2 sequence-classification 2 id-verification 2 glove 2 text-preprocessing 2 tutorial 2 word-embeddings 2 sklearn 2 graph 2 annotation-tool 2 document-annotate 2 document-filtering 2 huggingface 2 documents 2 dataless 2 webapp 2 topic-model 2 scraping 2 open-source 2