Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: ocr-d

OCR-D/gt_structure_1_2

The repo gt_structure_1_2 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

Size: 1.44 GB - Last synced: 4 days ago - Pushed: 5 days ago - Stars: 0 - Forks: 0

OCR-D/ocrd_tesserocr

Run tesseract with the tesserocr bindings with @OCR-D's interfaces

Language: Python - Size: 655 KB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 38 - Forks: 11

OCR-D/gt_structure_1_3

The repo gt_structure_1_3 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

Size: 2.15 GB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 0 - Forks: 0

OCR-D/gt_structure_1_4

The repo gt_structure_1_4 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

Size: 1.8 GB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 1 - Forks: 0

bertsky/ocrd_detectron2

OCR-D wrapper for detectron2 based segmentation models

Language: Python - Size: 1.21 GB - Last synced: 4 days ago - Pushed: 5 days ago - Stars: 16 - Forks: 5

UB-Mannheim/tesseract Fork of tesseract-ocr/tesseract

Tesseract Open Source OCR Engine (main repository)

Language: C++ - Size: 118 MB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 2,856 - Forks: 414

OCR-D/gt_structure_1_1

The repo gt_structure_1_1 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

Size: 1.22 GB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

OCR-D/gt-repo-template

A template for creating a ground truth repo with various functions and features, such as metadata creation, data analysis, and presentation.

Size: 155 KB - Last synced: 10 days ago - Pushed: 11 days ago - Stars: 7 - Forks: 4

qurator-spk/dinglehopper

An OCR evaluation tool

Language: Python - Size: 3.8 MB - Last synced: 11 days ago - Pushed: 12 days ago - Stars: 54 - Forks: 12

bertsky/workflow-configuration

a makefilization for OCR-D workflows, with configuration examples

Language: Makefile - Size: 218 KB - Last synced: 19 days ago - Pushed: 20 days ago - Stars: 10 - Forks: 4

OCR-D/core

Collection of OCR-related python tools and wrappers from @OCR-D

Language: Python - Size: 26.1 MB - Last synced: 20 days ago - Pushed: 20 days ago - Stars: 118 - Forks: 31

OCR-D/ocrd_all

Master repository which includes most other OCR-D repositories as submodules

Language: Makefile - Size: 880 KB - Last synced: 27 days ago - Pushed: 27 days ago - Stars: 69 - Forks: 17

OCR-D/ocrd_segment

OCR-D-compliant page segmentation

Language: Python - Size: 2.67 MB - Last synced: 29 days ago - Pushed: about 1 month ago - Stars: 65 - Forks: 15

OCR-D/ocrd_anybaseocr

DFKI Layout Detection for OCR-D

Language: Python - Size: 122 MB - Last synced: 14 days ago - Pushed: 23 days ago - Stars: 47 - Forks: 12

UB-Mannheim/ocr-fileformat

Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)

Language: JavaScript - Size: 799 KB - Last synced: about 1 month ago - Pushed: 3 months ago - Stars: 175 - Forks: 23

OCR-D/gt-metadata Fork of HTR-United/htr-united.github.io

Metadata tool for Ground Truth datasets

Language: CSS - Size: 7.73 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 2

OCR-D/gt-MufiLevelRules

OCR-D-Level-Rules can be created automatically with gt-MufiLevelRules from the encodings published by MUFI: The Medieval Unicode Font Initiative.

Language: XSLT - Size: 1.21 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 2 - Forks: 0

OCR-D/gt-repo-scripts

XSLT and shell scripts for analyzing and creating GitHub pages of a ground truth repository. These are centrally managed and can be used by all repositories created with gt-repo-template (https://github.com/OCR-D/gt-repo-template).

Language: XSLT - Size: 1.25 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 1

cisocrgroup/ocrd_cis

OCR-D python tools

Language: Python - Size: 78.5 MB - Last synced: 22 days ago - Pushed: about 1 month ago - Stars: 33 - Forks: 11

bertsky/ocrd_doxa

OCR-D wrapper for DoxaPy image binarization via locally adaptive thresholding

Language: Python - Size: 9.77 KB - Last synced: 7 days ago - Pushed: 8 months ago - Stars: 1 - Forks: 1

UB-Mannheim/hkb-gt

Ground truth for a political newspaper of the Mannheim region (1931–1945)

Language: Shell - Size: 2.33 MB - Last synced: about 2 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0

slub/ocrd_kitodo

Docker integration of Kitodo.Production and OCR-D

Language: XSLT - Size: 15.2 MB - Last synced: about 1 month ago - Pushed: 2 months ago - Stars: 10 - Forks: 5

OCR-D/ocrd_keraslm

Simple character-based language model using keras

Language: Python - Size: 284 KB - Last synced: 4 days ago - Pushed: 2 months ago - Stars: 6 - Forks: 6

OCR-D/ocrd_typegroups_classifier 📦

Font family detection in historical documents.

Language: Python - Size: 323 MB - Last synced: 4 days ago - Pushed: about 1 year ago - Stars: 6 - Forks: 10

ASVLeipzig/cor-asv-ann

OCR-D post-correction with encoder-attention-decoder LSTMs

Language: Python - Size: 797 KB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 13 - Forks: 3

OCR-D/ocrd_kraken

Wrapper for the kraken OCR engine

Language: Python - Size: 148 KB - Last synced: 9 days ago - Pushed: 3 months ago - Stars: 10 - Forks: 6

OCR-D/ocrd_fileformat

OCR-D wrapper for ocr-fileformat

Language: Shell - Size: 60.5 KB - Last synced: 4 months ago - Pushed: 5 months ago - Stars: 4 - Forks: 3

OCR-D/gt-labelling

Language: XSLT - Size: 143 KB - Last synced: about 1 month ago - Pushed: 3 months ago - Stars: 1 - Forks: 0

ASVLeipzig/cor-asv-fst

OCR-D post-correction module based on weighted finite-state transducers

Language: Python - Size: 1.45 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 9 - Forks: 4

qurator-spk/ocrd_repair_inconsistencies 📦

Automatically re-order lines, words and glyphs to become textually consistent with their parents.

Language: Python - Size: 44.9 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 2 - Forks: 3

OCR-D/ocrd_vandalize

Demo processor to illustrate OCR-D Python API

Language: Python - Size: 123 KB - Last synced: about 1 month ago - Pushed: 6 months ago - Stars: 5 - Forks: 2

UB-Mannheim/ocrd_contrib_ubma

Helper scripts for OCR-D

Language: Python - Size: 10.7 KB - Last synced: 6 months ago - Pushed: over 3 years ago - Stars: 3 - Forks: 1

UB-Mannheim/ocrd_all Fork of OCR-D/ocrd_all

Master repository which includes most other OCR-D repositories as submodules

Language: Makefile - Size: 595 KB - Last synced: 6 months ago - Pushed: about 2 years ago - Stars: 2 - Forks: 0

UB-Mannheim/ocrd_pagetopdf

OCR-D wrapper for prima-pagetopdf

Language: Shell - Size: 2.71 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 7 - Forks: 5

kba/page-to-alto

Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)

Language: Python - Size: 420 KB - Last synced: about 1 month ago - Pushed: 5 months ago - Stars: 12 - Forks: 5

OCR-D/format-converters

Converters for various file formats used for representing OCR

Language: XSLT - Size: 55.7 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 11 - Forks: 5

OCR-D/policy

OCR-D recommendations for full-text digitization

Size: 69.3 KB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 1 - Forks: 1

tboenig/17_fontmix_complex

This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.

Size: 179 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

tboenig/16_ant_simple

This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.

Size: 15.5 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 1

tboenig/18_frak_simple

This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.

Size: 6.11 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 1

tboenig/19_frak_simple

This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.

Size: 8.19 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 1

tboenig/16_frak_complex

This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.

Size: 94.7 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

tboenig/17_frak_complex

This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.

Size: 54 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 1

tboenig/18_frak_complex

This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.

Size: 181 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 1

tboenig/17_fontmix_simple

This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.

Size: 97.4 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 1

tboenig/18_fontmix_complex

This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.

Size: 30 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 1

qurator-spk/train-calamari-gt4histocr

Train a GT4HistOCR Calamari model

Language: Shell - Size: 20.5 KB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 6 - Forks: 1

tboenig/16_ant_complex

This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.

Size: 14.2 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

OCR-D/ocrd_im6convert

Run ImageMagick with an OCR-D CLI

Language: Shell - Size: 34.2 KB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 5 - Forks: 4

bertsky/nmalign

forced alignment of lists of strings by fuzzy string matching

Language: Python - Size: 58.6 KB - Last synced: 7 days ago - Pushed: 9 months ago - Stars: 6 - Forks: 1

OCR-D/ocrmultieval

Extensible evaluation of (intermediate) results of an OCR workflow

Language: Python - Size: 12.8 MB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 3 - Forks: 0

OCR-D/spec

Specification of the @OCR-D technical architecture, interface definitions and data exchange format(s)

Language: Python - Size: 17.4 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 17 - Forks: 6

hnesk/browse-ocrd

An extensible viewer for OCR-D mets.xml files

Language: Python - Size: 15.4 MB - Last synced: 3 days ago - Pushed: 11 months ago - Stars: 19 - Forks: 9

qurator-spk/setuptools_ocrd

Manage your package version through ocrd-tool.json

Language: Python - Size: 49.8 KB - Last synced: 10 days ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

slub/ocrd_manager

frontend for ocrd_controller and adapter towards ocrd_kitodo

Language: Shell - Size: 6.44 MB - Last synced: about 1 month ago - Pushed: 3 months ago - Stars: 10 - Forks: 2

VRI-UFPR/ocrd-gbn Fork of qurator-spk/sbb_textline_detection

OCR-D compliant toolset for optical layout recognition on historical German-language documents published in Brazil

Language: Python - Size: 600 KB - Last synced: 21 days ago - Pushed: over 2 years ago - Stars: 9 - Forks: 0

bertsky/ocrd_jdeskew

OCR-D wrapper for Document Image Skew Estimation using Adaptive Radial Projection

Language: Python - Size: 8.79 KB - Last synced: 16 days ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

bertsky/ocrd_wrap

OCR-D wrapper for arbitrary coords-preserving image operations

Language: Python - Size: 43.9 KB - Last synced: 7 days ago - Pushed: 12 months ago - Stars: 4 - Forks: 2

slub/ocrd_controller

Path to network implementation of OCR-D

Language: Dockerfile - Size: 95.7 KB - Last synced: about 1 month ago - Pushed: 3 months ago - Stars: 6 - Forks: 2

OCR-D/ocrd_calamari

Recognize text using Calamari OCR and the OCR-D framework

Language: Python - Size: 1.07 MB - Last synced: 15 days ago - Pushed: 7 months ago - Stars: 12 - Forks: 6

OCR-D/taverna_workflow 📦

Workflows for OCR-D powered by Taverna.

Language: Shell - Size: 462 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 3 - Forks: 1

qurator-spk/ocrd_trocr

OCR-D processor for TrOCR

Language: Python - Size: 24.4 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 2 - Forks: 0

OCR-D/slides Fork of kba/ocrd-slides 📦

Language: CSS - Size: 66.2 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 1 - Forks: 1

OCR-D/OLD_ocrd_anybaseocr 📦

DFKI Layout Detection for OCR-D

Language: Python - Size: 122 MB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 1 - Forks: 0

OCR-D/docs 📦

OCR-D Documentation

Language: Python - Size: 503 KB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 4 - Forks: 2

OCR-D/ocrd_ocropy 📦

OCR-D CLI for ocropy

Language: Python - Size: 43 KB - Last synced: 16 days ago - Pushed: over 3 years ago - Stars: 2 - Forks: 1

OCR-D/okralact Fork of Doreenruirui/okralact

A repository for online OCRD training infrastructure.

Language: Python - Size: 461 MB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 4 - Forks: 0

OCR-D/ocr-d.github.io

Website for OCR-D specs, formats, requirements

Language: HTML - Size: 426 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 5 - Forks: 2

qurator-spk/ocrd-galley

A Dockerized test environment for OCR-D processors 🚢

Language: Shell - Size: 19.1 MB - Last synced: 19 days ago - Pushed: 19 days ago - Stars: 7 - Forks: 1

qurator-spk/page2tsv

PAGE-XML to TSV

Language: Python - Size: 317 KB - Last synced: about 1 month ago - Pushed: 2 months ago - Stars: 4 - Forks: 5

bertsky/docstruct

Document structure detection from PAGE-XML to METS-XML

Language: Python - Size: 9.77 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 6 - Forks: 1

OCR-D/gt-guidelines

OCR-D guidelines for Ground Truth production

Language: HTML - Size: 117 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 5 - Forks: 6

tboenig/19_ant_simple

This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.

Size: 59.5 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 1

tboenig/18_ant_simple

This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.

Size: 29.9 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 1

tboenig/17_frak_simple

This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.

Size: 8.69 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 1

tboenig/16_frak_simple

This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.

Size: 58.8 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 1

tboenig/ocrd_bbaw_pilotbibliothek

Report on the OCR-D test deployment at the Berlin-Brandenburgische Akademie der Wissenschaften (BBAW)

Language: HTML - Size: 141 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0

mikegerber/ocrd_calamari Fork of OCR-D/ocrd_calamari

Recognize text using Calamari OCR and the OCR-D framework

Language: Python - Size: 137 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

ocr-d-modul-2-segmentierung/ocrd-pixelclassifier-segmentation

Wrapper around pixel classifier

Language: Python - Size: 23.4 MB - Last synced: 17 days ago - Pushed: about 2 years ago - Stars: 9 - Forks: 6

OCR-D/ocrd_olahd_client

Language: Python - Size: 25.4 KB - Last synced: 30 days ago - Pushed: over 1 year ago - Stars: 1 - Forks: 2

qurator-spk/ocrd_calamari Fork of OCR-D/ocrd_calamari

Recognize text using Calamari OCR and the OCR-D framework

Language: Python - Size: 125 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

OCR-D/ocropy Fork of ocropus/ocropy

Python-based tools for document analysis and OCR

Language: Jupyter Notebook - Size: 40.9 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 1

OCR-D/assets

Test data for testing specs and software in @OCR-D

Language: Makefile - Size: 143 MB - Last synced: about 1 month ago - Pushed: 3 months ago - Stars: 5 - Forks: 9

bertsky/ocrd_page2tei

OCR-D wrapper for page2tei

Language: Makefile - Size: 4.88 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 3 - Forks: 0

bertsky/ocrd_origami

OCR-D wrapper for poke1024/origami OLR+OCR

Language: Python - Size: 6.84 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

OCR-D/ocrd_olena

Binarize with Olena/scribo

Language: Shell - Size: 170 KB - Last synced: about 2 months ago - Pushed: 3 months ago - Stars: 6 - Forks: 8

StaatsbibliothekBerlin/ocrd_butler

A butler is a domestic worker in a large household. The butler, as the senior servant, has the highest servant status. He can also sometimes function as a chauffeur.

Language: Python - Size: 120 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 7 - Forks: 3

OCR-D/PAGE-XML Fork of PRImA-Research-Lab/PAGE-XML

PAGE XML format collection for document image page content and more

Size: 4.61 MB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 1 - Forks: 2

GBN-DBP/ocrd-page-xml-draw

OCR-D wrapper for page-xml-draw

Language: Python - Size: 6.84 KB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

bertsky/ocrd_publaynet

convert PubLayNet data into METS/PAGE-XML

Language: Python - Size: 5.86 KB - Last synced: 15 days ago - Pushed: about 4 years ago - Stars: 10 - Forks: 0

stweil/tensorflow_gpu_to_tensorflow

Dummy Python package for tensorflow-gpu on hosts without GPU

Language: Python - Size: 3.91 KB - Last synced: about 2 months ago - Pushed: about 4 years ago - Stars: 2 - Forks: 0

OCR-D/ocrd_framework

Docker installation for the OCR-D framework containing all available processors, taverna workflow and local repository.

Language: Shell - Size: 16.6 KB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 2 - Forks: 0

kba/ocrd-core Fork of OCR-D/core

Collection of OCR-related python tools and wrappers from the OCR-D team

Language: Python - Size: 25.4 MB - Last synced: about 1 month ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

ocr-d-modul-2-segmentierung/ocr4all-segmentation

Language: Python - Size: 6.54 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 5 - Forks: 0

OCR-D/repository_metastore

Microservice for managing OCR-D data and metadata. It supports reading, writing, and updating metadata (XML), registering XSDs, validating XML, and indexing metadata.

Language: Java - Size: 1.72 MB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 1 - Forks: 1

OCR-D/BibliothecaBaltica2018

Slides for the OCR-D talk at the Bibliotheca Baltica 2018 symposium in Rostock

Size: 1.95 KB - Last synced: about 1 year ago - Pushed: almost 6 years ago - Stars: 0 - Forks: 0