Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: ocr-d
OCR-D/gt_structure_1_2
The repo gt_structure_1_2 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.
Size: 1.44 GB - Last synced: 4 days ago - Pushed: 5 days ago - Stars: 0 - Forks: 0
OCR-D/ocrd_tesserocr
Run tesseract with the tesserocr bindings with @OCR-D's interfaces
Language: Python - Size: 655 KB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 38 - Forks: 11
OCR-D/gt_structure_1_3
The repo gt_structure_1_3 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.
Size: 2.15 GB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 0 - Forks: 0
OCR-D/gt_structure_1_4
The repo gt_structure_1_4 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.
Size: 1.8 GB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 1 - Forks: 0
bertsky/ocrd_detectron2
OCR-D wrapper for detectron2 based segmentation models
Language: Python - Size: 1.21 GB - Last synced: 4 days ago - Pushed: 5 days ago - Stars: 16 - Forks: 5
UB-Mannheim/tesseract Fork of tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
Language: C++ - Size: 118 MB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 2,856 - Forks: 414
OCR-D/gt_structure_1_1
The repo gt_structure_1_1 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.
Size: 1.22 GB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0
OCR-D/gt-repo-template
A template for creating a ground truth repo with various functions and features, such as metadata creation, data analysis, and presentation.
Size: 155 KB - Last synced: 10 days ago - Pushed: 11 days ago - Stars: 7 - Forks: 4
qurator-spk/dinglehopper
An OCR evaluation tool
Language: Python - Size: 3.8 MB - Last synced: 11 days ago - Pushed: 12 days ago - Stars: 54 - Forks: 12
bertsky/workflow-configuration
a makefilization for OCR-D workflows, with configuration examples
Language: Makefile - Size: 218 KB - Last synced: 19 days ago - Pushed: 20 days ago - Stars: 10 - Forks: 4
OCR-D/core
Collection of OCR-related python tools and wrappers from @OCR-D
Language: Python - Size: 26.1 MB - Last synced: 20 days ago - Pushed: 20 days ago - Stars: 118 - Forks: 31
OCR-D/ocrd_all
Master repository which includes most other OCR-D repositories as submodules
Language: Makefile - Size: 880 KB - Last synced: 27 days ago - Pushed: 27 days ago - Stars: 69 - Forks: 17
OCR-D/ocrd_segment
OCR-D-compliant page segmentation
Language: Python - Size: 2.67 MB - Last synced: 29 days ago - Pushed: about 1 month ago - Stars: 65 - Forks: 15
OCR-D/ocrd_anybaseocr
DFKI Layout Detection for OCR-D
Language: Python - Size: 122 MB - Last synced: 14 days ago - Pushed: 23 days ago - Stars: 47 - Forks: 12
UB-Mannheim/ocr-fileformat
Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
Language: JavaScript - Size: 799 KB - Last synced: about 1 month ago - Pushed: 3 months ago - Stars: 175 - Forks: 23
OCR-D/gt-metadata Fork of HTR-United/htr-united.github.io
Metadata tool for Ground Truth datasets
Language: CSS - Size: 7.73 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 2
OCR-D/gt-MufiLevelRules
OCR-D-Level-Rules can be created automatically with gt-MufiLevelRules from the encodings published by MUFI: The Medieval Unicode Font Initiative.
Language: XSLT - Size: 1.21 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 2 - Forks: 0
OCR-D/gt-repo-scripts
XSLT and shell scripts for analyzing and creating GitHub pages of a ground truth repository. These are centrally managed and can be used by all repositories created with gt-repo-template (https://github.com/OCR-D/gt-repo-template).
Language: XSLT - Size: 1.25 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 1
cisocrgroup/ocrd_cis
OCR-D python tools
Language: Python - Size: 78.5 MB - Last synced: 22 days ago - Pushed: about 1 month ago - Stars: 33 - Forks: 11
bertsky/ocrd_doxa
OCR-D wrapper for DoxaPy image binarization via locally adaptive thresholding
Language: Python - Size: 9.77 KB - Last synced: 7 days ago - Pushed: 8 months ago - Stars: 1 - Forks: 1
UB-Mannheim/hkb-gt
Ground truth for a political newspaper of the Mannheim region (1931–1945)
Language: Shell - Size: 2.33 MB - Last synced: about 2 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0
slub/ocrd_kitodo
Docker integration of Kitodo.Production and OCR-D
Language: XSLT - Size: 15.2 MB - Last synced: about 1 month ago - Pushed: 2 months ago - Stars: 10 - Forks: 5
OCR-D/ocrd_keraslm
Simple character-based language model using keras
Language: Python - Size: 284 KB - Last synced: 4 days ago - Pushed: 2 months ago - Stars: 6 - Forks: 6
OCR-D/ocrd_typegroups_classifier 📦
Font family detection in historical documents.
Language: Python - Size: 323 MB - Last synced: 4 days ago - Pushed: about 1 year ago - Stars: 6 - Forks: 10
ASVLeipzig/cor-asv-ann
OCR-D post-correction with encoder-attention-decoder LSTMs
Language: Python - Size: 797 KB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 13 - Forks: 3
OCR-D/ocrd_kraken
Wrapper for the kraken OCR engine
Language: Python - Size: 148 KB - Last synced: 9 days ago - Pushed: 3 months ago - Stars: 10 - Forks: 6
OCR-D/ocrd_fileformat
OCR-D wrapper for ocr-fileformat
Language: Shell - Size: 60.5 KB - Last synced: 4 months ago - Pushed: 5 months ago - Stars: 4 - Forks: 3
OCR-D/gt-labelling
Language: XSLT - Size: 143 KB - Last synced: about 1 month ago - Pushed: 3 months ago - Stars: 1 - Forks: 0
ASVLeipzig/cor-asv-fst
OCR-D post-correction module based on weighted finite-state transducers
Language: Python - Size: 1.45 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 9 - Forks: 4
qurator-spk/ocrd_repair_inconsistencies 📦
Automatically re-order lines, words and glyphs to become textually consistent with their parents.
Language: Python - Size: 44.9 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 2 - Forks: 3
OCR-D/ocrd_vandalize
Demo processor to illustrate OCR-D Python API
Language: Python - Size: 123 KB - Last synced: about 1 month ago - Pushed: 6 months ago - Stars: 5 - Forks: 2
UB-Mannheim/ocrd_contrib_ubma
Helper scripts for OCR-D
Language: Python - Size: 10.7 KB - Last synced: 6 months ago - Pushed: over 3 years ago - Stars: 3 - Forks: 1
UB-Mannheim/ocrd_all Fork of OCR-D/ocrd_all
Master repository which includes most other OCR-D repositories as submodules
Language: Makefile - Size: 595 KB - Last synced: 6 months ago - Pushed: about 2 years ago - Stars: 2 - Forks: 0
UB-Mannheim/ocrd_pagetopdf
OCR-D wrapper for prima-pagetopdf
Language: Shell - Size: 2.71 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 7 - Forks: 5
kba/page-to-alto
Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)
Language: Python - Size: 420 KB - Last synced: about 1 month ago - Pushed: 5 months ago - Stars: 12 - Forks: 5
OCR-D/format-converters
Converters for various file formats used for representing OCR
Language: XSLT - Size: 55.7 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 11 - Forks: 5
OCR-D/policy
OCR-D recommendations for full-text digitization
Size: 69.3 KB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 1 - Forks: 1
tboenig/17_fontmix_complex
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
Size: 179 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0
tboenig/16_ant_simple
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
Size: 15.5 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 1
tboenig/18_frak_simple
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
Size: 6.11 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 1
tboenig/19_frak_simple
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
Size: 8.19 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 1
tboenig/16_frak_complex
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
Size: 94.7 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0
tboenig/17_frak_complex
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
Size: 54 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 1
tboenig/18_frak_complex
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
Size: 181 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 1
tboenig/17_fontmix_simple
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
Size: 97.4 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 1
tboenig/18_fontmix_complex
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
Size: 30 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 1
qurator-spk/train-calamari-gt4histocr
Train a GT4HistOCR Calamari model
Language: Shell - Size: 20.5 KB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 6 - Forks: 1
tboenig/16_ant_complex
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
Size: 14.2 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
OCR-D/ocrd_im6convert
Run ImageMagick with an OCR-D CLI
Language: Shell - Size: 34.2 KB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 5 - Forks: 4
bertsky/nmalign
forced alignment of lists of strings by fuzzy string matching
Language: Python - Size: 58.6 KB - Last synced: 7 days ago - Pushed: 9 months ago - Stars: 6 - Forks: 1
OCR-D/ocrmultieval
Extensible evaluation of (intermediate) results of an OCR workflow
Language: Python - Size: 12.8 MB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 3 - Forks: 0
OCR-D/spec
Specification of the @OCR-D technical architecture, interface definitions and data exchange format(s)
Language: Python - Size: 17.4 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 17 - Forks: 6
hnesk/browse-ocrd
An extensible viewer for OCR-D mets.xml files
Language: Python - Size: 15.4 MB - Last synced: 3 days ago - Pushed: 11 months ago - Stars: 19 - Forks: 9
qurator-spk/setuptools_ocrd
Manage your package version through ocrd-tool.json
Language: Python - Size: 49.8 KB - Last synced: 10 days ago - Pushed: 9 months ago - Stars: 0 - Forks: 0
slub/ocrd_manager
frontend for ocrd_controller and adapter towards ocrd_kitodo
Language: Shell - Size: 6.44 MB - Last synced: about 1 month ago - Pushed: 3 months ago - Stars: 10 - Forks: 2
VRI-UFPR/ocrd-gbn Fork of qurator-spk/sbb_textline_detection
OCR-D compliant toolset for optical layout recognition on historical German-language documents published in Brazil
Language: Python - Size: 600 KB - Last synced: 21 days ago - Pushed: over 2 years ago - Stars: 9 - Forks: 0
bertsky/ocrd_jdeskew
OCR-D wrapper for Document Image Skew Estimation using Adaptive Radial Projection
Language: Python - Size: 8.79 KB - Last synced: 16 days ago - Pushed: 11 months ago - Stars: 0 - Forks: 0
bertsky/ocrd_wrap
OCR-D wrapper for arbitrary coords-preserving image operations
Language: Python - Size: 43.9 KB - Last synced: 7 days ago - Pushed: 12 months ago - Stars: 4 - Forks: 2
slub/ocrd_controller
Path to network implementation of OCR-D
Language: Dockerfile - Size: 95.7 KB - Last synced: about 1 month ago - Pushed: 3 months ago - Stars: 6 - Forks: 2
OCR-D/ocrd_calamari
Recognize text using Calamari OCR and the OCR-D framework
Language: Python - Size: 1.07 MB - Last synced: 15 days ago - Pushed: 7 months ago - Stars: 12 - Forks: 6
OCR-D/taverna_workflow 📦
Workflows for OCR-D powered by Taverna.
Language: Shell - Size: 462 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 3 - Forks: 1
qurator-spk/ocrd_trocr
OCR-D processor for TrOCR
Language: Python - Size: 24.4 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 2 - Forks: 0
OCR-D/slides Fork of kba/ocrd-slides 📦
Language: CSS - Size: 66.2 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 1 - Forks: 1
OCR-D/OLD_ocrd_anybaseocr 📦
DFKI Layout Detection for OCR-D
Language: Python - Size: 122 MB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 1 - Forks: 0
OCR-D/docs 📦
OCR-D Documentation
Language: Python - Size: 503 KB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 4 - Forks: 2
OCR-D/ocrd_ocropy 📦
OCRD CLI to ocropy
Language: Python - Size: 43 KB - Last synced: 16 days ago - Pushed: over 3 years ago - Stars: 2 - Forks: 1
OCR-D/okralact Fork of Doreenruirui/okralact
A repository for online OCRD training infrastructure.
Language: Python - Size: 461 MB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 4 - Forks: 0
OCR-D/ocr-d.github.io
Website for OCR-D specs, formats, requirements
Language: HTML - Size: 426 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 5 - Forks: 2
qurator-spk/ocrd-galley
A Dockerized test environment for OCR-D processors 🚢
Language: Shell - Size: 19.1 MB - Last synced: 19 days ago - Pushed: 19 days ago - Stars: 7 - Forks: 1
qurator-spk/page2tsv
PAGE-XML to TSV
Language: Python - Size: 317 KB - Last synced: about 1 month ago - Pushed: 2 months ago - Stars: 4 - Forks: 5
bertsky/docstruct
Document structure detection from PAGE-XML to METS-XML
Language: Python - Size: 9.77 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 6 - Forks: 1
OCR-D/gt-guidelines
OCR-D guidelines for Ground Truth production
Language: HTML - Size: 117 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 5 - Forks: 6
tboenig/19_ant_simple
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
Size: 59.5 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 1
tboenig/18_ant_simple
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
Size: 29.9 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 1
tboenig/17_frak_simple
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
Size: 8.69 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 1
tboenig/16_frak_simple
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
Size: 58.8 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 1
tboenig/ocrd_bbaw_pilotbibliothek
Report on the OCR-D trial installation at the Berlin-Brandenburgische Akademie der Wissenschaften (BBAW)
Language: HTML - Size: 141 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0
mikegerber/ocrd_calamari Fork of OCR-D/ocrd_calamari
Recognize text using Calamari OCR and the OCR-D framework
Language: Python - Size: 137 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0
ocr-d-modul-2-segmentierung/ocrd-pixelclassifier-segmentation
Wrapper around pixel classifier
Language: Python - Size: 23.4 MB - Last synced: 17 days ago - Pushed: about 2 years ago - Stars: 9 - Forks: 6
OCR-D/ocrd_olahd_client
Language: Python - Size: 25.4 KB - Last synced: 30 days ago - Pushed: over 1 year ago - Stars: 1 - Forks: 2
qurator-spk/ocrd_calamari Fork of OCR-D/ocrd_calamari
Recognize text using Calamari OCR and the OCR-D framework
Language: Python - Size: 125 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
OCR-D/ocropy Fork of ocropus/ocropy
Python-based tools for document analysis and OCR
Language: Jupyter Notebook - Size: 40.9 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 1
OCR-D/assets
Test data for testing specs and software in @OCR-D
Language: Makefile - Size: 143 MB - Last synced: about 1 month ago - Pushed: 3 months ago - Stars: 5 - Forks: 9
bertsky/ocrd_page2tei
OCR-D wrapper for page2tei
Language: Makefile - Size: 4.88 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 3 - Forks: 0
bertsky/ocrd_origami
OCR-D wrapper for poke1024/origami OLR+OCR
Language: Python - Size: 6.84 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
OCR-D/ocrd_olena
Binarize with Olena/scribo
Language: Shell - Size: 170 KB - Last synced: about 2 months ago - Pushed: 3 months ago - Stars: 6 - Forks: 8
StaatsbibliothekBerlin/ocrd_butler
A butler is a domestic worker in a large household. The butler, as the senior servant, has the highest servant status. He can also sometimes function as a chauffeur.
Language: Python - Size: 120 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 7 - Forks: 3
OCR-D/PAGE-XML Fork of PRImA-Research-Lab/PAGE-XML
PAGE XML format collection for document image page content and more
Size: 4.61 MB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 1 - Forks: 2
GBN-DBP/ocrd-page-xml-draw
OCR-D wrapper for page-xml-draw
Language: Python - Size: 6.84 KB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0
bertsky/ocrd_publaynet
convert PubLayNet data into METS/PAGE-XML
Language: Python - Size: 5.86 KB - Last synced: 15 days ago - Pushed: about 4 years ago - Stars: 10 - Forks: 0
stweil/tensorflow_gpu_to_tensorflow
Dummy Python package for tensorflow-gpu on hosts without GPU
Language: Python - Size: 3.91 KB - Last synced: about 2 months ago - Pushed: about 4 years ago - Stars: 2 - Forks: 0
OCR-D/ocrd_framework
Docker installation for the OCR-D framework containing all available processors, taverna workflow and local repository.
Language: Shell - Size: 16.6 KB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 2 - Forks: 0
kba/ocrd-core Fork of OCR-D/core
Collection of OCR-related python tools and wrappers from the OCR-D team
Language: Python - Size: 25.4 MB - Last synced: about 1 month ago - Pushed: 9 months ago - Stars: 0 - Forks: 0
ocr-d-modul-2-segmentierung/ocr4all-segmentation
Language: Python - Size: 6.54 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 5 - Forks: 0
OCR-D/repository_metastore
Microservice to manage the data and metadata of the OCR-D data. It provides read/write/update metadata (XML), registering XSD, validate XML and indexing of metadata.
Language: Java - Size: 1.72 MB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 1 - Forks: 1
OCR-D/BibliothecaBaltica2018
Slides for the OCR-D talk at the Bibliotheca Baltica 2018 symposium in Rostock
Size: 1.95 KB - Last synced: about 1 year ago - Pushed: almost 6 years ago - Stars: 0 - Forks: 0