An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: tika

apache/tika-docker

Convenience Docker images for Apache Tika Server

Language: Shell - Size: 115 KB - Last synced at: about 3 hours ago - Pushed at: 9 days ago - Stars: 180 - Forks: 75

apache/tika

The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).

Language: Java - Size: 236 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 2,917 - Forks: 817

ICIJ/extract

A cross-platform command line tool for parallelised content extraction and analysis.

Language: Java - Size: 69.5 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 244 - Forks: 32

shebinleo/pdf2html

pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image for PDF file using Apache PDFBox.

Language: JavaScript - Size: 637 KB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 165 - Forks: 35

kestra-io/plugin-tika

Language: Java - Size: 3.54 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 2 - Forks: 3

catalyst/moodle-search_postgresfulltext

Moodle search engine implemented using Postgres full text indexing

Language: PHP - Size: 58.6 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 2 - Forks: 7

yobix-ai/extractous

Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.

Language: Rust - Size: 2.88 MB - Last synced at: 9 days ago - Pushed at: 4 months ago - Stars: 1,051 - Forks: 43

quarkiverse/quarkus-tika

Quarkus Tika extension

Language: Java - Size: 652 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 11 - Forks: 13

dadoonet/fscrawler

Elasticsearch File System Crawler (FS Crawler)

Language: Java - Size: 15.4 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 1,386 - Forks: 305

shelfio/tika-text-extract

Extract text from a document by Apache Tika

Language: TypeScript - Size: 347 KB - Last synced at: 8 days ago - Pushed at: 15 days ago - Stars: 16 - Forks: 6

stumpylog/tika-client

A modern Python REST client for Apache Tika server

Language: Python - Size: 2.16 MB - Last synced at: 9 days ago - Pushed at: 24 days ago - Stars: 16 - Forks: 5

fedelemantuano/tika-app-python

Python bindings for Apache Tika

Language: Python - Size: 244 KB - Last synced at: 18 days ago - Pushed at: over 4 years ago - Stars: 21 - Forks: 5

fejesa/quarkus-minio

Solving Media File Availability Challenges with Quarkus and MinIO

Language: Java - Size: 2.66 MB - Last synced at: 16 days ago - Pushed at: 29 days ago - Stars: 1 - Forks: 0

ropensci/rtika

R Interface to Apache Tika

Language: R - Size: 133 MB - Last synced at: 10 days ago - Pushed at: almost 2 years ago - Stars: 54 - Forks: 8

sergio11/struts2-hibernate

This project demonstrates building a web application with Struts2, Apache Tika, Hibernate, and Wildfly 10. 🚀 Users can upload PDF files, extract text content using Apache Tika, and store metadata in a database using Hibernate. 🔒 Additionally, the project provides instructions for setting up a JDBC Realm on Wildfly 10 for enhanced security.

Language: Java - Size: 140 KB - Last synced at: 19 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

vaites/php-apache-tika

Apache Tika bindings for PHP: extract text and metadata from documents, images and other formats

Language: PHP - Size: 13.8 MB - Last synced at: 9 days ago - Pushed at: about 1 month ago - Stars: 116 - Forks: 23

sergio11/document_search_engine_architecture

📄🚀 Unleash a powerful Document Search Engine with Apache NiFi for lightning-fast, comprehensive text indexing and search.

Language: Java - Size: 13.5 MB - Last synced at: 19 days ago - Pushed at: about 1 month ago - Stars: 30 - Forks: 11

TYPO3-Solr/ext-tika

A TYPO3 CMS extension that provides Apache Tika functionality

Language: PHP - Size: 2.16 MB - Last synced at: 16 days ago - Pushed at: 4 months ago - Stars: 9 - Forks: 29

chrismattmann/MLwithTensorFlow2ed

Code for Machine Learning with TensorFlow: 2nd Edition Published by Manning Publications

Language: Jupyter Notebook - Size: 546 MB - Last synced at: 20 days ago - Pushed at: over 2 years ago - Stars: 139 - Forks: 69

lguberan/LuceneFx

Tiny unofficial javafx demo application for Apache's Lucene and Tika.

Language: Java - Size: 86.9 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

apache/tika-helm

A Helm chart to deploy Apache Tika on Kubernetes.

Language: Smarty - Size: 86.9 KB - Last synced at: 2 days ago - Pushed at: 2 months ago - Stars: 28 - Forks: 20

chrismattmann/imagecat

ImageCat is an Apache OODT RADIX application that uses Apache Solr, Apache Tika and Apache OODT to ingest 10s of millions of files (images,but could be extended to other files) in place, and to extract metadata and OCR information from those files/images using Tika and Tesseract OCR.

Language: Java - Size: 175 MB - Last synced at: 14 days ago - Pushed at: over 6 years ago - Stars: 95 - Forks: 40

rse/tika-server

Apache Tika Server as a Background Service in Node.js

Language: JavaScript - Size: 84 KB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 19 - Forks: 5

liquidinvestigations/hoover-snoop2

Processing system for the search engine service in Liquid Investigations.

Language: Python - Size: 1.86 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 6 - Forks: 5

USCDataScience/sparkler

Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.

Language: Java - Size: 23.1 MB - Last synced at: 27 days ago - Pushed at: about 2 years ago - Stars: 412 - Forks: 141

chrismattmann/tika-similarity

Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.

Language: Python - Size: 3.2 MB - Last synced at: 14 days ago - Pushed at: about 1 year ago - Stars: 108 - Forks: 60

chrismattmann/drat

The Distributed Release Audit Tool (DRAT) for code analysis and verification.

Language: JavaScript - Size: 94.7 MB - Last synced at: 14 days ago - Pushed at: almost 2 years ago - Stars: 9 - Forks: 1

alexferl/tika 📦

Golang client for Apache Tika

Language: Go - Size: 11.7 KB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 6 - Forks: 1

FrodeRanders/disksearch

Indexes a directory hierarchy and provides a crude search interface onto that index

Language: Java - Size: 31.3 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

dmamakas2000/tiktok-java-app

This project implements a multimedia content sharing system in Java 8, allowing users to upload and stream videos to their subscribers. Inspired by platforms like TikTok, it manages user channels, subscriptions, and real-time video streaming, developing the event delivery system for efficient content promotion.

Language: Java - Size: 25 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

public-law/oregon-law-parser

Distill information about amendments to the Oregon Revised Statutes.

Language: Haskell - Size: 50.1 MB - Last synced at: about 9 hours ago - Pushed at: 8 days ago - Stars: 18 - Forks: 2

mattporritt/moodle-search_elastic

An Elasticsearch engine plugin for Moodle's Global Search

Language: PHP - Size: 1.36 MB - Last synced at: 22 days ago - Pushed at: 5 months ago - Stars: 16 - Forks: 14

ipfs-search/ipfs-tika 📦

Java web application taking IPFS hashes, extracting (textual) content and metadata through Apache's Tika.

Language: Java - Size: 52.7 KB - Last synced at: about 11 hours ago - Pushed at: over 3 years ago - Stars: 32 - Forks: 6

httpreserve/tikalinkextract

Tika based link (URL) extractor for httpreserve

Language: HTML - Size: 171 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 9 - Forks: 1

whentotrade/Noggle.TikaOnDotNet

.NET Tika Wrapper

Language: Rich Text Format - Size: 95.1 MB - Last synced at: 14 days ago - Pushed at: almost 6 years ago - Stars: 3 - Forks: 1

orijtech/tikago

Apache Tika adapter in Go

Language: Go - Size: 48 MB - Last synced at: 3 days ago - Pushed at: over 8 years ago - Stars: 1 - Forks: 0

EricLondon/Docker-Rails-Tika-Elasticsearch

Docker Rails Tika Elasticsearch

Language: Ruby - Size: 168 KB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

bcgov/nr-bcws-opensearch

opensearch related code

Language: Java - Size: 395 MB - Last synced at: 4 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 8

graboskyc/MQTTtoRealm 📦

A c# console app to act as MQTT broker and write messages to MongoDB Realm

Language: C# - Size: 116 KB - Last synced at: 3 days ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

nasa-jpl-memex/memex-explorer

Viewers for statistics and dashboarding of Domain Search Engine data

Language: Python - Size: 14 MB - Last synced at: 5 months ago - Pushed at: over 9 years ago - Stars: 121 - Forks: 69

hmmh/typo3-solr-file-indexer

TYPO3 Extension: solr_file_indexer

Language: PHP - Size: 466 KB - Last synced at: 10 months ago - Pushed at: over 1 year ago - Stars: 10 - Forks: 6

skvkel/information-retrieval-system

Information retrieval system for documents.

Language: HTML - Size: 78.9 MB - Last synced at: 10 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

AidaRosaCalvo/info-retrieval-system

Este proyecto consiste en la construcción de un sistema de recuperación de información que puede manipular documentos de diferentes formatos provenientes de un repositorio de información. La aplicación utiliza herramientas como Lucene y Tika para indexar y extraer información de los documentos.

Language: Java - Size: 39.3 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

KevM/tikaondotnet

Use the Java Tika text extraction library on the .NET platform

Language: Rich Text Format - Size: 155 MB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 193 - Forks: 73

albertus82/extfix

File Extension Fix Tool - Find and rename files with wrong extensions.

Language: Java - Size: 10.9 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

OpenSextant/Xponents

Geographic Place, Date/time, and Pattern entity extraction toolkit along with text extraction from unstructured data and GIS outputters.

Language: Java - Size: 78.5 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 42 - Forks: 7

sesam-community/content-extractor Fork of sesam-io/content-extraction-service

Extract textual information using the Apache Tika library from JSON streams

Language: Java - Size: 23.4 KB - Last synced at: 12 months ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

commitd/krill

Improved HTML output for Tika extraction

Language: Java - Size: 1.92 MB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 2

kressi/search-media

Parse media files with Apache Tika, add documents to Lucene index and query this index.

Language: Scala - Size: 30.3 MB - Last synced at: 12 months ago - Pushed at: about 8 years ago - Stars: 1 - Forks: 0

riccardo1980/simple-extractor

Simple test for document extractor

Language: Java - Size: 16.6 KB - Last synced at: 12 months ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

juanpablo-santos/jspwiki-tika-searchprovider

Apache JSPWiki tika search provider integration sample

Size: 7.81 KB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

M-Haertling/WorkforceResearchGuide

This is a UTDallas senior design project developed for Alliance Data. Its purpose is to provide a more robust system for searching through a document repository. This is achieved through high level indexing and the addition of a tagging system. This is a Maven project. Third party libraries used include Apache Lucene, Apache Tika, and SQLite.

Language: Perl - Size: 43.3 MB - Last synced at: about 1 year ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

DFKI/leechcrawler

Incremental crawling capabilities for Apache Tika. Crawl content out of e.g. file systems, http(s) sources (webcrawling) imap(s) servers or your own arbitrary data sources. LeechCrawler offers additional Tika parsers providing these crawling capabilities.

Language: Java - Size: 95.2 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 5

sarbanandabhikkhu/tipitaka-xml

Roman Tipitaka (CSCD)

Language: JavaScript - Size: 55.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

abhayalekal74/NLP-Information-Extraction

Extracting information from PDF files.

Language: Python - Size: 3.78 MB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

StegarescuAnaMaria/Java_Indexer_and_Searcher

This project is a simulation of a search engine which outputs the path of the documents based on the search string query input.

Language: Java - Size: 15.6 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

USCDataScience/tika-dockers

A suite of Machine Learning / Deep Learning Dockerfiles to allow Apache Tika to extract objects and to produce textual captions for images and video

Size: 21.5 KB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 20 - Forks: 6

wbicode/TikaService

A windows service wrapper for the tika JSR 311 network server.

Language: Batchfile - Size: 305 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Dimous/tsundoku

Book Management System for e-bibliomaniacs

Language: Java - Size: 89.8 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

tspannhw/nifi-extracttext-processor

Apache NiFi Custom Processor Extracting Text From Files with Apache Tika

Language: Java - Size: 891 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 34 - Forks: 29

DavidChicharro/Recuperacion-de-Informacion

Recuperación de Información (RI) 2019-2020 Grado en Ingeniería Informática UGR

Language: Java - Size: 18.7 MB - Last synced at: 4 months ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

arquivo/dspace-link-extractor

Extracts links from DSpace repositories

Language: Java - Size: 62.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

welle/JTika

Quick & Dirty project to generate java enumeration class for all mimetype in Apache Tika.

Language: Java - Size: 624 KB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

nasa-jpl-memex/image_space

Interactive Image similarity and Visual Search and Retrieval application

Language: JavaScript - Size: 2.25 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 93 - Forks: 46

alexoley/ReadWithMeBot

telegram bot available by username @ReadWithMeBot

Language: Kotlin - Size: 151 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

phantom0301/MetaSpider

基于Python和Tika的网络富文本元信息爬虫,Web crawler for rich text meta information based on Python and Tika

Language: Python - Size: 9.77 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 3 - Forks: 2

tirthmehta/Apache-Solr-based-Web-Search-Engine

Deployment of a search engine utilizing Apache Solr, Apache Tika and spelling correction programs.

Size: 14.6 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

mrcsparker/ruby_tika_app

A ruby wrapper for the Tika jar (tika-app.jar) that extracts text in a lot of formats from PDF, xls, doc, etc files

Language: DIGITAL Command Language - Size: 415 MB - Last synced at: 19 days ago - Pushed at: over 2 years ago - Stars: 26 - Forks: 20

Keerthivasan13/CSCI572-Information_Retrieval_And_Web_Search_Engines

Search Engine projects

Language: Java - Size: 34.5 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 11 - Forks: 17

nasa-jpl-memex/GeoPath-Clustering

To cluster geo paths that travel very similar paths

Language: HTML - Size: 10.5 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 5 - Forks: 7

nasa-jpl-memex/GeoParser

Extract and Visualize location from any file

Language: JavaScript - Size: 159 MB - Last synced at: 12 months ago - Pushed at: almost 2 years ago - Stars: 53 - Forks: 23

lagenorhynque/tika

git diff settings for Microsoft Office files

Language: Shell - Size: 65.8 MB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 10 - Forks: 1

sergeyt/pandora

Small box of pandora to prototype your app with ready for use backend. This is just my compilation of different solutions occasionally applied in hackathons and challenges

Language: Go - Size: 1.81 MB - Last synced at: 6 days ago - Pushed at: 17 days ago - Stars: 26 - Forks: 8

luisbalru/Information-Retrieval

Language: Java - Size: 2.02 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

sbelassa/SMIR

smart multimodal information retrieval project

Language: HTML - Size: 26.2 MB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

Journalisme-UQAM/extractionPDF

Trois façons d'extraire le texte de fichiers PDF à l'aide de python

Language: Python - Size: 16.6 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

puthurr/tika-docker

Contains a custom tika 1.x server docker image.

Language: Dockerfile - Size: 245 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

procesaur/TExASe

Flask application for OCR and extraction of text from documents with support for repository applications

Language: Python - Size: 14.7 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

thecogworks/Cogworks.ExamineFileIndexer

An examine indexer that uses Apache Tika.

Language: C# - Size: 23.1 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 6

CogStack/CogStack-Pipeline 📦

Distributed, fault tolerant batch processing for Natural Language Applications and Search, using remote partitioning

Language: Java - Size: 25.6 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 41 - Forks: 13

schopenhauer/tikka

Flask-based file drop on sterioids, powered by Apache Tika

Language: Python - Size: 4.88 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

codingstar77/Automated-College-Result-Management-System-

It Parses PDF result provided By Pune University automatically into the Database,Generates reports and notifies student about his/her result on email

Language: Java - Size: 504 KB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 1

scotthaleen/py-tika-socket-server

Language: Clojure - Size: 133 KB - Last synced at: about 2 years ago - Pushed at: over 9 years ago - Stars: 0 - Forks: 1

Sotera/newman

Quickly analyze and explore email with advanced analytics and visualization.

Language: JavaScript - Size: 266 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 50 - Forks: 14

mixpeek/top-ocr-libraries

Most popular open source OCR libraries listed by accuracy and speed

Size: 4.88 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

krish-kunal/task

Helps to parse bank statement(PDF)

Language: Python - Size: 34.4 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

izveigor/X-MAS-HACK

Веб-приложение, которое предсказывает тип документа по его содержанию 📝

Language: TypeScript - Size: 883 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

cloudogu/spotter

Content-Type and language recognition library

Language: Java - Size: 246 KB - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 2

Anthonyive/DSCI-550-Assignment-2 📦

👨‍🦰 Large Scale Active Social Engineering Defense (ASED): Multimedia and Social Engineering

Language: HTML - Size: 154 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 6 - Forks: 2

Anthonyive/DSCI-550-Assignment-1 📦

📧 Analysis of Cyber Phishing Emails: Fraudulent Emails and Social Engineering.

Language: Jupyter Notebook - Size: 70.4 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 5 - Forks: 2

mkalus/tika-page-extractor 📦

Tika per page PDF extractor server returning content as JSON.

Language: Java - Size: 19.5 KB - Last synced at: about 2 years ago - Pushed at: about 9 years ago - Stars: 6 - Forks: 3

chrismattmann/trec-dd-polar

A dataset downloaded from the deep and scientific web across three major Polar data centers for use in research.

Language: Shell - Size: 85 KB - Last synced at: 14 days ago - Pushed at: over 7 years ago - Stars: 13 - Forks: 7

TheoGicquel/L3-IrisaParser

Parse scientific papers using python

Language: Python - Size: 249 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

kairohm/tikatree

Directory tree metadata parser using Apache Tika

Language: Python - Size: 42 KB - Last synced at: 6 months ago - Pushed at: 12 months ago - Stars: 3 - Forks: 0

sarbanandabhikkhu/DhammaChakka

Early Buddhist texts from the Tipitaka (Tripitaka). Suttas (sutras) with the Buddha's teachings on mindfulness, insight, wisdom, and meditation.

Language: JavaScript - Size: 6.31 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

nguyenhiepvan/tika_server_forever Fork of vuthaihoc/tika_server_forever

Run tika server forever with health check process

Language: Shell - Size: 76.7 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

jettdc/semester-search

Semester Search is a utility for quickly searching through downloadable class materials so that you can spend more time learning and less time clicking through dozens of links on your professors' websites.

Language: Go - Size: 66.5 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

jwo29/spring-boot-camunda

spring-boot-camunda

Language: Java - Size: 741 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

chrisbratlien/aws-bucketeer

Apache Solr/Tika index/search plus SHA256 content-based addressing for files stored into AWS S3 buckets

Language: PHP - Size: 150 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

wbicode/TikaService-Installer

A Windows Installer (MSI) for the windows service wrapper of the tika JSR 311 network server.

Language: C# - Size: 80.1 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0