GitHub topics: docpruner
sidmishraw/scp
A data processing pipeline for text-mining on contents extracted from PDFs using Apriori and Simplicial Complex algorithms
Language: C++ - Size: 268 MB - Last synced at: 3 months ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 2

sidmishraw/docpruner
DocPruner is an utility for pruning bad PDFs for cs 267 project and PDF processor
Language: Java - Size: 32.2 KB - Last synced at: 3 months ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 0
