Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub / Govind-S-B / pdf-to-text-chroma-search

Python scripts that convert PDF files to text, split the text into chunks, and store the chunks' vector representations, generated with GPT4All embeddings, in a Chroma DB. A separate script queries the Chroma DB for similarity search based on user input.
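
The flow described above (chunk the text, embed each chunk, rank chunks against a query) can be sketched in plain Python. This is not the repository's code. It is a minimal illustration under stated assumptions: PDF extraction is skipped and the text is passed in as a string, `toy_embed` is a hypothetical bag-of-words stand-in for GPT4All embeddings, and an in-memory list stands in for the Chroma DB. All function names and the default chunk size and overlap are made up for the example.

```python
import math
import re
from collections import Counter


def chunk_text(text, chunk_size=200, overlap=50):
    """Split text into overlapping character windows (sizes are illustrative)."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    step = chunk_size - overlap
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]


def toy_embed(text):
    """Hypothetical stand-in embedding: word counts, NOT a GPT4All vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_index(chunks):
    """Stand-in for a vector store: a list of (chunk, vector) pairs."""
    return [(chunk, toy_embed(chunk)) for chunk in chunks]


def search(index, query, k=2):
    """Return the k chunks whose vectors are most similar to the query."""
    query_vec = toy_embed(query)
    ranked = sorted(index, key=lambda item: cosine(query_vec, item[1]), reverse=True)
    return [chunk for chunk, _ in ranked[:k]]
```

In a real setup, the embedding function and the list-based index would be replaced by an embedding model and a persistent vector store; the chunk, embed, and rank steps keep the same shape.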

JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Govind-S-B%2Fpdf-to-text-chroma-search
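
The JSON API URL above percent-encodes the slash in the `owner/name` pair as `%2F`. A small sketch of building such a URL follows. The helper name `repository_api_url` is hypothetical, and the assumption that other repositories follow the same path pattern is inferred from this one URL only.

```python
from urllib.parse import quote

# Base path taken from the URL shown on this page.
API_BASE = "https://repos.ecosyste.ms/api/v1"


def repository_api_url(host, full_name):
    """Build a repository URL, encoding '/' in full_name as %2F.

    Assumes the /hosts/{host}/repositories/{full_name} pattern seen above.
    """
    encoded_host = quote(host, safe="")
    encoded_name = quote(full_name, safe="")
    return f"{API_BASE}/hosts/{encoded_host}/repositories/{encoded_name}"


url = repository_api_url("GitHub", "Govind-S-B/pdf-to-text-chroma-search")
```

Passing `safe=""` matters here: by default `quote` leaves `/` unescaped, which would split the repository name into two path segments.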

Stars: 0
Forks: 0
Open Issues: 0

License: None
Language: Python
Repo Size: 0 Bytes
Dependencies: pending

Created: 7 months ago
Updated: 7 months ago
Last pushed: 7 months ago
Last synced: 7 months ago

Topics: chromadb, pdf-processing, similarity-search, text-extraction, vector-embeddings
