An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: context-curation

Daethyra/context-converter

Curate scraped HTML for large language models. Build more robust generative AI applications. Convert HTML to Markdown using Regex, BeautifulSoup4, and filter useless content with Jina Embeddings.

Language: Python - Size: 93.8 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0