Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub / twardoch / split-markdown4gpt
A Python tool for splitting large Markdown files into smaller sections based on a specified token limit. This is particularly useful for processing large Markdown files with GPT models, as it allows the models to handle the data in manageable chunks.
JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/twardoch%2Fsplit-markdown4gpt
Stars: 17
Forks: 2
Open Issues: 1
License: apache-2.0
Language: Python
Repo Size: 79.1 KB
Dependencies: pending
Created: 12 months ago
Updated: 16 days ago
Last pushed: 16 days ago
Last synced: about 5 hours ago
Topics: data-preprocessing, gpt, gpt-3, gpt-35-turbo, gpt-35-turbo-16k, gpt-4, markdown, markdown-processing, mistletoe, natural-language-processing, nlp, openai, openai-gpt, python, split-text, summarization, text-analysis, text-processing, text-summarization, text-tokenization