Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub / twardoch / split-markdown4gpt

A Python tool for splitting large Markdown files into smaller sections based on a specified token limit. This is particularly useful for processing large Markdown files with GPT models, as it allows the models to handle the data in manageable chunks.

JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/twardoch%2Fsplit-markdown4gpt

Stars: 17
Forks: 2
Open Issues: 1

License: apache-2.0
Language: Python
Repo Size: 79.1 KB
Dependencies: pending

Created: 12 months ago
Updated: 16 days ago
Last pushed: 16 days ago
Last synced: about 5 hours ago

Topics: data-preprocessing, gpt, gpt-3, gpt-35-turbo, gpt-35-turbo-16k, gpt-4, markdown, markdown-processing, mistletoe, natural-language-processing, nlp, openai, openai-gpt, python, split-text, summarization, text-analysis, text-processing, text-summarization, text-tokenization

Files
    Loading...
    Readme
    Loading...