An open API service providing repository metadata for many open source software ecosystems.

GitHub / cahlen / conversation-dataset-generator

Craft conversational datasets (JSONL format with rich metadata) using LLMs. Specify parameters manually or use a creative brief for LLM-generated arguments with automatic topic/scenario variation. Optional web search improves persona grounding. Ideal for LoRA tuning, persona training, and creative writing. Includes Hugging Face Hub upload.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cahlen%2Fconversation-dataset-generator
PURL: pkg:github/cahlen/conversation-dataset-generator
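
The two identifiers above can be derived from the repository's full name. The following is a minimal sketch, not an official client. It assumes the URL layout follows the JSON API line shown above. The helper names (`repo_api_url`, `repo_purl`, `fetch_repo_metadata`) are hypothetical, and the shape of the returned JSON is not described here, so the fetch helper simply returns the parsed response.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base URL copied from the JSON API line above (scheme kept as listed).
API_BASE = "http://repos.ecosyste.ms/api/v1"


def repo_api_url(host: str, full_name: str) -> str:
    """Build the JSON API URL; the '/' in 'owner/name' is percent-encoded as %2F."""
    return (
        f"{API_BASE}/hosts/{quote(host, safe='')}"
        f"/repositories/{quote(full_name, safe='')}"
    )


def repo_purl(full_name: str) -> str:
    """Build the package URL (PURL) in the form shown above for a GitHub repository."""
    return f"pkg:github/{full_name}"


def fetch_repo_metadata(host: str, full_name: str, timeout: float = 10.0) -> dict:
    """Fetch and parse the metadata JSON (hypothetical helper; needs network access)."""
    with urlopen(repo_api_url(host, full_name), timeout=timeout) as resp:
        return json.load(resp)


# Building the identifiers needs no network access.
url = repo_api_url("GitHub", "cahlen/conversation-dataset-generator")
purl = repo_purl("cahlen/conversation-dataset-generator")
print(url)
print(purl)
```

Calling `fetch_repo_metadata("GitHub", "cahlen/conversation-dataset-generator")` would perform the actual request; inspect the returned dictionary rather than assuming particular field names.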

Stars: 0
Forks: 0
Open issues: 0

License: MIT
Language: Python
Size: 125 KB
Dependencies parsed at: Pending

Created at: 4 months ago
Updated at: 4 months ago
Pushed at: 4 months ago
Last synced at: 4 months ago

Topics: dataset-generation, dialogue-generation, fine-tuning, huggingface, jsonl, llm, lora, nlp, peft, persona, python, synthentic-data, transformers
