GitHub / kolhesamiksha / Nemo_Curator
This repository contains sample text data-preparation code that uses NeMo Curator for pre-training or synthetic data generation.
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kolhesamiksha%2FNemo_Curator
PURL: pkg:github/kolhesamiksha/Nemo_Curator
Stars: 1
Forks: 0
Open issues: 0
License: None
Language: Jupyter Notebook
Size: 138 KB
Dependencies parsed at: Pending
Created at: 9 months ago
Updated at: 7 months ago
Pushed at: 8 months ago
Last synced at: 3 months ago
Topics: curator, data-preprocessing-pipelines, finetuning-llms, generative-ai, nemo, nvidia, synthetic-dataset-generation