Topic: "jsonl"
Textualize/toolong
A terminal application to view, tail, merge, and search log files (plus JSONL).
Language: Python - Size: 205 KB - Last synced at: 20 days ago - Pushed at: 9 months ago - Stars: 3,373 - Forks: 67

neilotoole/sq
sq data wrangler
Language: Go - Size: 49.1 MB - Last synced at: 4 days ago - Pushed at: 12 days ago - Stars: 2,273 - Forks: 33

noborus/trdsql
CLI tool that can execute SQL queries on CSV, LTSV, JSON, YAML and TBLN. Can output to various formats.
Language: Go - Size: 3.31 MB - Last synced at: about 11 hours ago - Pushed at: 6 days ago - Stars: 2,090 - Forks: 77

hosuaby/inject-resources
Simple and convenient way to read content of resource in Java.
Language: Java - Size: 694 KB - Last synced at: 23 days ago - Pushed at: 9 months ago - Stars: 57 - Forks: 5

datacoon/undatum
undatum: a command-line tool for data processing. Brings CSV simplicity to JSON lines and BSON
Language: Python - Size: 4.97 MB - Last synced at: 4 months ago - Pushed at: 7 months ago - Stars: 48 - Forks: 7

umarbutler/orjsonl
A lightweight, high-performance Python library for parsing jsonl files.
Language: Python - Size: 28.3 KB - Last synced at: 17 days ago - Pushed at: over 1 year ago - Stars: 31 - Forks: 1

Salaah01/json-lineage
Tool to allow parsing large JSON files without loading them into memory. Developed in Rust with adapters in other programming languages for easy adoption.
Language: Rust - Size: 9.39 MB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 28 - Forks: 0

chand1012/sq
Convert and query JSON, JSONL, CSV, and SQLite with ease!
Language: Go - Size: 87.9 KB - Last synced at: 23 days ago - Pushed at: about 1 year ago - Stars: 24 - Forks: 0

seart-group/DL4SE
Building Training Datasets for Deep Learning Models in Software Engineering and Empirical Software Engineering Research
Language: Java - Size: 4.2 MB - Last synced at: 19 days ago - Pushed at: 10 months ago - Stars: 21 - Forks: 5

brianSalk/JSONLgenerator
An easy way to create JSONL files for fine-tuning OpenAI models.
Language: Python - Size: 57.6 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 15 - Forks: 1

markusressel/openhasp-config-manager
A tool to manage all of your openHASP device configs in a centralized place.
Language: Python - Size: 467 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 13 - Forks: 4

gr-b/jsonltui
A fast TUI application (with optional webui) to visually navigate and inspect JSON and JSONL data. Easily localize parse errors in large JSONL files. Made with LLM fine-tuning workflows in mind.
Language: HTML - Size: 233 KB - Last synced at: 12 days ago - Pushed at: 7 months ago - Stars: 13 - Forks: 0

vearutop/flatjsonl
A tool to flatten JSONL into CSV or SQL
Language: Go - Size: 693 KB - Last synced at: 9 days ago - Pushed at: 2 months ago - Stars: 10 - Forks: 0

cdauth/json-stream-es
A streaming JSON parser/stringifier using web streams.
Language: TypeScript - Size: 1.26 MB - Last synced at: 4 days ago - Pushed at: 5 months ago - Stars: 9 - Forks: 2

joeyism/jsonl-to-conll
A simple tool to convert JSONL to CONLL
Language: Python - Size: 10.7 KB - Last synced at: 9 days ago - Pushed at: over 3 years ago - Stars: 9 - Forks: 5

ryhkml/fine-tune-forge
JSONL generator designed for models like Google PaLM 2 and OpenAI GPT-3.5
Language: TypeScript - Size: 1.71 MB - Last synced at: 17 days ago - Pushed at: 5 months ago - Stars: 7 - Forks: 1

gaborbata/todo
✔ todo list manager on the command-line inspired by todo.txt using the jsonl format
Language: Ruby - Size: 2.82 MB - Last synced at: 8 days ago - Pushed at: 5 months ago - Stars: 6 - Forks: 0

aiengineeringforgrandmas/gemini-prompt-engineer-toolkit
⚡Powered by Google TPUs and the latest (Aug 27, 2024) Gemini 1.5 Pro and Flash models to generate high-quality engineered prompts, analyze text and images, and create datasets for fine-tuning AI models, helping you become a prompt engineering pro
Language: Python - Size: 55.9 MB - Last synced at: 22 days ago - Pushed at: 7 months ago - Stars: 6 - Forks: 0

nikolaydubina/multiline-jsonl 📦
Read and write multiline JSONL in Go
Language: Go - Size: 1.33 MB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 6 - Forks: 0

hyparam/hyperparam-cli
Hyperparam local dataset viewer
Language: TypeScript - Size: 7.86 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 5 - Forks: 2

ome9ax/target-s3-jsonl
`target-s3-jsonl` is a Singer Target (https://singer.io) intended to work with any regular Singer Tap. It takes the output of the tap and exports it as JSON Lines (http://jsonlines.org) files.
Language: Python - Size: 116 KB - Last synced at: 12 days ago - Pushed at: 7 months ago - Stars: 5 - Forks: 6

kazu/vfs-index
no process, indexer like a DB on VFS (virtual filesystem)
Language: Go - Size: 3.01 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 5 - Forks: 0

jimsmart/peanut
peanut is a Go package to write tagged data structs to disk in a variety of formats, simply and without ceremony.
Language: Go - Size: 113 KB - Last synced at: 19 days ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 0

petlack/rollup-plugin-jsonlines
🍣 A Rollup plugin which imports .jsonl (JSON Lines) files as JSON arrays.
Language: JavaScript - Size: 256 KB - Last synced at: 8 months ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 0

nya1/bananareporter
An easy to use CLI to generate custom reports in JSON, JSONL, CSV from multiple sources
Language: TypeScript - Size: 482 KB - Last synced at: 6 days ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 1

unfoldml/jsonl
JSON Lines
Language: Haskell - Size: 26.4 KB - Last synced at: 6 days ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 0

vasil9v/jsonl-tree
simple way to encode nested tree structures of JSON objects
Language: JavaScript - Size: 10.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 1

marvin-j97/x11-pasteboard
A micro-CLI to watch your X11 pasteboard and emit the contents as jsonl
Language: Rust - Size: 16.6 KB - Last synced at: 26 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

GRKdev/FTUP
This script helps to automate the process of preparing data for finetuning on OpenAI models, specifically GPT-3.5 and Babbage
Language: Python - Size: 40 KB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

mosuka/wikipedia-jsonl
wikipedia-jsonl is a CLI that converts Wikipedia dump XML to JSON Lines format.
Language: Go - Size: 25.2 MB - Last synced at: 24 days ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

oleewere/fluent-plugin-jsonl_array_splitter
Fluentd filter plugin for parsing multiple jsonl formatted json objects from text inputs (from one event into multiple events)
Language: Ruby - Size: 16.6 KB - Last synced at: 8 days ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0

OADA/cli
Pipeable OADA CLI client
Language: TypeScript - Size: 5.52 MB - Last synced at: 2 days ago - Pushed at: 10 days ago - Stars: 2 - Forks: 0

aminnairi/cristaline
An immutable database engine built on the Event Sourcing pattern.
Language: TypeScript - Size: 313 KB - Last synced at: 21 days ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

fumito-ito/SwiftyJSONLines
The better way to deal with JSONLines data in Swift.
Language: Swift - Size: 18.6 KB - Last synced at: 5 days ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

SinnieOnFire/jsonl-finetune
Python script to transform a set of localization .json files into a .jsonl file for LLM fine-tuning.
Language: Python - Size: 8.79 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

kevinbenabdelhak/WP-Fine-Tuning
WP Fine-tuning is a plugin that exports your posts, or the content types of your choice, to a .jsonl file ready to be imported into OpenAI. Boost your custom AI from your WordPress site
Language: PHP - Size: 24.4 KB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

maxlath/ndjson-2-json
minimal CLI converter from newline-delimited JSON to a JSON array
Language: JavaScript - Size: 5.86 KB - Last synced at: about 9 hours ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

ZaneH/dataset-tools
Small collection of scripts to build datasets for LLMs.
Language: Python - Size: 3.91 KB - Last synced at: 6 days ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

s3rgeym/sqldump2json
Converts SQL dump to a JSON stream.
Language: Python - Size: 952 KB - Last synced at: 7 days ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

Banyc/dfsql
SQL REPL/lib for Data Frames
Language: Rust - Size: 533 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

mrpudn/maltrends
(mirror) MyAnimeList.net manga and anime trend data.
Language: Python - Size: 81.5 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1 - Forks: 0

neverendingqs/pprint-ndjson-ui
For pretty-printing newline delimited JSON.
Language: JavaScript - Size: 2.3 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

maxlath/couchdb-bulk2 (fork of jo/couchdb-bulk)
Pipe newline-delimited JSON into CouchDB
Language: JavaScript - Size: 157 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

softdev629/wine-service-finetuning
Fine-tuning data preparation for a Wine Service Chatbot App
Language: JavaScript - Size: 225 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

sile/jsonlrpc
A JSON-RPC 2.0 library that streams JSON objects in JSON Lines format.
Language: Rust - Size: 28.3 KB - Last synced at: 17 days ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

KyleKing/tail-jsonl
Tail JSONL Logs
Language: Python - Size: 4.97 MB - Last synced at: 3 days ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

Errahum/Helios-Journal
A simple journal app
Language: Python - Size: 7.81 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

guilhermejansen/adset-aiprimavia-api
API for converting xlsx data to jsonl.
Size: 11.7 KB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

hychan48/jsonl-to-yaml
Visualizing OpenAI Chat jsonl format in YAML. Easier to understand the content
Language: JavaScript - Size: 141 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Sija/jsonl.cr
Crystal shard for handling JSONL (JSON Lines) parsing
Language: Crystal - Size: 5.86 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

mohamedattahri/jsonl
Go library to encode/decode JSON Lines.
Language: Go - Size: 7.81 KB - Last synced at: 13 days ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

ondata/appaltipop
ETL scripts and issue tracking for AppaltiPOP project.
Language: Jupyter Notebook - Size: 138 MB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

enesbol/Spark
It puts common records in 2 different tables and their ids from both tables in a separate table.
Language: Jupyter Notebook - Size: 949 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

zaibacu/json2jsonl
A tool to take large JSON and convert it to JSONL
Language: Rust - Size: 16.6 KB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

pearxteam/bashim-scraper
A utility written in Kotlin that scrapes all the quotes from the Russian bash.im website and writes them into a single JSONL file.
Language: Kotlin - Size: 62.5 KB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

abetomo/dump_to_jsonl
Generate JSONL from a `mysqldump` dump file.
Language: Go - Size: 153 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

allendema/rewe_dl
Call store APIs. Parse responses and save the data to SQL, JSON or any custom format.
Language: Python - Size: 1.18 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

cahlen/conversation-dataset-generator
Craft conversational datasets (JSONL format with rich metadata) using LLMs. Specify parameters manually or use a creative brief for LLM-generated arguments with automatic topic/scenario variation. Optional web search improves persona grounding. Ideal for LoRA tuning, persona training, and creative writing. Includes Hugging Face Hub upload.
Language: Python - Size: 125 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

mrizaln/octave-ndjson
Newline Delimited JSON (ndjson) or JSON Lines (jsonl) parser for Octave
Language: C++ - Size: 78.1 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

IliyaBadri/gologger
A minimal Go package that provides logging and console output for Go applications. It writes logs to a file while also printing them to the console.
Language: Go - Size: 10.7 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

IliaShkola/CSV_Jsonl_Converter
csv - jsonl converter python app for fine-tuning OpenAI models.
Language: Python - Size: 15.6 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

zurd46/ZurdSynthDataGen
This Electron project uses the OpenAI ChatCompletion API to generate synthetic datasets in either German (DE) or English (EN).
Language: JavaScript - Size: 110 KB - Last synced at: 27 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

rmoralespp/jsonl
A simple Python library for handling jsonlines files
Language: Python - Size: 3.36 MB - Last synced at: 18 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Siddhesh-Agarwal/openai-ft-validate
A Go-based CLI tool that verifies the structure of JSONL files for OpenAI fine-tuning.
Language: Go - Size: 10.7 KB - Last synced at: 20 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Cstrp/jsonl_generator
A specialized tool designed for creating JSONL files to train OpenAI models. This application streamlines the process of preparing your training data in the correct format.
Language: TypeScript - Size: 81.1 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

sile/jlot
Command-line tool for JSON-RPC 2.0 over JSON Lines over TCP.
Language: Rust - Size: 88.9 KB - Last synced at: 17 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Ethorbit/jsonl-augmenter
Augment jsonl files. Can be used to artificially inflate AI datasets.
Language: Python - Size: 14.6 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

iagolirapasssos/JSONL-File-Generator
This is a simple web application that allows users to create JSONL files for fine-tuning OpenAI's GPT models. The application includes fields for entering "System", "User", and "Assistant" messages, stores data in the browser's local storage to prevent data loss on refresh, and supports multiple languages (English, Portuguese, Spanish, and Hindi).
Language: JavaScript - Size: 29.3 KB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

rubeniskov/audible-scraper
Audible Scraper is a command-line tool (CLI) written in Rust that allows you to scrape information about audiobooks from Audible
Language: HTML - Size: 395 KB - Last synced at: 4 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

brianSalk/JSONLgenerator3.5
A JSONL generator to create training data for GPT3.5 and newer
Language: Python - Size: 15.6 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

yoppeh/ld2json
Carefree dataset specification
Language: C - Size: 39.1 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

leodeveloper/fine-tune-with-google-cloud
Generative AI custom data fine-tuning with Google Cloud Vertex AI
Language: Jupyter Notebook - Size: 382 KB - Last synced at: 29 days ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

itsluketwist/jldc
Easily read/write JSONLines files that include dataclasses.
Language: Python - Size: 16.6 KB - Last synced at: 19 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

loyal812/img2txt-fine-tuning-api
Image to Text & Fine Tuning & AWS & Docker & OpenAI & CI/CD
Language: Python - Size: 13.8 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

MartinKondor/jsonl
🐍 Utilities for using JSONL (JSON lines) file type with Python
Language: Python - Size: 11.7 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

tim-hub/parquet-to-json
a script to convert parquet to json
Language: Python - Size: 7.81 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

rashmishreev/jobgpt-resume-assistant
Fine-tune ChatGPT with few-shot learning for personalized resume bullet points.
Language: Python - Size: 13 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

an-dist/ReadableStream.readAsJson
Reading the stream of "JSON / JSON Lines" from ReadableStream so that it can be used in "for await...of" statement.
Language: JavaScript - Size: 10.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

kuriofoolio/JSONPlayground
A beginner's guide on how to get started with .json and .jsonl file formats, along with a sample project
Language: Jupyter Notebook - Size: 218 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

NinjaBitroom/jsonlapp
App for creating .jsonl files
Language: JavaScript - Size: 2.93 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Devadeut/Image_Captioning_and_Detection
Image Captioning is the process of generating textual description of an image. You have to create a python package for transforming images and analysing their effect on the captions of an image captioning model. We are providing you with a pretrained captioning model, all you need to do is to call the model on the image and get the outputs.
Language: Python - Size: 6.62 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ismailsoftdev/jsonl2json
jsonl2json: A Python library for converting JSONL (JSON Lines) files to standard JSON files.
Language: Python - Size: 9.77 KB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

spekulatius/jsonl-2-json-files
Converts `jsonl` files into single json files.
Language: Shell - Size: 3.91 KB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

brokeyourbike/doccano-to-automl-jsonl
Transform doccano JSONL to the format expected by AutoML
Language: Go - Size: 69.3 KB - Last synced at: 10 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

damian-anslik/jsonl
Python library for working with JSON Lines (JSONL) files
Language: Python - Size: 1.95 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

takanoriyanagitani/json2strings
convert json to string array(json -> json string array)
Language: Rust - Size: 18.6 KB - Last synced at: 2 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

ChristopherChristofi/raw-tweets-to-csv-script
Simple script for searching the Twitter API and converting the resulting raw tweet data into a CSV document
Language: Shell - Size: 8.79 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

franck-mahieu/datasets-toolbox
datasets-toolbox is a set of scripts useful for generating, transforming, and validating large dataset files that are too large to open in an editor. datasets-toolbox also provides a ping script.
Language: JavaScript - Size: 24.4 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

takanoriyanagitani/psplit-py
Programmable text file splitter
Language: Python - Size: 17.6 KB - Last synced at: 2 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

takanoriyanagitani/jsonl2jsons
convert jsonl(ldjson, ndjson, json-stream, ...) to json(s)
Language: C - Size: 15.6 KB - Last synced at: 2 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

zaypen/jsonl2csv
A simple cli script to convert jsonl to csv
Language: Python - Size: 1000 Bytes - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0
