An open API service providing repository metadata for many open source software ecosystems.

Topic: "jsonl"

Textualize/toolong

A terminal application to view, tail, merge, and search log files (plus JSONL).

Language: Python - Size: 205 KB - Last synced at: 20 days ago - Pushed at: 9 months ago - Stars: 3,373 - Forks: 67

neilotoole/sq

sq data wrangler

Language: Go - Size: 49.1 MB - Last synced at: 4 days ago - Pushed at: 12 days ago - Stars: 2,273 - Forks: 33

noborus/trdsql

CLI tool that can execute SQL queries on CSV, LTSV, JSON, YAML and TBLN. Can output to various formats.

Language: Go - Size: 3.31 MB - Last synced at: about 11 hours ago - Pushed at: 6 days ago - Stars: 2,090 - Forks: 77

hosuaby/inject-resources

Simple and convenient way to read content of resource in Java.

Language: Java - Size: 694 KB - Last synced at: 23 days ago - Pushed at: 9 months ago - Stars: 57 - Forks: 5

datacoon/undatum

undatum: a command-line tool for data processing. Brings CSV simplicity to JSON lines and BSON

Language: Python - Size: 4.97 MB - Last synced at: 4 months ago - Pushed at: 7 months ago - Stars: 48 - Forks: 7

umarbutler/orjsonl

A lightweight, high-performance Python library for parsing jsonl files.

Language: Python - Size: 28.3 KB - Last synced at: 17 days ago - Pushed at: over 1 year ago - Stars: 31 - Forks: 1

Salaah01/json-lineage

Tool to allow parsing large JSON files without laoding into memory. Developed in Rust with adapters in other programming langauges for easy adoption

Language: Rust - Size: 9.39 MB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 28 - Forks: 0

chand1012/sq

Convert and query JSON, JSONL, CSV, and SQLite with ease!

Language: Go - Size: 87.9 KB - Last synced at: 23 days ago - Pushed at: about 1 year ago - Stars: 24 - Forks: 0

seart-group/DL4SE

Building Training Datasets for Deep Learning Models in Software Engineering and Empirical Software Engineering Research

Language: Java - Size: 4.2 MB - Last synced at: 19 days ago - Pushed at: 10 months ago - Stars: 21 - Forks: 5

brianSalk/JSONLgenerator

an easy way to create JSONL files for fine-tuning openai models.

Language: Python - Size: 57.6 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 15 - Forks: 1

markusressel/openhasp-config-manager

A tool to manage all of your openHASP device configs in a centralized place.

Language: Python - Size: 467 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 13 - Forks: 4

gr-b/jsonltui

A fast TUI application (with optional webui) to visually navigate and inspect JSON and JSONL data. Easily localize parse errors in large JSONL files. Made with LLM fine-tuning workflows in mind.

Language: HTML - Size: 233 KB - Last synced at: 12 days ago - Pushed at: 7 months ago - Stars: 13 - Forks: 0

vearutop/flatjsonl

A tool to flatten JSONL into CSV or SQL

Language: Go - Size: 693 KB - Last synced at: 9 days ago - Pushed at: 2 months ago - Stars: 10 - Forks: 0

cdauth/json-stream-es

A streaming JSON parser/stringifier using web streams.

Language: TypeScript - Size: 1.26 MB - Last synced at: 4 days ago - Pushed at: 5 months ago - Stars: 9 - Forks: 2

joeyism/jsonl-to-conll

A simple tool to convert JSONL to CONLL

Language: Python - Size: 10.7 KB - Last synced at: 9 days ago - Pushed at: over 3 years ago - Stars: 9 - Forks: 5

ryhkml/fine-tune-forge

JSONL generator designed for models like Google PaLM 2 and OpenAI GPT-3.5

Language: TypeScript - Size: 1.71 MB - Last synced at: 17 days ago - Pushed at: 5 months ago - Stars: 7 - Forks: 1

gaborbata/todo

✔ todo list manager on the command-line inspired by todo.txt using the jsonl format

Language: Ruby - Size: 2.82 MB - Last synced at: 8 days ago - Pushed at: 5 months ago - Stars: 6 - Forks: 0

aiengineeringforgrandmas/gemini-prompt-engineer-toolkit

⚡Powered by Goggle TPUs and the latest (Aug 27, 2024) Gemini 1.5 Pro and Flash Models to generate high-quality engineered prompts, analyze text and images, and create datasets for fine-tuning AI models, helping you to become a prompt engineering pro

Language: Python - Size: 55.9 MB - Last synced at: 22 days ago - Pushed at: 7 months ago - Stars: 6 - Forks: 0

nikolaydubina/multiline-jsonl 📦

Read and write multiline JSONL in Go

Language: Go - Size: 1.33 MB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 6 - Forks: 0

hyparam/hyperparam-cli

Hyperparam local dataset viewer

Language: TypeScript - Size: 7.86 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 5 - Forks: 2

ome9ax/target-s3-jsonl

`target-s3-jsonl` is a Singer Target (https://singer.io) which intend to work with regular Singer Tap. It take the output of the tap and export it as a JSON Lines (http://jsonlines.org) files.

Language: Python - Size: 116 KB - Last synced at: 12 days ago - Pushed at: 7 months ago - Stars: 5 - Forks: 6

kazu/vfs-index

no process, indexer like a DB on VFS(virtual filesystem)

Language: Go - Size: 3.01 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 5 - Forks: 0

jimsmart/peanut

peanut is a Go package to write tagged data structs to disk in a variety of formats, simply and without ceremony.

Language: Go - Size: 113 KB - Last synced at: 19 days ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 0

petlack/rollup-plugin-jsonlines

🍣 A Rollup plugin which imports .jsonl (JSON Lines) files as JSON arrays.

Language: JavaScript - Size: 256 KB - Last synced at: 8 months ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 0

nya1/bananareporter

An easy to use CLI to generate custom reports in JSON, JSONL, CSV from multiple sources

Language: TypeScript - Size: 482 KB - Last synced at: 6 days ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 1

unfoldml/jsonl

JSON Lines

Language: Haskell - Size: 26.4 KB - Last synced at: 6 days ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 0

vasil9v/jsonl-tree

simple way to encode nested tree structures of JSON objects

Language: JavaScript - Size: 10.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 1

marvin-j97/x11-pasteboard

A micro-CLI to watch your X11 pasteboard and emit the contents as jsonl

Language: Rust - Size: 16.6 KB - Last synced at: 26 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

GRKdev/FTUP

This script helps to automate the process of preparing data for finetuning on OpenAI models, specifically GPT-3.5 and Babbage

Language: Python - Size: 40 KB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

mosuka/wikipedia-jsonl

wikipedia-jsonl is a CLI that converts Wikipedia dump XML to JSON Lines format.

Language: Go - Size: 25.2 MB - Last synced at: 24 days ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

oleewere/fluent-plugin-jsonl_array_splitter

Fluentd filter plugin for parsing multiple jsonl formatted json objects from text inputs (from one event into multiple events)

Language: Ruby - Size: 16.6 KB - Last synced at: 8 days ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0

OADA/cli

Pipeable OADA CLI client

Language: TypeScript - Size: 5.52 MB - Last synced at: 2 days ago - Pushed at: 10 days ago - Stars: 2 - Forks: 0

aminnairi/cristaline

An immutable database engine built on the Event Sourcing pattern.

Language: TypeScript - Size: 313 KB - Last synced at: 21 days ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

fumito-ito/SwiftyJSONLines

The better way to deal with JSONLines data in Swift.

Language: Swift - Size: 18.6 KB - Last synced at: 5 days ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

SinnieOnFire/jsonl-finetune

Python script to transform a set of localization .json files into a .jsonl file for LLM fine-tuning.

Language: Python - Size: 8.79 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

kevinbenabdelhak/WP-Fine-Tuning

WP Fine-tuning est un plugin qui exporte vos publications de ou des types de contenus de votre choix, dans un fichier .jsonl prêt à être importé sur OpenAI. Boostez votre IA personnalisée à partir de votre site WordPress

Language: PHP - Size: 24.4 KB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

maxlath/ndjson-2-json

minimal CLI converter from newline-delimited JSON to a JSON array

Language: JavaScript - Size: 5.86 KB - Last synced at: about 9 hours ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

ZaneH/dataset-tools

Small collection of scripts to build datasets for LLMs.

Language: Python - Size: 3.91 KB - Last synced at: 6 days ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

s3rgeym/sqldump2json

Converts SQL dump to a JSON stream.

Language: Python - Size: 952 KB - Last synced at: 7 days ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

Banyc/dfsql

SQL REPL/lib for Data Frames

Language: Rust - Size: 533 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

mrpudn/maltrends

(mirror) MyAnimeList.net manga and anime trend data.

Language: Python - Size: 81.5 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1 - Forks: 0

neverendingqs/pprint-ndjson-ui

For pretty-printing newline delimited JSON.

Language: JavaScript - Size: 2.3 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

maxlath/couchdb-bulk2 Fork of jo/couchdb-bulk

Pipe newline-delimited JSON into CouchDB

Language: JavaScript - Size: 157 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

softdev629/wine-service-finetuning

Fine Tunning Data prepare for Wine Service Chatbot App

Language: JavaScript - Size: 225 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

sile/jsonlrpc

A JSON-RPC 2.0 library that streams JSON objects in JSON Lines format.

Language: Rust - Size: 28.3 KB - Last synced at: 17 days ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

KyleKing/tail-jsonl

Tail JSONL Logs

Language: Python - Size: 4.97 MB - Last synced at: 3 days ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

Errahum/Helios-Journal

A simple journal app

Language: Python - Size: 7.81 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

guilhermejansen/adset-aiprimavia-api

API de conversão de dados xlsx em jsonl.

Size: 11.7 KB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

hychan48/jsonl-to-yaml

Visualizing OpenAI Chat jsonl format in YAML. Easier to understand the content

Language: JavaScript - Size: 141 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Sija/jsonl.cr

Crystal shard for handling JSONL (JSON Lines) parsing

Language: Crystal - Size: 5.86 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

mohamedattahri/jsonl

Go library to encode/decode JSON Lines.

Language: Go - Size: 7.81 KB - Last synced at: 13 days ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

ondata/appaltipop

ETL scripts and issue tracking for AppaltiPOP project.

Language: Jupyter Notebook - Size: 138 MB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

enesbol/Spark

It puts common records in 2 different tables and their ids from both tables in a separate table.

Language: Jupyter Notebook - Size: 949 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

zaibacu/json2jsonl

A tool to take large JSON and convert it to JSONL

Language: Rust - Size: 16.6 KB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

pearxteam/bashim-scraper

A utility written in Kotlin that scraps all the quotes from Russian bash.im website and writes them into a single JSONL file.

Language: Kotlin - Size: 62.5 KB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

abetomo/dump_to_jsonl

Generate JSONL from a `mysqldump` dump file.

Language: Go - Size: 153 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

allendema/rewe_dl

Call store APIs. Parse responses and save the data to SQL, JSON or any custom format.

Language: Python - Size: 1.18 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

cahlen/conversation-dataset-generator

Craft conversational datasets (JSONL format with rich metadata) using LLMs. Specify parameters manually or use a creative brief for LLM-generated arguments with automatic topic/scenario variation. Optional web search improves persona grounding. Ideal for LoRA tuning, persona training, and creative writing. Includes Hugging Face Hub upload.

Language: Python - Size: 125 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

mrizaln/octave-ndjson

Newline Delimited JSON (ndjson) or JSON Lines (jsonl) parser for Octave

Language: C++ - Size: 78.1 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

IliyaBadri/gologger

A minimal Go package that provides logging and console output for Go applications. It writes logs to a file while also printing them to the console.

Language: Go - Size: 10.7 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

IliaShkola/CSV_Jsonl_Converter

csv - jsonl converter python app for fine-tuning OpenAI models.

Language: Python - Size: 15.6 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

zurd46/ZurdSynthDataGen

This Electron project uses the OpenAI ChatCompletion API to generate synthetic datasets in either German (DE) or English (EN).

Language: JavaScript - Size: 110 KB - Last synced at: 27 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

rmoralespp/jsonl

A simple Python library for handling jsonlines files

Language: Python - Size: 3.36 MB - Last synced at: 18 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Siddhesh-Agarwal/openai-ft-validate

A Go-based CLI tool that verifies the structure of JSONL files for OpenAI fine-tuning.

Language: Go - Size: 10.7 KB - Last synced at: 20 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Cstrp/jsonl_generator

A specialized tool designed for creating JSONL files to train OpenAI models. This application streamlines the process of preparing your training data in the correct format.

Language: TypeScript - Size: 81.1 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

sile/jlot

Command-line tool for JSON-RPC 2.0 over JSON Lines over TCP.

Language: Rust - Size: 88.9 KB - Last synced at: 17 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Ethorbit/jsonl-augmenter

Augment jsonl files. Can be used to artificially inflate AI datasets.

Language: Python - Size: 14.6 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

iagolirapasssos/JSONL-File-Generator

This is a simple web application that allows users to create JSONL files for fine-tuning OpenAI's GPT models. The application includes fields for entering "System", "User", and "Assistant" messages, stores data in the browser's local storage to prevent data loss on refresh, and supports multiple languages (English, Portuguese, Spanish, and Hindi).

Language: JavaScript - Size: 29.3 KB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

rubeniskov/audible-scraper

Audible Scraper is a command-line tool (CLI) written in Rust that allows you to scrape information about audiobooks from Audible

Language: HTML - Size: 395 KB - Last synced at: 4 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

brianSalk/JSONLgenerator3.5

A JSONL generator to create training data for GPT3.5 and newer

Language: Python - Size: 15.6 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

yoppeh/ld2json

Carefree dataset specification

Language: C - Size: 39.1 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

leodeveloper/fine-tune-with-google-cloud

Generative AI custom data fine tuning with google cloud vertax ai

Language: Jupyter Notebook - Size: 382 KB - Last synced at: 29 days ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

itsluketwist/jldc

Easily read/write JSONLines files that include dataclasses.

Language: Python - Size: 16.6 KB - Last synced at: 19 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

loyal812/img2txt-fine-tuning-api

Image to Text & Fine Tuning & AWS & Docker & OpenAI & CI/CD

Language: Python - Size: 13.8 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

MartinKondor/jsonl

🐍 Utilities for using JSONL (JSON lines) file type with Python

Language: Python - Size: 11.7 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

tim-hub/parquet-to-json

a script to convert parquet to json

Language: Python - Size: 7.81 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

rashmishreev/jobgpt-resume-assistant

Fine-tune ChatGPT with few-shot learning for personalized resume bullet points.

Language: Python - Size: 13 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

an-dist/ReadableStream.readAsJson

Reading the stream of "JSON / JSON Lines" from ReadableStream so that it can be used in "for await...of" statement.

Language: JavaScript - Size: 10.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

kuriofoolio/JSONPlayground

A beginner's guide on how to get started with .json and .jsonl file formats, along with a sample project

Language: Jupyter Notebook - Size: 218 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

NinjaBitroom/jsonlapp

APP para criar arquivos .jsonl

Language: JavaScript - Size: 2.93 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Devadeut/Image_Captioning_and_Detection

Image Captioning is the process of generating textual description of an image. You have to create a python package for transforming images and analysing their effect on the captions of an image captioning model. We are providing you with a pretrained captioning model, all you need to do is to call the model on the image and get the outputs.

Language: Python - Size: 6.62 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ismailsoftdev/jsonl2json

jsonl2json: A Python library for converting JSONL (JSON Lines) files to standard JSON files.

Language: Python - Size: 9.77 KB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

spekulatius/jsonl-2-json-files

Converts `jsonl` files into single json files.

Language: Shell - Size: 3.91 KB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

brokeyourbike/doccano-to-automl-jsonl

Transform doccano JSONL to the format expected by AutoML

Language: Go - Size: 69.3 KB - Last synced at: 10 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

damian-anslik/jsonl

Python library for working with JSON Lines (JSON) files

Language: Python - Size: 1.95 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

takanoriyanagitani/json2strings

convert json to string array(json -> json string array)

Language: Rust - Size: 18.6 KB - Last synced at: 2 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

ChristopherChristofi/raw-tweets-to-csv-script

Simple script for searching the Twitter API and converting the resulting raw tweet data into a CSV document

Language: Shell - Size: 8.79 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

franck-mahieu/datasets-toolbox

datasets-toolbox are some scripts usefull to generate, transfom and valid large dataset files, not openable with editor because too large. datasets-toolbox provide also a ping script.

Language: JavaScript - Size: 24.4 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

takanoriyanagitani/psplit-py

Programmable text file splitter

Language: Python - Size: 17.6 KB - Last synced at: 2 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

takanoriyanagitani/jsonl2jsons

convert jsonl(ldjson, ndjson, json-stream, ...) to json(s)

Language: C - Size: 15.6 KB - Last synced at: 2 months ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

zaypen/jsonl2csv

A simple cli script to convert jsonl to csv

Language: Python - Size: 1000 Bytes - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0