Topic: "rl-for-llm"
DolbyUUU/Sudoku4LLM
Sudoku4LLM is a Sudoku dataset generator for training and evaluating the reasoning abilities of Large Language Models (LLMs). It offers customizable puzzles, difficulty levels, and 11 serialization formats to support structured-data reasoning and chain-of-thought (CoT) experiments.
Language: Python - Size: 29.3 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 4 - Forks: 0
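
To make the idea of serialization formats concrete, here is a minimal sketch of how one 9x9 grid might be rendered as several text encodings for an LLM prompt. This is not Sudoku4LLM's API: the function name `serialize_grid` and the format names `flat`, `rows`, and `json` are hypothetical, chosen only for illustration, and they are not claimed to match any of the repository's 11 formats.

```python
import json

def serialize_grid(grid, fmt):
    """Serialize a 9x9 Sudoku grid (0 marks an empty cell) as text.

    Hypothetical helper for illustration; format names are assumptions.
    """
    if fmt == "flat":
        # One 81-character string, '.' for empty cells.
        return "".join(str(v) if v else "." for row in grid for v in row)
    if fmt == "rows":
        # Nine lines, cells separated by spaces.
        return "\n".join(
            " ".join(str(v) if v else "." for v in row) for row in grid
        )
    if fmt == "json":
        # Nested list of integers, zeros kept as-is.
        return json.dumps(grid)
    raise ValueError(f"unknown format: {fmt}")

# Example puzzle: mostly empty, with a few given digits.
puzzle = [[0] * 9 for _ in range(9)]
puzzle[0][0] = 5
puzzle[4][4] = 7
puzzle[8][8] = 9

for name in ("flat", "rows", "json"):
    print(f"--- {name} ---")
    print(serialize_grid(puzzle, name))
```

Presenting the same puzzle in different encodings is one way to check whether a model's accuracy depends on the surface format rather than on the underlying constraints.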
