- Docker
- Bash
- npm
- bun
- Run
bun install
# Start the Postgres DB, create the Puzzles table, and run the ingest pipeline to load puzzle data
# GENERATE_TIME_LIMIT is the number of seconds the puzzle generation jobs can run before they wind down, defaults to 60
# GENERATE_THREADS is the number of threads used during puzzle generation (in addition to the main thread), defaults to 1
# PUZZLE_FILE is the file with one sudoku puzzle string per line to ingest, defaults to puzzles1.txt
GENERATE_TIME_LIMIT=60 GENERATE_THREADS=1 PUZZLE_FILE=puzzles1.txt bash start.sh
# Queries the DB for the requested number of puzzles within the desired difficulty range and then runs each
# line through the export.ts script to convert it into the compact Puzzle format used by the Sudokuru Frontend's local database
# Results will be written to exports.ts (the type declaration can be found in Puzzle.type.ts)
# PUZZLE_COUNT is the number of puzzles that will be exported; the script will fail if there are not enough puzzles in the DB
# MIN_DIFFICULTY is the minimum difficulty (inclusive) of puzzles that will be considered for export
# MAX_DIFFICULTY is the maximum difficulty (inclusive) of puzzles that will be considered for export
PUZZLE_COUNT=2 MIN_DIFFICULTY=20 MAX_DIFFICULTY=100 bash export.sh
# Exec into the DB to run SQL commands (run exit when done)
docker exec -it sudoku-postgres psql -U postgres
# Stop the Postgres DB used to store puzzle data
bash stop.sh
# Delete the Postgres DB used to store puzzle data
bash delete.sh
# Generate a difficulty report for the puzzles in the DB
cat ./difficulty.sql | docker exec -i sudoku-postgres psql -U postgres -d postgres > DifficultyReport.txt
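
As an alternative to exec'ing into the container with psql, the Puzzles table can also be queried from a short Bun script. The sketch below is only illustrative and rests on assumptions that are not part of this repo: that the container publishes port 5432 to the host, that the default postgres/postgres credentials work, and that the `pg` client has been added with `bun add pg`.

```typescript
// Illustrative only: query the Puzzles table from the host instead of exec'ing into the container.
// Assumptions (not from this repo): port 5432 is published, password is "postgres", table is named Puzzles.
import { Client } from "pg";

const client = new Client({
  host: "localhost",
  port: 5432,
  user: "postgres",
  password: "postgres", // assumed default
  database: "postgres",
});

await client.connect();
const result = await client.query("SELECT COUNT(*) AS count FROM Puzzles;");
console.log(`Puzzles in DB: ${result.rows[0].count}`);
await client.end();
```
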
- Inserts puzzle data from puzzles.sql into the Postgres DB
- For each puzzle in puzzles.txt (or until GENERATE_TIME_LIMIT is reached):
  - Checks if the puzzle is already in the database
  - If the puzzle is not already in the database then does the following:
    - Checks whether each of the GENERATE_THREADS threads has already been assigned a puzzle to generate data for
    - If all threads are assigned, waits until the youngest thread finishes
    - Once a thread is available, assigns it to generate puzzle data and puts the insert statement in inserts.sql (see the sketch after this list)
- Once all the insert statements for the new puzzles have been created they are appended to puzzles.sql
- All the new insert statements in inserts.sql are executed and inserts.sql is deleted
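
The generation loop above can be pictured roughly as the TypeScript sketch below. It is not the actual start.sh implementation: `puzzleExists` and `generateInsertStatement` are hypothetical placeholders (the real data generation runs in worker threads using the sudoku.js fork), and plain async tasks stand in for the GENERATE_THREADS worker threads.

```typescript
// Rough sketch of the generation loop; placeholder functions, not the real pipeline.
import { appendFileSync, readFileSync } from "node:fs";

const TIME_LIMIT_MS = Number(process.env.GENERATE_TIME_LIMIT ?? 60) * 1000;
const THREADS = Number(process.env.GENERATE_THREADS ?? 1);
const PUZZLE_FILE = process.env.PUZZLE_FILE ?? "puzzles1.txt";

// Placeholder: the real script checks the Puzzles table in Postgres.
async function puzzleExists(puzzle: string): Promise<boolean> {
  return false;
}

// Placeholder: the real work generates full puzzle data on a worker thread.
async function generateInsertStatement(puzzle: string): Promise<string> {
  return `INSERT INTO Puzzles (puzzle) VALUES ('${puzzle}');`;
}

const deadline = Date.now() + TIME_LIMIT_MS;
const puzzles = readFileSync(PUZZLE_FILE, "utf8").split("\n").filter(Boolean);
const inFlight = new Set<Promise<void>>();

for (const puzzle of puzzles) {
  if (Date.now() > deadline) break;         // wind down at GENERATE_TIME_LIMIT
  if (await puzzleExists(puzzle)) continue; // skip puzzles already in the DB

  // If all "thread" slots are busy, wait for one to finish before assigning.
  while (inFlight.size >= THREADS) await Promise.race(inFlight);

  const task = generateInsertStatement(puzzle).then((sql) => {
    appendFileSync("inserts.sql", sql + "\n");
  });
  inFlight.add(task);
  task.then(() => inFlight.delete(task));
}

// Wait for the remaining tasks; inserts.sql is then executed and appended to puzzles.sql.
await Promise.all(inFlight);
```
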
- export.sh builds a SQL query using the environment variables
- export.sh runs the constructed SQL query and redirects the output to temp1.csv
- export.sh cleans up the output and puts the new output in temp2.csv
- export.sh runs export.ts using bun and redirects the output to exports.ts
- export.ts parses temp2.csv and outputs the data in the desired format (see the sketch after this list)
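
A minimal sketch of the export.ts step is shown below. The column order ("puzzle,solution,difficulty") and the object shape are assumptions made for illustration; the real compact format is whatever Puzzle.type.ts declares.

```typescript
// Illustrative sketch of export.ts; the assumed CSV columns and Puzzle shape may not match the real ones.
import { readFileSync } from "node:fs";

// Assumed shape for illustration; see Puzzle.type.ts for the real declaration.
type Puzzle = { puzzle: string; solution: string; difficulty: number };

const rows = readFileSync("temp2.csv", "utf8").split("\n").filter(Boolean);

// Assume each cleaned row looks like "puzzle,solution,difficulty".
const puzzles: Puzzle[] = rows.map((row) => {
  const [puzzle, solution, difficulty] = row.split(",");
  return { puzzle, solution, difficulty: Number(difficulty) };
});

// export.sh redirects this stdout to exports.ts.
console.log(`export const puzzles = ${JSON.stringify(puzzles, null, 2)};`);
```
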
Puzzles are generated using the Sudokuru sudoku.js fork.
- puzzles1.txt: Contains 40k mostly easy puzzles. Source: generated 2k puzzles each for 42-61 givens (inclusive) by running generate.ts in this sudoku generator fork: https://github.com/Sudokuru/sudoku.js
- puzzles2.txt: Contains 80k puzzles, including a lot of moderate and hard ones. Source: a subset of https://www.kaggle.com/datasets/radcliffe/3-million-sudoku-puzzles-with-ratings?resource=download
- puzzles3.txt: Contains 20k puzzles, mostly somewhere between easy and moderate difficulty. Source: a subset of https://www.kaggle.com/datasets/rohanrao/sudoku