FENMaster – AI Chess Staging

FENMaster is the corpus compiler for the AI Chess lab. It is responsible for converting, filtering, validating, sharding, and replaying large chess-position archives without losing the metadata that later analysis depends on.

That makes it more than a converter. It is the part of the stack that turns raw position files and frontier exports into something that can be trusted, queried, and stored economically.

Alpha status: Work in progress. FENMaster is still an alpha corpus toolchain. Formats, switches, and validation surfaces may evolve as the pipeline hardens. Run it at your own risk.

Current Release

The current downloadable build is a Windows x64 release binary:

Executable	Build	Purpose	Downloads
`FENMaster.exe`	`0.1.0 build 001`	Convert, filter, validate, shard, and benchmark large position corpora.	exe / sha256

Headline Results

Area	Current result
FDF d8 external validation	PASS on June 19, 2026. The raw external depth-8 FEN occurrence corpus and the ANS FDF frontier both reduced to `84,998,978,956` logical records and `988,187,354` unique canonical positions, with `0` missing keys, `0` multiplicity mismatches, `0` parse errors, and `0` overflow errors. Full compare time: `22,441.154s`.
FDF d9 frontier preflight	Both depth-9 FDF exports decoded successfully. `zstd-frame` and `nvcomp-ans` each summed to `2,439,530,234,167` multiplicity. The physical row counts were `41,462,480,920` for `zstd-frame` and `22,897,357,589` for `nvcomp-ans`. External depth-9 comparison is still blocked by corrupt visible raw d9 corpus copies.
ANS FDF decode smoke	After the nvCOMP teardown fix, the depth-8 ANS preflight passed with multiplicity total `84,998,978,956`, elapsed time `368.346s`, and `0` stderr bytes.
Canonical CCCF replay	Canonical CCCF sink replay verified `84,998,978,956` logical records collapsing to `988,187,354` unique at about `24.60` bits per stored position.
Frontier-aware filtering	Frontier multiplicity filtering is validated, including EPD output that preserves requested multiplicity when the target format can store it.

What It Does

The current codebase covers four jobs that matter to the rest of the lab:

convert between position formats and archive layouts
filter large corpora before they become downstream noise
validate shard routing and round-trips so the output can be trusted
materialize canonical corpora for replay, export, and later statistics work

Supported Formats

Direction	Formats
Source	`fen`, `epd`, `pgn`, `sfen`, `cccf`, `frontier`
Output	`fen`, `epd`, `sfen`, `cccf`

The frontier source support matters because it lets GPU-exported perft corpora flow into the same toolchain as ordinary position archives. Current frontier support includes the older packed-v1 format and GPUPerft FDF v1 catalogs. Portable builds read zstd-frame FDF data. CUDA/nvCOMP builds also read nvcomp-ans FDF data through the same reader contract.

FDF decoding currently supports byteplane and xor-byteplane transforms, with nbm40 and nb-state-v2 logical records. nb32 is rejected for normal frontier input because it does not carry multiplicity.

Filtering

Filtering is a real feature here, not an afterthought.

The branch history and current code together show three useful classes of filtering:

structural filters such as side to move, castling rights, piece placement, empty squares, and piece counts
numeric filters over fields such as hm, fm, depth, and mult
reproducible subset generation for benchmark corpora through sample-fens

Examples from the documented filter language include:

stm=b
K@e1,(ep!=none|castle=none)
count([Pp])>=10
depth=8

The more important detail is what happens after the filter matches. Current filter-output work preserves requested values when the destination format can store them. For example, if a filter depends on depth or mult, EPD output can retain those fields so the saved records still carry the information the filter used.

That behavior is tested explicitly in the repo for frontier multiplicity filtering and EPD output.

Validation And Determinism

The repo treats validation as part of the product, not as cleanup work:

verify-shards decodes output and checks shard routing
invalid input lines are rejected cleanly instead of poisoning the whole run
reject logs are supported
deterministic sharding is based on the first byte of a canonical occupancy map
staged and parallel runs emit manifests and runtime telemetry

That is why the throughput numbers matter. The fast runs are paired with validation instead of being presented in isolation.

FDF Frontier Validation

The most recent FENMaster work focuses on making GPUPerft FDF frontier exports auditable at full corpus scale.

On June 19, 2026, FENMaster compared a freshly regenerated depth-8 ANS FDF frontier export against the external raw depth-8 FEN occurrence corpus. The raw corpus is not deduplicated, while the FDF frontier is multiplicity-compressed. FENMaster reduced both sources to the same comparison key:

FIDE position identity -> multiplicity count

The result was a full PASS:

Metric	Value
External occurrence total	`84,998,978,956`
Frontier multiplicity total	`84,998,978,956`
External unique canonical positions	`988,187,354`
Frontier unique canonical positions	`988,187,354`
Missing external keys	`0`
Missing frontier keys	`0`
Multiplicity mismatches	`0`
Parse errors	`0`
Overflow errors	`0`
Elapsed seconds	`22,441.154`

The same FDF reader also passed depth-9 frontier preflights for both zstd-frame and nvcomp-ans, each summing to 2,439,530,234,167 total multiplicity. The remaining depth-9 external validation step is waiting on a clean raw depth-9 FEN occurrence corpus, because the visible copies tested in June 2026 failed direct zstd integrity checks.

CCCF And Canonical Storage

CCCF is one of the most important reasons FENMaster exists.

The canonical-storage problem is not just “compress the file.” It is:

collapse duplicates without losing multiplicity
preserve exact logical totals
make the result deterministic enough to replay and verify
store very large position corpora, including full perft-derived corpora, at densities that are worth keeping

The repo history is explicit about this. Canonical CCCF work added multiplicity-aware verification, canonical deduplication, universal sink stages, and replay tooling for large sort-input datasets.

Two measured checkpoints matter most:

On the validated 512k matrix, canonical fen -> cccf ran at 18.037M FEN/s and stored the result at 5.783 bits per input FEN.
On the larger canonical sink replay workload, the pipeline verified 84,998,978,956 logical records collapsing to 988,187,354 unique at about 24.60 bits per stored position.

That is the storage story that turns GPU frontier export and large corpus work into something sustainable.

Routine Dataset And Sampling

The sample-fens command exists for a reason. Large benchmark claims are only useful if the input set can be recreated.

The current routine dataset flow generates reproducible .fen.zst subsets from the depth-8 corpus. That matters because it gives the repo a standard workload for performance checks instead of relying on ad hoc samples that cannot be compared later.

Command-Line Shape

Convert

N:\Chess\repos\FENmaster\build\bin\Release\FENMaster.exe convert `
  G:\FENmaster-work\inputs\depth8_dev_16k `
  G:\FENmaster-work\outputs\depth8_dev_16k_staged `
  --staged `
  --max-threads 128 `
  --shards 256 `
  --compression-level 8 `
  --records-per-flush 4096 `
  --verify-output

Verify

N:\Chess\repos\FENmaster\build\bin\Release\FENMaster.exe verify-shards `
  G:\FENmaster-work\outputs\depth8_dev_16k_staged `
  --shards 256

Sample A Reproducible Benchmark Corpus

N:\Chess\repos\FENmaster\build\bin\Release\FENMaster.exe sample-fens `
  R:\FEN\depth_8_parts `
  G:\FENmaster-work\inputs\depth8_routine_8m `
  --max-lines-per-file 8388608 `
  --compression-level 8 `
  --line-batch-size 8192 `
  --manifest G:\FENmaster-work\manifests\depth8_routine_8m.txt

Compare An FDF Frontier Against Raw FEN Occurrences

FENMaster.exe compare-frontier-external `
  --external-fen-dir S:\depth_8_parts `
  --frontier G:\gp_fdf_validation\d8_external_validation_fresh_20260618\ans\depth_8 `
  --depth 8 `
  --expected-total 84998978956 `
  --scratch D:\fm_fdf_val\d8_scratch_ans_20260618_210523 `
  --out D:\fm_fdf_val\d8_external_ans_20260618_210523 `
  --buckets 8192 `
  --flush-records 4194304 `
  --line-batch-size 65536 `
  --external-workers 32

Why It Matters

Without FENMaster, the rest of the lab would keep generating more data than it can reliably organize.

With it, the workflow becomes much cleaner:

exact GPU work can export multiplicity-preserving corpora
those corpora can be filtered, canonicalized, and replayed
benchmark datasets can be reproduced
later statistics and distributed analysis can start from verified artifacts instead of one-off files

That is why FENMaster belongs in the software section alongside the executables that generate the data in the first place.