Add an assertion after filtering val_path out of parquet_paths for the
"train" split, so an empty list fails fast instead of spinning in a
silent infinite loop. Also remove the stray article "a" in the README
("a three files" → "three files").

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>