diff --git a/workers/tests/fixtures/README.md b/workers/tests/fixtures/README.md new file mode 100644 index 0000000..aab8a4f --- /dev/null +++ b/workers/tests/fixtures/README.md @@ -0,0 +1,9 @@ +# Test fixtures + +Used by `tests/test_pdf.py` and `tests/test_image.py`. Three invariants: + +1. **`born_digital.pdf`** — must contain the literal string `void-workers` when extracted via `pdftotext`. Generated from `/tmp/text.ps` then `ps2pdf`. +2. **`scanned.pdf`** — `pdftotext` must return **near-empty** output (the OCR fallback test depends on this). Generated by converting `eng_text.png` to a single-page image-only PDF: `convert -density 200 eng_text.png scanned.pdf`. +3. **`eng_text.png`** — must contain the literal string `blackflame palette`, rendered clearly enough for Tesseract to read it. Generated with `convert -size 800x200 xc:white -font DejaVu-Sans -pointsize 36 -fill black -annotate +50+100 "blackflame palette" eng_text.png`. + +Regenerate via the snippets in `../../docs/superpowers/plans/2026-06-01-void-v2-plan4-workers.md` Task B1. diff --git a/workers/tests/fixtures/born_digital.pdf b/workers/tests/fixtures/born_digital.pdf new file mode 100644 index 0000000..f5c97ad Binary files /dev/null and b/workers/tests/fixtures/born_digital.pdf differ diff --git a/workers/tests/fixtures/eng_text.png b/workers/tests/fixtures/eng_text.png new file mode 100644 index 0000000..2a5dec9 Binary files /dev/null and b/workers/tests/fixtures/eng_text.png differ diff --git a/workers/tests/fixtures/scanned.pdf b/workers/tests/fixtures/scanned.pdf new file mode 100644 index 0000000..1981493 Binary files /dev/null and b/workers/tests/fixtures/scanned.pdf differ