Data pipeline and reporting
Job Market Intelligence Pipeline
The pipeline collects Greenhouse and Lever jobs, normalizes and validates records, applies narrow enrichment, stores history in DuckDB, and exports market summaries, quality reports, and run-over-run deltas.
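As a rough illustration of the normalize-and-validate step, the sketch below maps one Greenhouse-style and one Lever-style payload onto a shared record and returns a list of validation problems. The `JobRecord` schema, the function names, and the validation rules are hypothetical and are not taken from the project. The raw field names (`absolute_url`, `hostedUrl`, `text`, `categories`) are modeled on the public job board payloads and should be checked against real responses.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class JobRecord:
    """Hypothetical common schema shared by both sources."""
    source: str
    job_id: str
    title: str
    location: str
    url: str


def _as_id(value) -> str:
    # Treat a missing id as empty so validation can flag it.
    return "" if value is None else str(value)


def normalize_greenhouse(raw: dict) -> JobRecord:
    # Assumed Greenhouse-style shape: title, location.name, absolute_url.
    location = (raw.get("location") or {}).get("name") or ""
    return JobRecord(
        source="greenhouse",
        job_id=_as_id(raw.get("id")),
        title=(raw.get("title") or "").strip(),
        location=location.strip(),
        url=raw.get("absolute_url") or "",
    )


def normalize_lever(raw: dict) -> JobRecord:
    # Assumed Lever-style shape: text, categories.location, hostedUrl.
    location = (raw.get("categories") or {}).get("location") or ""
    return JobRecord(
        source="lever",
        job_id=_as_id(raw.get("id")),
        title=(raw.get("text") or "").strip(),
        location=location.strip(),
        url=raw.get("hostedUrl") or "",
    )


def validate(record: JobRecord) -> list[str]:
    """Return a list of problems; an empty list means the record passed."""
    problems = []
    if not record.job_id:
        problems.append("missing job_id")
    if not record.title:
        problems.append("missing title")
    if not record.url.startswith(("http://", "https://")):
        problems.append("url is not absolute")
    return problems
```

Returning problems as data rather than raising keeps rejected records available for a quality report, which is one plausible way to feed the reporting stage.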
Highlights
- Separates source collection, validation, enrichment, history, and reporting.
- Uses fixture-backed runs so the demo can be reviewed without live scraping.
- Tracks deltas across runs instead of only producing one-off snapshots.
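The run-over-run delta idea can be sketched with plain dictionaries. This is a minimal illustration, not the project's implementation: the project stores history in DuckDB, while the snapshot shape used here (a dict keyed by `(source, job_id)`) and the `compute_delta` name are assumptions made for the example.

```python
def compute_delta(previous: dict, current: dict) -> dict:
    """Compare two run snapshots keyed by (source, job_id).

    Returns sorted key lists for postings that were added, removed,
    or changed between the previous run and the current run.
    """
    prev_keys = set(previous)
    curr_keys = set(current)
    return {
        "added": sorted(curr_keys - prev_keys),
        "removed": sorted(prev_keys - curr_keys),
        "changed": sorted(
            key for key in prev_keys & curr_keys
            if previous[key] != current[key]
        ),
    }


# Two hypothetical snapshots; values are job titles.
run_1 = {
    ("greenhouse", "101"): "Data Engineer",
    ("lever", "a1"): "Analyst",
}
run_2 = {
    ("greenhouse", "101"): "Senior Data Engineer",
    ("greenhouse", "102"): "ML Engineer",
}

delta = compute_delta(run_1, run_2)
# delta["added"]   == [("greenhouse", "102")]
# delta["removed"] == [("lever", "a1")]
# delta["changed"] == [("greenhouse", "101")]
```

Keying on `(source, job_id)` rather than on the title means a retitled posting shows up as changed instead of as one removal plus one addition.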
Validation
- 24 pytest tests
- GitHub Actions CI
- Package build
- Synthetic fixture demo

