Arsenios Diamantakos, Applied AI Engineer

Data pipeline and reporting

Job Market Intelligence Pipeline

The pipeline collects Greenhouse and Lever jobs, normalizes and validates records, applies narrow enrichment, stores history in DuckDB, and exports market summaries, quality reports, and run-over-run deltas.
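The normalize-and-validate step described above can be sketched roughly as follows. This is a minimal illustration, not the project's code: `JobRecord`, `normalize_greenhouse`, `normalize_lever`, and `validate` are hypothetical names, and the payload field names are assumptions about what each source returns.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class JobRecord:
    """Hypothetical common shape shared by records from both sources."""
    source: str
    job_id: str
    title: str
    location: Optional[str]
    url: str


def normalize_greenhouse(raw: dict) -> JobRecord:
    # Assumed payload keys: id, title, location.name, absolute_url.
    return JobRecord(
        source="greenhouse",
        job_id=str(raw["id"]),
        title=raw["title"].strip(),
        location=(raw.get("location") or {}).get("name"),
        url=raw["absolute_url"],
    )


def normalize_lever(raw: dict) -> JobRecord:
    # Assumed payload keys: id, text, categories.location, hostedUrl.
    return JobRecord(
        source="lever",
        job_id=str(raw["id"]),
        title=raw["text"].strip(),
        location=(raw.get("categories") or {}).get("location"),
        url=raw["hostedUrl"],
    )


def validate(record: JobRecord) -> List[str]:
    """Return a list of problems; an empty list means the record passes."""
    problems = []
    if not record.job_id:
        problems.append("missing job_id")
    if not record.title:
        problems.append("empty title")
    if not record.url.startswith(("http://", "https://")):
        problems.append("url is not absolute")
    return problems


# Synthetic fixture data, so no live scraping is needed.
gh_fixture = {
    "id": 101,
    "title": " Data Engineer ",
    "location": {"name": "Athens"},
    "absolute_url": "https://example.com/jobs/101",
}
lever_fixture = {
    "id": "abc",
    "text": "ML Engineer",
    "categories": {"location": "Remote"},
    "hostedUrl": "https://example.com/l/abc",
}

gh_record = normalize_greenhouse(gh_fixture)
lever_record = normalize_lever(lever_fixture)
bad_record = JobRecord("lever", "x", "", None, "/relative")
```

Returning a list of problems instead of raising lets a quality report count failures per rule without aborting the run.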

Highlights

  • Separates source collection, validation, enrichment, history, and reporting.
  • Uses fixture-backed runs so the demo can be reviewed without live scraping.
  • Tracks deltas across runs instead of only producing one-off snapshots.
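As a rough illustration of run-over-run deltas, the sketch below compares two runs held as in-memory dictionaries keyed by job ID. It is an assumption-laden stand-in: the project stores history in DuckDB, and `RunDelta`, `compute_delta`, and the sample records are hypothetical names and data made up for this example.

```python
from typing import Dict, List, NamedTuple


class RunDelta(NamedTuple):
    """Hypothetical summary of what changed between two runs."""
    added: List[str]
    removed: List[str]
    changed: List[str]


def compute_delta(previous: Dict[str, dict], current: Dict[str, dict]) -> RunDelta:
    prev_ids, curr_ids = set(previous), set(current)
    # IDs only in the current run are new postings.
    added = sorted(curr_ids - prev_ids)
    # IDs only in the previous run have been taken down.
    removed = sorted(prev_ids - curr_ids)
    # IDs in both runs whose record content differs.
    changed = sorted(k for k in prev_ids & curr_ids if previous[k] != current[k])
    return RunDelta(added, removed, changed)


# Synthetic runs keyed by a source-prefixed job ID.
run_1 = {
    "gh-1": {"title": "Data Engineer", "location": "Athens"},
    "gh-2": {"title": "ML Engineer", "location": "Remote"},
}
run_2 = {
    "gh-2": {"title": "ML Engineer", "location": "Berlin"},
    "gh-3": {"title": "Analytics Engineer", "location": "Remote"},
}

delta = compute_delta(run_1, run_2)
```

Sorting the ID lists keeps the output deterministic, which makes fixture-backed runs easy to compare in tests.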

Validation

  • 24 pytest tests
  • GitHub Actions CI
  • Package build
  • Synthetic fixture demo

Job Market Intelligence Pipeline report desktop screenshot
Job Market Intelligence Pipeline responsive report screenshot