SequenceMatcher ratio on title + company — Python standard library, zero dependencies, fully deterministic.# on normalized title + company title_sim = ratio(a.title, b.title) company_sim = ratio(a.company, b.company) is_dup = title_sim >= 0.82 and company_sim >= 0.80
Confidence = 0.6 · title + 0.4 · company. Only cross-source pairs merge into one row (recorded in also_on); a board updating its own posting is handled by the job-id upsert.
Skills = the share of the job’s requirements the candidate has → more experience only ever helps.
Dedup, skills and the base score need no LLM — fast, free, reproducible.
Reserved for parsing CVs, refining the top matches and explaining them.
Scores and CVs come from real, reviewed experience — no fabrication.
Standard-library core, fallbacks everywhere, self-healing model choice.