Skip to content

Conversation

@abhiram050904
Copy link

Summary:
This PR strengthens the resume scoring pipeline, adds reliability under rate limits, improves project attribution (Live Demos), and cleans up developer ergonomics and docs.
Features:

  • CLI: added --force, --no-github, --max-workers to control caching, enrichment, and concurrency.
  • Projects: capture both repo_url and live_url; evaluator now sees “Repo URL” and “Live Demo”.
  • Languages: new extraction path and model wiring for spoken languages.
  • Reliability:
  • Rate-limit resilience: parsed backoff on 429s, configurable concurrency, targeted retries, and sequential fallback when parallel fails.
  • Cache validation: resume cache now stores file hash, model, and template hashes; auto re-extracts if stale/empty; --force bypasses caches.
  • Heuristic fallbacks: fills skills, projects, and languages when LLM extraction is sparse.
  • GitHub data: robust cache handling (invalid/empty cache refresh), token guidance, and rate-limit-aware behavior.

Fixes:

  • Live demo deductions addressed by explicitly surfacing Live Demo links.
  • Pydantic safety: avoid instance model_fields access when checking emptiness.
  • Models: added LanguagesSection and extended Project with repo_url/live_url; consistency improvements.
  • Docs/Config:
  • README: documented flags and usage; clarified caching behavior.
  • .env.example: sanitized placeholders only; added GITHUB_TOKEN.
  • .gitignore: ignore .pdf, cache/, .venv_win/, python-.exe, etc.

…ture repo_url+live_url and surface Live Demo; fix(pdf): rate-limit backoff + configurable concurrency; feat(cache): metadata validation; docs: README + .env.example; chore: ignore PDFs, .venv_win, python installer
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Development

Successfully merging this pull request may close these issues.

1 participant