Skip to content

Latest commit

 

History

History
18 lines (16 loc) · 976 Bytes

README.md

File metadata and controls

18 lines (16 loc) · 976 Bytes

👩‍🍳 Recipe Relevancy Classifier

Binary classification models to determine recipe relevancy trained using manually labelled Reddit comments from r/Cooking.
Please see the report for annotation guidelines, model methodology, metrics, and other details.

🗣️ Team: Eric, Sydney, Jake, Kristen, Yaxin

Included in this repo:

  • A writeup for the assignment (Report.pdf)
  • A proposal for a multi-class recipe problem (Proposal.pdf)
  • Some Prodigy recipes (outputs not viewable in GitHub preview) (Code.ipynb)
  • Models for experiments 1 and 2
  • All data used:
    • Unlabeled training set (homework2_train.jsonl)
    • Unlabeled evaluation set (homework2_eval.jsonl)
    • Labeled evaluation set, uncombined (hmwk2-eval-1000.jsonl)
    • Labeled evaluation set, combined (hmwk2-eval-final.jsonl)
    • Labeled training set (hmwk2-train-final.jsonl)
  • requirements.txt