Commit 042df54

bugfix
1 parent c3292c1 commit 042df54

1 file changed, +1 -1 lines changed

_modules/week-07.md

Lines changed: 1 addition & 1 deletion
@@ -35,7 +35,7 @@ The announcement can be made red for due dates as follows
 -->
 
 Oct 7
-: [Reinforcement Learning with Human Feedback: Proximal Policy Optimization (PPO) and Direct Preference Optimization (DPO)](({{site.baseurl}}assets/files/rlhf.pptx)
+: [Reinforcement Learning with Human Feedback: Proximal Policy Optimization (PPO) and Direct Preference Optimization (DPO)]({{site.baseurl}}assets/files/rlhf.pptx)
 : [Ziegler RLHF Paper]({{site.baseurl}}assets/files/ziegler.pdf), [DPO Paper]({{site.baseurl}}assets/files/dpo.pdf)
 : Emily Weiss - [Don't Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM Collaboration](https://arxiv.org/abs/2402.00367)
 : Questions by: Yifan Jiang
