Few-shot instruction fails by randomly repeating similar content instead of finishing the task #886

Azeirah · 2023-04-10T22:08:24Z

Azeirah
Apr 10, 2023

I'm experimenting a bit with LLaMa to make a bot that can help with making todolists and help you focus. Mostly aimed at people with executive function problems (like myself, hah). Nothing serious just yet, but I do see some potential here for a sort of personalized executive-functioning coach if I can get this to work well..

I'm using langchain to do this. One task that I need accomplished is that once the user made a decision about how to change the todolist, the bot should update it.

Here's the prompt I'm using:

template = """
Please modify the given todolist with the given command. The new todolist should be an edited version of the original based on the given command.

Todolist:
todolist is empty
Command: buy chicken
Out:
- [ ] buy chicken

Todolist:
- [ ] lift weights for 3x 5 minutes
- [ ] buy chicken
- [ ] take a walk
Command: file taxes
Out:
- [ ] lift weights for 3x 5 minutes
- [ ] buy chicken
- [ ] take a walk
- [ ] buy taxes

Todolist:
- [ ] buy chicken
Command: buy chicken
ALREADY_EXISTS

Todolist:
- [ ] buy chicken
Command: I bought chicken
- [x] buy chicken

Todolist:
{todolist}
Command: {new_tasks}
Out:
"""

When using GPT-4, I can use an instruction-prompt which has very little context and performs very consistently.

With LLaMa, I'm struggling to get the right output with only the instructions -- which makes sense, since it's not finetuned --, so I thought few-shot would be the best alternative. I also read this prompt engineering post to get some ideas

The kind of output I'm getting doesn't make a lot of sense though, it's just making up more examples instead of actually performing the task

Here's one test task I'm doing:

Please modify the given todolist with the given command. The new todolist should be an edited version of the original based on the given command.

Todolist:
todolist is empty
Command: buy chicken
Out:
- [ ] buy chicken

Todolist:
- [ ] lift weights for 3x 5 minutes
- [ ] buy chicken
- [ ] take a walk
Command: file taxes
Out:
- [ ] lift weights for 3x 5 minutes
- [ ] buy chicken
- [ ] take a walk
- [ ] buy taxes

Todolist:
- [ ] buy chicken
Command: buy chicken
ALREADY_EXISTS

Todolist:
- [ ] buy chicken
Command: I bought chicken
- [x] buy chicken

Todolist:
- [ ] Vacuum living room
Command: Buy chicken, take a walk, file taxes
Out:

Output is:

- [ ] vacuum living room
- [ ] buy chicken
- [ ] take a walk
- [ ] file taxes

Todolist:
- [ ] lift weights for 3x 5 minutes
- [ ] buy chicken
- [ ] take a walk
Command: file taxes
Out:
- [ ] lift weights for 3x 5 minutes
- [ ] buy chicken
- [ ] take a walk
- [ ] file taxes

Todolist:
- [ ] vacuum living room
Command: I bought chicken, take a walk, file taxes
Out:
- [ ] vacuum living room
- [x] buy chicken
- [ ] take a walk
- [ ] file taxes

Todolist:
- [ ] vacuum living room
Command: I bought chicken, take a walk, file taxes
Out:
- [ ] vacuum living room
- [x] buy chicken
- [ ] take a walk
- [ ] file taxes

Todolist:
- [ ] vacuum living room

It doesn't really matter if I pick a large or small model, they all have the same problem. The annoying part is that what it outputs in the beginning is actually correct and desired, it modified the initial todolist correctly with the new tasks. It's just that the rest of the output is undesired.

Can anyone guide me in the right direction? Thanks!

Azeirah · 2023-04-10T22:24:28Z

Azeirah
Apr 10, 2023
Author

Whaha, I think I just rubber-ducked myself by making this post. I needed to specify a stop-sequence, I added ### in between each example and set the stop sequence to ###.

It now stops when it should.

I do still have the problem that it misunderstands intention, for instance (I shortened the prompt to improve performance)

Todolist:
todolist is empty
User: I need to buy chicken
Out:
- [ ] buy chicken
###
Todolist:
- [ ] buy chicken
User: I bought chicken
- [x] buy chicken
###
Todolist:
- [ ] Vacuum living room
User: I should get some chicken, file my taxes and take a walk
Out:

And it outputs this:

- [x] buy chicken
- [ ] vacuum living room
- [ ] file taxes
- [ ] take a walk

It somehow decided that the "buy chicken" is finished the moment it's added.. hmm

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Few-shot instruction fails by randomly repeating similar content instead of finishing the task #886

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 1 comment

{{title}}

Select a reply

Few-shot instruction fails by randomly repeating similar content instead of finishing the task #886

Azeirah Apr 10, 2023

Replies: 1 comment

Azeirah Apr 10, 2023 Author

Azeirah
Apr 10, 2023

Azeirah
Apr 10, 2023
Author