I ran into two problems while trying your code.
- I uploaded a video of about two minutes that I recorded myself and asked the same kind of question (how many chairs, tables, etc. are in the room), but the answer was inaccurate. What might be causing this?
- I tried modifying the prompt in inference.py, but no matter what I ask (for example, what objects are in the room, expecting the model to give a detailed reasoning process), the answer is still only 8 tokens long and purely numeric. What is the reason for this?
Thanks!