Skip to content

Great work! May I ask if you have considered adding the bbox of the element executed by the action on the image to the data set, which is quite useful for the CV based methods. #15

Answered by xhluca
XuRui314 asked this question in Q&A
Discussion options

You must be logged in to vote

Yes the bboxes/bbox-*.json are part of the dataset (see weblinx-full on huggingface) which map an element id to coordinates. the target element id can be found in the metadata.json. If you want a tutorial you can check out the modeling/llama/eval.py or the new colab notebook: https://colab.research.google.com/github/McGill-NLP/weblinx/blob/main/examples/WebLINX_Colab_Notebook.ipynb

Replies: 1 comment 2 replies

Comment options

You must be logged in to vote
2 replies
@xhluca
Comment options

@XuRui314
Comment options

Answer selected by XuRui314
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants
Converted from issue

This discussion was converted from issue #14 on March 26, 2024 21:34.