Skip to content

Clarification on data sources #6

@pspdada

Description

@pspdada

Hello, and thank you for this great work! I have a question about how the training dataset was constructed:

“To enable our model can decide when high resolution is necessary, we collect corresponding VQA samples, including both cases requiring high-resolution images and cases adequately answered using downsampled images.”

However, I wasn’t able to locate in the paper where these samples originate from. Could you please clarify:

  1. Data source

    • Which VQA dataset(s) or external benchmarks were used to gather these examples?
  2. Selection strategy

    • What criteria or heuristics did you apply to filter cases from the original data?
    • Was this labeling done manually, via pre-defined rules, or by some automated process?

If I’ve simply overlooked this detail in the manuscript, my apologies—could you point me to the relevant section or appendix? Thank you for your time and clarification!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions