Mark web pages for use with vision-language models
som prompt gemini operator cua claude playwright prompt-engineering llms vision-language-model gpt4v qwen-vl gpt4o set-of-mark computer-use computer-using-agent
-
Updated
Jan 7, 2025 - TypeScript