
[Figure: UI parsing example showing element detection and classification with interactive elements. Panels: UI VQA & Grounding, Web Interface. Caption: Examples of UI parsing for different interface types.]
Usage Example
For UI parsing, we highly recommend using the Structured Outputs API to get the UI elements and hierarchy in a structured and validated data format.
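As a rough illustration of what "structured and validated" can mean on the client side, here is a minimal Python sketch that checks a parsed-UI payload against a small schema and walks the element hierarchy. The field names (`type`, `label`, `bbox`, `interactive`, `children`), the helper functions, and the sample payload are all assumptions made up for this example; they are not the actual schema or client of the Structured Outputs API.

```python
from __future__ import annotations

import json
from dataclasses import dataclass, field


@dataclass
class UIElement:
    """One parsed UI element. Hypothetical schema, not the API's actual one."""
    type: str                                  # e.g. "button", "input", "form"
    label: str                                 # visible text or accessible name
    bbox: tuple[float, float, float, float]    # assumed (x, y, width, height)
    interactive: bool = False
    children: list[UIElement] = field(default_factory=list)


def parse_element(raw: dict) -> UIElement:
    """Validate one raw dict and build a UIElement, recursing into children."""
    bbox = raw["bbox"]
    if len(bbox) != 4 or not all(isinstance(v, (int, float)) for v in bbox):
        raise ValueError(f"bbox must be four numbers, got {bbox!r}")
    return UIElement(
        type=str(raw["type"]),
        label=str(raw.get("label", "")),
        bbox=tuple(float(v) for v in bbox),
        interactive=bool(raw.get("interactive", False)),
        children=[parse_element(c) for c in raw.get("children", [])],
    )


def interactive_elements(root: UIElement) -> list[UIElement]:
    """Depth-first list of every interactive element under root (inclusive)."""
    found = [root] if root.interactive else []
    for child in root.children:
        found.extend(interactive_elements(child))
    return found


# Made-up payload standing in for a structured response.
response_text = json.dumps({
    "type": "form",
    "label": "Login",
    "bbox": [0.1, 0.1, 0.8, 0.6],
    "children": [
        {"type": "input", "label": "Email",
         "bbox": [0.15, 0.2, 0.7, 0.1], "interactive": True},
        {"type": "text", "label": "Forgot password?",
         "bbox": [0.15, 0.35, 0.3, 0.05]},
        {"type": "button", "label": "Sign in",
         "bbox": [0.15, 0.5, 0.2, 0.1], "interactive": True},
    ],
})

root = parse_element(json.loads(response_text))
labels = [element.label for element in interactive_elements(root)]
print(labels)
```

Validating at the boundary like this means a malformed element (for example, a bounding box with three values) fails loudly before it reaches test automation code, which is the main practical benefit of requesting a structured format.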
FAQ
What is UI Parsing?
UI Parsing is the process of analyzing screenshots and application images to identify UI elements, such as buttons and other interactive components, for automated testing.
What is UI VQA & Grounding?
UI VQA & Grounding is the process of asking specific questions about the UI shown in screenshots and application images in order to identify particular elements, such as buttons and other interactive components, for automated testing. This differs from UI parsing, which returns all UI elements. In most cases, you should use UI VQA & Grounding to get more accurate results.