~/swaylen
posts / projects / about /
All tags

Posts tagged with "uitag"

    Hybrid UI Detection: Why We Split Vision and Intelligence
    uitag combines Apple Vision's text detection with a fine-tuned YOLO model to hit 90.8% coverage on ScreenSpot-Pro — faster and cheaper than VLM-only approaches.
    Benchmarking UI Detection on ScreenSpot-Pro
    How we evaluated uitag against 1,581 annotations across 26 professional macOS applications — methodology, results, and what the numbers actually mean.
    Why Detection and Intelligence Should Be Separate Layers
    The architectural argument for splitting UI perception from UI reasoning — and what happens when you don't.
    Multi-Signal Verification for VLM UI Agents
    One detection method is a guess. Two that agree are evidence. How Leith uses signal redundancy to make UI interaction reliable.
© 2026 • ~/swaylen 🔬
Press Esc or click anywhere to close