Execution model
Browser tasks run inside your runtime context and can be invoked through:- Runtime browser tools
- Cloud browser-related routes/proxies
- Delegated subagents focused on UI workflows
Core capabilities
- Navigate and wait for dynamic content
- Click, type, select, and submit forms
- Read page content and extract structured data
- Capture screenshots and visual state
- Continue multi-step sessions with local cookies/session context
Typical workflows
- Lead generation research
- Form completion automation
- Competitive tracking and periodic checks
- Support/debug reproduction with screenshot evidence
Browser + skills pattern
A high-leverage pattern is:- Run a browser flow manually once
- Capture it as a reusable skill
- Let the agent call that skill for future runs
- Refine skill instructions when pages change
Safety boundaries
- Browser actions run in user sandbox context
- Tool policies can constrain execution behavior
- Credentials stay scoped to the user runtime
- Access is mediated by authenticated cloud session routes
Related pages
Skills
Turn browser routines into reusable automation skills.
Learning loop
Improve browser behavior over time through memory and iteration.

