🌐 Browser Automation Tools

Browser tools are available exclusively to the qa-kitten agent for automated browser testing and web scraping.

🐱 QA Kitten Required

Switch to the QA Kitten agent to use browser tools: /agent qa-kitten

Browser Control

browser_initialize(headless=True)
Start a browser session. Set headless=False to see the browser.
browser_close()
Close the browser and clean up resources.
browser_status()
Check if browser is running and get current URL.
browser_navigate(url)
Navigate to a URL. Waits for page load.
browser_go_back()
Go back in browser history.
browser_go_forward()
Go forward in browser history.
browser_reload()
Reload the current page.

Interactions

browser_click(selector)
Click an element. Supports CSS selectors and semantic locators.
browser_set_text(selector, text)
Type text into an input field. Clears existing text first.
browser_get_text(selector)
Get text content from an element.
browser_select(selector, value)
Select an option from a dropdown.
browser_check(selector, checked=True)
Check or uncheck a checkbox.

Semantic Element Location

Find elements using semantic attributes (better for testing):

browser_find_by_role(role, name=None)
Find by ARIA role: button, link, textbox, heading, etc.
browser_find_by_text(text, exact=True)
Find by visible text content.
browser_find_by_label(label)
Find form elements by their label text.
browser_find_by_placeholder(placeholder)
Find inputs by placeholder text.
browser_find_by_test_id(test_id)
Find by data-testid attribute.

Visual Analysis

browser_screenshot_analyze(question=None)
Take a screenshot and optionally analyze it with vision AI.

JavaScript Execution

browser_execute_script(script)
Execute JavaScript in the page context.

Workflow Management

browser_save_workflow(name)
Save current browser actions as a reusable workflow.
browser_load_workflow(name)
Load and execute a saved workflow.

Example Session

/agent qa-kitten

> Initialize a browser and go to example.com

🐱 Starting browser...
[browser_initialize(headless=False)]
[browser_navigate("https://example.com")]

> Click the "More information" link

[browser_find_by_text("More information")]
[browser_click(...)]

> Take a screenshot and describe what you see

[browser_screenshot_analyze("Describe the current page")]

> Close the browser

[browser_close()]

Complete Tool List

CategoryTools
Controlinitialize, close, status
Navigationnavigate, go_back, go_forward, reload
Interactionsclick, set_text, get_text, select, check
Locatorsfind_by_role, find_by_text, find_by_label, find_by_placeholder, find_by_test_id
Visualscreenshot_analyze
Scriptsexecute_script
Workflowssave_workflow, load_workflow