QA Agent
The QA Agent verifies that changes made by other agents are correct and working. It opens pages in a headless browser, takes screenshots, checks content, and reports pass/fail with evidence.
Capabilities
- Visual verification via headless browser screenshots
- Content checking to confirm expected text appears on page
- Deployment monitoring that waits for deploys to complete before checking
- Broken link detection across a site
- Regression testing comparing before/after screenshots
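As one illustration of the capabilities above, broken link detection could be sketched roughly as follows. This is a minimal sketch, not the agent's actual implementation: `LinkCollector`, `find_broken_links`, and the `get_status` hook are hypothetical names, and the status lookup is injected so the example runs without network access.

```python
from html.parser import HTMLParser
from typing import Callable
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collects the href of every <a> tag in an HTML document."""

    def __init__(self) -> None:
        super().__init__()
        self.hrefs: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def find_broken_links(page_url: str, html: str,
                      get_status: Callable[[str], int]) -> list[str]:
    """Return absolute link URLs whose status code is 400 or higher.

    get_status is a hypothetical hook; a real agent might issue an HTTP
    request here. It is injected so the sketch needs no network.
    """
    collector = LinkCollector()
    collector.feed(html)
    broken = []
    for href in collector.hrefs:
        # Skip in-page anchors and non-HTTP schemes such as mailto:.
        if href.startswith(("#", "mailto:", "tel:", "javascript:")):
            continue
        absolute = urljoin(page_url, href)
        if get_status(absolute) >= 400:
            broken.append(absolute)
    return broken
```

Resolving each href against the page URL before checking it means relative links such as `/pricing` are tested at the address a visitor would actually reach.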
Verification Process
- Receives a verification request (usually from Content Agent)
- Waits for deployment to complete
- Opens the target URL in a headless browser
- Takes a screenshot
- Searches the page for expected content
- Reports PASS or FAIL with screenshot attached
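The steps above can be sketched as a single function. This is a hedged illustration under stated assumptions: `QAResult`, `verify`, `deploy_done`, and `open_page` are hypothetical names, and the browser and deployment checks are passed in as callables rather than tied to any particular tool.

```python
import time
from dataclasses import dataclass
from datetime import datetime
from typing import Callable


@dataclass
class QAResult:
    passed: bool
    url: str
    expected: str
    screenshot: str       # path or ID of the captured screenshot
    verified_at: datetime


def verify(url: str, expected: str,
           deploy_done: Callable[[], bool],
           open_page: Callable[[str], tuple[str, str]],
           poll_seconds: float = 5.0, max_polls: int = 60) -> QAResult:
    """Run one verification pass.

    deploy_done and open_page are hypothetical hooks: the first reports
    whether the deployment has finished, the second loads the URL in a
    headless browser and returns (visible_text, screenshot_ref).
    """
    # Wait for the deployment, giving up after max_polls checks.
    for _ in range(max_polls):
        if deploy_done():
            break
        time.sleep(poll_seconds)
    else:
        raise TimeoutError(f"deployment for {url} did not complete")

    # Open the target URL and capture a screenshot.
    text, screenshot = open_page(url)

    # Search for the expected content and record PASS or FAIL.
    return QAResult(expected in text, url, expected, screenshot, datetime.now())
```

Bounding the wait with `max_polls` keeps a stalled deployment from blocking the agent indefinitely; the timeout surfaces as an error instead of a misleading FAIL.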
Output Format
PASS:
    QA Verification: PASS
    URL: https://example.com/pricing
    Expected: "$500 per trip for up to 6 guests"
    Found: Confirmed on page
    Screenshot: [attached]
    Verified at: 2026-03-02 14:23:00 EST
FAIL:
    QA Verification: FAIL
    URL: https://example.com/pricing
    Expected: "$500 per trip for up to 6 guests"
    Found: "$500 per person"
    Screenshot: [attached]
    Issue: Content not updated. Possible deployment delay or wrong file edited.
Model Selection
QA tasks are typically straightforward (check if X appears on page Y). This makes the QA Agent a good candidate for cheaper, faster models:
- Gemini Flash — fast, cheap, good enough for simple checks
- Claude Haiku — slightly better reasoning, still cost-effective
- Claude Sonnet — use for complex visual comparisons
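One way to apply this guidance is a small lookup from task type to model. The sketch below is illustrative only: the task type names and `pick_model` are hypothetical, and the two Claude identifiers are placeholders rather than real model IDs (only `google/gemini-2.0-flash` appears in this page's configuration).

```python
# Default to the cheap, fast model for simple checks.
DEFAULT_MODEL = "google/gemini-2.0-flash"

# Hypothetical task types mapped to models. The Claude entries are
# placeholder IDs; substitute whatever your provider actually exposes.
MODEL_BY_TASK = {
    "content_check": DEFAULT_MODEL,
    "link_check": DEFAULT_MODEL,
    "ambiguous_content": "claude-haiku",     # placeholder ID
    "visual_comparison": "claude-sonnet",    # placeholder ID
}


def pick_model(task_type: str) -> str:
    """Return the model for a task type, falling back to the default."""
    return MODEL_BY_TASK.get(task_type, DEFAULT_MODEL)
```

Falling back to the cheapest model means an unrecognized task type never silently escalates cost.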
Configuration
agent:
  id: qa
  model: google/gemini-2.0-flash  # cheaper model for simple checks
  tools:
    - browser  # headless screenshots
    - web_fetch