Back to Directory
research
Agent Browser
Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs to navigate websites, interact with web pages, fill forms, take screenshots, test web applications, or extract information from web pages.
Installation
Run this in your terminal or add to your configuration:
# Clone into your skills directory
git clone ...About this Skill
Initial release of agent-browser: Powerful command-line browser automation.
- Automates website navigation, form filling, testing, screenshots, and data extraction.
- Extensive command set for browser control, interactive element referencing, state checks, and data capture.
- Supports semantic locators (find by role, label, text, etc.) for robust automation.
- Provides tools for video recordings, PDF creation, network interception, tab/window management, storage/cookie handling, and more.
- Includes options for session isolation, headless/headed mode, and structured JSON output.