Back to Directory
research

Agent Browser

Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs to navigate websites, interact with web pages, fill forms, take screenshots, test web applications, or extract information from web pages.

Installation

Run this in your terminal or add to your configuration:

# Clone into your skills directory git clone ...

About this Skill

Initial release of agent-browser: Powerful command-line browser automation. - Automates website navigation, form filling, testing, screenshots, and data extraction. - Extensive command set for browser control, interactive element referencing, state checks, and data capture. - Supports semantic locators (find by role, label, text, etc.) for robust automation. - Provides tools for video recordings, PDF creation, network interception, tab/window management, storage/cookie handling, and more. - Includes options for session isolation, headless/headed mode, and structured JSON output.