This topic describes the capabilities of MCP Tools and the environments they support. For more information, see the image (sandbox environment).
AgentBay MCP Tools Overview
MCP Tool Support Matrix
Capability | Tool Name | Description | BrowserUse | ComputerUse Linux | ComputerUse Windows | MobileUse | CodeSpace |
Create sandbox environment |
| Create a new AgentBay sandbox environment and return the sandbox ID. | Supported | Supported | Supported | Supported | Not supported |
Get sandbox access URL |
| Get the runtime access URL for the sandbox. Each URL expires after use. | Supported | Supported | Supported | Supported | Not supported |
Take full-screen screenshot and share |
| Capture a full-screen screenshot of the current display and return a shareable URL. | Supported | Supported | Supported | Supported | Not supported |
Release sandbox resources |
| Release sandbox resources after task completion. | Supported | Supported | Supported | Supported | Not supported |
Hover over browser element |
| Hover over a page element. | Supported | Not supported | Not supported | Not supported | Not supported |
Drag browser element |
| Perform a drag-and-drop operation between two elements. | Supported | Not supported | Not supported | Not supported | Not supported |
Navigate back in browser |
| Navigate to the previous page. | Supported | Not supported | Not supported | Not supported | Not supported |
Get network requests |
| Return all network requests after page load. | Supported | Not supported | Not supported | Not supported | Not supported |
Upload file in browser |
| Upload one or more files. | Supported | Not supported | Not supported | Not supported | Not supported |
Press key in browser |
| Press a key on the keyboard. | Supported | Not supported | Not supported | Not supported | Not supported |
Batch form filling |
| Fill multiple form fields at once. | Supported | Not supported | Not supported | Not supported | Not supported |
Type text in browser |
| Type text into an editable element. | Supported | Not supported | Not supported | Not supported | Not supported |
Resize browser window |
| Resize the browser window. | Supported | Not supported | Not supported | Not supported | Not supported |
Capture page snapshot |
| Capture an accessibility snapshot of the current page. This is better than screenshots for performing actions. | Supported | Not supported | Not supported | Not supported | Not supported |
Close current page |
| Close the current page. | Supported | Not supported | Not supported | Not supported | Not supported |
Manage browser tabs |
| List, create, close, or switch browser tabs. | Supported | Not supported | Not supported | Not supported | Not supported |
Get console messages |
| Return all console messages. | Supported | Not supported | Not supported | Not supported | Not supported |
Install browser |
| Install the browser specified in the configuration. | Supported | Not supported | Not supported | Not supported | Not supported |
Evaluate JavaScript expression |
| Evaluate a JavaScript expression on the page or an element. | Supported | Not supported | Not supported | Not supported | Not supported |
Navigate to URL in browser |
| Navigate to a specified URL. | Supported | Not supported | Not supported | Not supported | Not supported |
Take page screenshot |
| Take a screenshot of the current page. | Supported | Not supported | Not supported | Not supported | Not supported |
Click in browser |
| Perform a click action on a web page. | Supported | Not supported | Not supported | Not supported | Not supported |
Handle pop-up dialog |
| Handle pop-up dialogs. | Supported | Not supported | Not supported | Not supported | Not supported |
Wait for page content |
| Wait for text to appear or disappear, or wait for a specified time. | Supported | Not supported | Not supported | Not supported | Not supported |
Select option from dropdown |
| Select an option from a dropdown menu. | Supported | Not supported | Not supported | Not supported | Not supported |
Asynchronously observe UI elements |
| Asynchronously find and describe interactive UI elements on a page. | Supported | Not supported | Not supported | Not supported | Not supported |
Get extraction task result |
| Get the result of an asynchronous extraction task. | Supported | Not supported | Not supported | Not supported | Not supported |
Create planning task |
| Create a planning task that breaks down a task into detailed steps. | Supported | Not supported | Not supported | Not supported | Not supported |
Navigation URL |
| Navigate to a specified URL in the browser. | Supported | Not supported | Not supported | Not supported | Not supported |
Get operation task progress |
| Get the current progress or final result of an asynchronous operation task. | Supported | Not supported | Not supported | Not supported | Not supported |
Execute natural language task |
| Execute a task described in natural language. | Supported | Not supported | Not supported | Not supported | Not supported |
Take webpage screenshot |
| Capture an image of the current web page. | Supported | Not supported | Not supported | Not supported | Not supported |
Get observation task result |
| Get the result of an asynchronous observation task. | Supported | Not supported | Not supported | Not supported | Not supported |
Query task status |
| Query the status of a task by its ID. | Supported | Not supported | Not supported | Not supported | Not supported |
Extract webpage data |
| Retrieve information or data from a web page based on instructions. | Supported | Not supported | Not supported | Not supported | Not supported |
Synchronously execute page operation |
| Synchronously execute one or more operations on the current web page. | Supported | Not supported | Not supported | Not supported | Not supported |
Asynchronously execute page operation |
| Start one or more operations asynchronously and return a task ID. | Supported | Not supported | Not supported | Not supported | Not supported |
Terminate running task |
| Terminate a running task by its ID. | Supported | Not supported | Not supported | Not supported | Not supported |
Asynchronously extract web data |
| Asynchronously retrieve information or data from a web page. | Supported | Not supported | Not supported | Not supported | Not supported |
Log in on page |
| Perform a login operation using specific login configurations. | Supported | Not supported | Not supported | Not supported | Not supported |
Find interactive elements |
| Find and describe interactive UI elements on the current page. | Supported | Not supported | Not supported | Not supported | Not supported |
Close browser session |
| Close the current browser proxy session and clean up resources. | Supported | Not supported | Not supported | Not supported | Not supported |
Desktop screenshot in Base64 format |
| Capture a desktop screenshot and return base64-encoded data. | Supported | Supported | Supported | Supported | Not supported |
Stop Chrome browser |
| Stop the Chrome browser. | Supported | Not supported | Not supported | Not supported | Not supported |
Start Chrome with CDP |
| Start Chrome and create a WebSocket debugging port for the Chrome DevTools Protocol (CDP). | Supported | Not supported | Not supported | Not supported | Not supported |
Edit file by line |
| Edit a text file by line. Show changes in git-style diff format. | Supported | Supported | Supported | Supported | Not supported |
Get file metadata |
| Get detailed metadata for a file or directory. | Supported | Supported | Supported | Supported | Not supported |
Monitor file changes |
| Monitor filesystem events in a directory and return new events. | Supported | Supported | Supported | Supported | Not supported |
Read file content |
| Read file content from the filesystem. Supports offset and length parameters. | Supported | Supported | Supported | Supported | Not supported |
Read multiple files |
| Read the contents of multiple files at once. | Supported | Supported | Supported | Supported | Not supported |
List directory files |
| Get a detailed list of all files and directories at a specified path. | Supported | Supported | Supported | Supported | Not supported |
Move or rename file |
| Move or rename files and directories. | Supported | Supported | Supported | Supported | Not supported |
Search files recursively |
| Recursively search for files and directories matching a pattern using the find command. | Supported | Supported | Supported | Supported | Not supported |
Write file content |
| Create a new file or write content to an existing file. | Supported | Supported | Supported | Supported | Not supported |
Copy file or directory |
| Copy a file or directory to a new location while keeping the original. | Supported | Supported | Supported | Supported | Not supported |
Delete file or directory |
| Delete files and directories from the filesystem. | Supported | Supported | Supported | Supported | Supported |
Create directory |
| Create a new directory or ensure it exists. | Supported | Supported | Supported | Supported | Not supported |
Run shell command |
| Run a shell or cmd command with timeout. | Supported | Supported | Supported | Supported | Not supported |
Get system metrics |
| Get system metrics, including CPU, memory, disk, and network statistics. | Supported | Supported | Supported | Supported | Not supported |
List installed applications |
| Get a list of installed applications on the system. | Not supported | Supported | Supported | Supported | Not supported |
Start application |
| Start an application using a specified command. | Not supported | Supported | Supported | Supported | Not supported |
Stop application by command |
| Stop an application using a stop command. | Not supported | Supported | Supported | Supported | Not supported |
Get mouse cursor position |
| Get the current mouse cursor position on screen. | Not supported | Supported | Supported | Not supported | Not supported |
Drag mouse |
| Drag the mouse from a start point to an end point. | Not supported | Supported | Supported | Not supported | Not supported |
Get screen size |
| Get the width, height, and DPI scaling ratio of the primary screen. | Not supported | Supported | Supported | Not supported | Not supported |
Move mouse cursor |
| Move the mouse cursor to a specified screen coordinate. | Not supported | Supported | Supported | Not supported | Not supported |
Release keyboard keys |
| Release multiple keyboard keys. | Not supported | Supported | Supported | Not supported | Not supported |
Press keyboard shortcut |
| Press multiple keyboard keys simultaneously. Supports shortcuts. | Not supported | Supported | Supported | Not supported | Not supported |
Input text |
| Input specified text. | Not supported | Supported | Supported | Supported | Not supported |
Scroll mouse wheel |
| Scroll the mouse wheel at a specified coordinate. | Not supported | Supported | Supported | Not supported | Not supported |
Click mouse |
| Click at a specified absolute coordinate. | Not supported | Supported | Supported | Not supported | Not supported |
Set window to full screen |
| Set a specified window to full-screen mode. | Not supported | Supported | Supported | Not supported | Not supported |
Minimize window |
| Minimize a specified window. | Not supported | Supported | Supported | Not supported | Not supported |
Resize window |
| Resize a specified window. | Not supported | Supported | Supported | Not supported | Not supported |
Activate window |
| Activate (focus) a specified window. | Not supported | Supported | Supported | Not supported | Not supported |
List root windows |
| List all root windows and their associated information. | Not supported | Supported | Supported | Not supported | Not supported |
Maximize window |
| Maximize a specified window. | Not supported | Supported | Supported | Not supported | Not supported |
Close window |
| Close a specified window. | Not supported | Supported | Supported | Not supported | Not supported |
Get active window info |
| Get detailed information about the current active window. | Not supported | Supported | Supported | Not supported | Not supported |
Focus mode control |
| Enable or disable focus mode to restrict the foreground window scope. | Not supported | Supported | Not supported | Not supported | Not supported |
Restore window to normal state |
| Restore a specified window to its normal state. | Not supported | Supported | Supported | Not supported | Not supported |
Stop process by name |
| Stop all related processes by process name. | Not supported | Supported | Supported | Not supported | Not supported |
List visible applications |
| List all applications with visible windows and their process information. | Not supported | Supported | Supported | Not supported | Not supported |
Stop process by PID |
| Terminate a specified process by its PID. | Not supported | Supported | Supported | Not supported | Not supported |
Run code in sandbox |
| Run a specified code snippet in the sandbox. Supports multiple programming languages. | Not supported | Supported | Not supported | Not supported | Not supported |
Code for semantic search |
| Search for code or knowledge in the codebase by meaning using semantic search. | Not supported | Not supported | Not supported | Not supported | Supported |
Search code symbols |
| Discover code symbols and their relationships (classes, methods, interfaces, etc.). | Not supported | Not supported | Not supported | Not supported | Supported |
List directory structure |
| List directory contents to help understand the file structure. | Not supported | Not supported | Not supported | Not supported | Supported |
Glob search for file paths |
| Search for file paths in the workspace using glob patterns. | Not supported | Not supported | Not supported | Not supported | Supported |
Regex search in code |
| Perform high-performance file content search using the ripgrep engine. | Not supported | Not supported | Not supported | Not supported | Supported |
Get compilation errors |
| Get compilation or lint errors in code files. | Not supported | Not supported | Not supported | Not supported | Supported |
Run command in terminal |
| Run a shell command in the terminal. | Not supported | Not supported | Not supported | Not supported | Supported |
Get terminal output |
| Get the execution output of background terminal commands. | Not supported | Not supported | Not supported | Not supported | Supported |
Preview web service |
| Set up a preview browser for a local web service. | Not supported | Not supported | Not supported | Not supported | Supported |
Exact string replacement |
| Replace exact strings in a file. | Not supported | Not supported | Not supported | Not supported | Supported |
Create or overwrite file |
| Create a new file or overwrite an existing file with specified content. | Not supported | Not supported | Not supported | Not supported | Supported |
Task list management |
| Create and manage task lists to track complex multi-step tasks. | Not supported | Not supported | Not supported | Not supported | Supported |
Ask user question |
| Ask the user questions during execution to collect preferences or decisions. | Not supported | Not supported | Not supported | Not supported | Supported |
Switch interaction mode |
| Switch interaction modes (for example, switch to planning mode). | Not supported | Not supported | Not supported | Not supported | Supported |
Create implementation plan |
| Create and store a concise implementation plan. | Not supported | Not supported | Not supported | Not supported | Supported |
Update memory data |
| Update, add, or delete memory based on user intent. | Not supported | Not supported | Not supported | Not supported | Supported |
Tap on mobile device |
| Perform a tap action at a specified screen coordinate (mobile). | Not supported | Not supported | Not supported | Supported | Not supported |
Swipe on screen |
| Perform a swipe gesture on the screen. | Not supported | Not supported | Not supported | Supported | Not supported |
Send key event |
| Send a key event with a specified key code (for example, Back or HOME key). | Not supported | Not supported | Not supported | Supported | Not supported |
Clear input field text |
| Clear text in the current input field. | Not supported | Not supported | Not supported | Supported | Not supported |
Take long screenshot with scroll |
| Take a long screenshot by scrolling and stitch the images together. | Not supported | Not supported | Not supported | Supported | Not supported |
Get all UI elements |
| Get all UI elements on the device, including non-interactive ones. | Not supported | Not supported | Not supported | Supported | Not supported |
Get clickable UI elements |
| Get all clickable UI elements on the device. | Not supported | Not supported | Not supported | Supported | Not supported |
Start asynchronous task |
| Start an asynchronous mobile task and return a task ID. | Not supported | Not supported | Not supported | Supported | Not supported |
Query asynchronous task status |
| Get the status and incremental output stream of an asynchronous task. | Not supported | Not supported | Not supported | Supported | Not supported |
Terminate asynchronous task |
| Terminate a running asynchronous task. | Not supported | Not supported | Not supported | Supported | Not supported |
Send keyboard scan codes |
| Send raw keyboard scan codes for low-level keyboard input control. | Not supported | Not supported | Supported | Not supported | Not supported |
Tool capability details
Tool capability | Tool name | Tool description | Parameter example |
Create sandbox environment |
| Creates a new AgentBay sandbox and returns its ID. | |
Get sandbox access URL |
| Get the Wuying MCP runtime URL for user access. Each URL is valid for a single use and expires immediately. | |
Take full-screen screenshot and share |
| Captures a full-screen screenshot of the current screen and returns a shareable URL. The screenshot is automatically processed and stored securely. For security, the generated URL expires after 64 minutes. | |
Release sandbox resources |
| Releases resources after the task is complete. | |
Run shell command |
| Runs a shell command and returns the output or an error. | |
Close browser page |
| Closes the browser page. | |
Resize browser window |
| Resizes the browser window. | |
Get console messages |
| Returns all console messages. | |
Handle pop-up dialog |
| Handles a dialog box. | |
Upload file in browser |
| Uploads one or more files. | |
Install browser |
| Installs the browser specified in the configuration. Call this tool if you receive an error that the browser is not installed. | |
Press key in browser |
| Presses a key on the keyboard. | |
Navigate to URL in browser |
| Navigates to a URL. | |
Navigate back in browser |
| Navigates to the previous page. | |
Navigate forward in browser |
| Navigates to the next page. | |
Get network requests |
| Returns all network requests since the page was loaded. | |
Take page screenshot |
| Takes a screenshot of the current page. You cannot perform actions based on the screenshot. Use `browser_snapshot` for actions. | |
Capture page snapshot |
| Captures an accessibility snapshot of the current page. | |
Click in browser |
| Performs a click action on a web page. | |
Drag browser element |
| Performs a drag-and-drop operation between two elements. | |
Hover over browser element |
| Hovers the mouse over a page element. | |
Type text in browser |
| Types text into an editable element. | |
Select option from dropdown |
| Selects an option in a dropdown menu. | |
Wait for page content |
| Waits for text to appear or disappear, or for a specified time to pass. | |
Create directory |
| Creates a new directory or ensures a directory exists. You can create multiple nested directories in a single operation. If the directory already exists, this operation succeeds silently. Ideal for setting up the required directory structure for a project or ensuring a required path exists. Works only in allowed directories. | |
Edit file by line |
| Performs line-based edits on a text file. Each edit replaces an exact sequence of lines with new content. Returns a git-style diff to show the changes made. Works only in allowed directories. | |
Get file metadata |
| Retrieves detailed metadata for a file or directory. Returns comprehensive information including size, creation time, last modification time, permissions, and type. This tool is ideal for understanding file characteristics without reading the actual content. Works only in allowed directories. | |
Read file content |
| Reads the content of a file from the filesystem. You can specify an optional 'offset' (in bytes) to start reading from a specific position, and an optional 'length' (in bytes) to limit the number of bytes to read. If 'length' is omitted or is 0, the file is read to the end. Handles various text encodings and provides detailed error messages if the file cannot be read. Works only in allowed directories. | |
Read multiple files |
| Reads the content of multiple files simultaneously. This is more efficient than reading files one by one when you need to analyze or compare multiple files. The content of each file is returned with its path as a reference. A failure to read a single file does not stop the entire operation. Works only in allowed directories. | |
List directory files |
| Gets a detailed list of all files and directories in a specified path. The results clearly distinguish between files and directories with [FILE] and [DIR] prefixes. This tool is useful for understanding directory structure and finding specific files. Works only in allowed directories. | |
Move or rename file |
| Moves or renames files and directories. You can move a file to a different directory and rename it in a single operation. The operation fails if the destination already exists. It can work between different directories and can be used for simple renames within the same directory. Both source and destination must be in allowed directories. | |
Search files recursively |
| Recursively searches for files and directories that match a pattern. Searches all subdirectories from the starting path. The search is case-sensitive and matches partial names. Returns the full path of all matching items. Ideal for finding files when you do not know the exact location. Searches only in allowed directories. | |
Write file content |
| Creates a new file or writes to the content of an existing file. You can choose to completely overwrite the file or append to the end of the file by specifying the | |
Synchronously execute page operation |
| Executes one or more actions on the current webpage and blocks until all actions are complete. This method immediately executes the provided actions through the current agent and returns the final result after the entire sequence of actions is complete. It can handle single interactions along with a series of consecutive actions. | |
Take webpage screenshot |
| Captures a screenshot of the current webpage. This tool provides flexible screenshot capabilities, supporting full-page screenshots, cropping of specified areas, and image quality settings. The result is returned as a data URL (such as | |
Close browser session |
| Closes the current browser proxy session. This terminates the browser process managed by the proxy and releases related resources. | |
Find interactive elements |
| Identifies and locates interactive user interface elements. This tool helps you discover and describe elements that can be interacted with (such as buttons or input fields) for use in subsequent actions. Use the To extract structured data or text content, use the | |
Asynchronously execute page operation |
| Asynchronously starts executing one or more actions on the current webpage and returns a Unlike | |
Get asynchronous operation task progress |
| Gets the current progress or final result of an asynchronous operation task. This method should be used with the | |
Get asynchronous extraction task result |
| Asynchronous function to get the result of an extract task. | |
Navigate to URL on page |
| Navigates to the specified URL in the browser. | |
Run code |
| Runs code in the specified context on the Linux platform and returns the output or an error. | |