BiDirectional functionality

1: WebDriver BiDi Logging Features
2: WebDriver BiDi Network Features
3: WebDriver BiDi Script Features
4: Chrome DevTools Protocol

4.1: Chrome DevTools Logging Features
4.2: Chrome DevTools Network Features
4.3: Chrome DevTools Script Features

5: BiDirectional API (W3C compliant)

5.1: Browsing Context
5.2: Input
5.3: Log
5.4: Network
5.5: Script

Bidirectional means that communication happens in two directions at the same time. The traditional WebDriver model uses strict request/response commands, which allow communication in only one direction at any given time. In most cases this is what you want, because it ensures that the browser does the expected things in the right order, but asynchronous interactions make a number of interesting things possible.

This functionality is currently available in a limited fashion through the Chrome DevTools Protocol (CDP). To address some of CDP's drawbacks, the Selenium team, along with the major browser vendors, has worked to create the new WebDriver BiDi Protocol. This specification aims to create a stable, cross-browser API that uses bidirectional communication for enhanced browser automation and testing functionality, including streaming events from the user agent to the controlling software via WebSockets. Users will be able to listen for events and record or manipulate them as they happen during a Selenium session.
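To make the contrast concrete, the sketch below models the two kinds of messages that travel over a BiDi WebSocket: replies that answer a numbered command, and events that the browser sends unprompted. The message shapes (`id`, `method`, `params`, and a `type` of `success` or `event`) follow the WebDriver BiDi specification. The `route_message` helper and the sample payloads are hypothetical and only illustrate the idea; they are not part of Selenium's API.

```python
import json


def route_message(raw, pending, events):
    """Sort one incoming BiDi message into a command reply or an event."""
    message = json.loads(raw)
    if message.get("type") == "event":
        # Events arrive without a matching request: the "second direction".
        events.append((message["method"], message["params"]))
    else:
        # Replies carry the id of the command they answer.
        pending[message["id"]] = message


# Command a client would send to subscribe to log events.
subscribe = {"id": 1, "method": "session.subscribe",
             "params": {"events": ["log.entryAdded"]}}

pending = {subscribe["id"]: None}  # commands still awaiting a reply
events = []                        # events pushed by the browser

# Hypothetical traffic from the browser: one reply, then one unsolicited event.
incoming = [
    json.dumps({"type": "success", "id": 1, "result": {}}),
    json.dumps({"type": "event", "method": "log.entryAdded",
                "params": {"type": "console", "level": "info", "text": "Hello"}}),
]
for raw in incoming:
    route_message(raw, pending, events)

print(pending[1]["type"])  # success
print(events[0][0])        # log.entryAdded
```

In a strict request/response model only the first kind of message exists; the event branch is what BiDi adds.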

Enabling BiDi in Selenium

To use WebDriver BiDi, set the following capability in the browser options to enable the required functionality:

Java:

    options.setCapability("webSocketUrl", true);

Python:

    options.enable_bidi = True

C#:

    UseWebSocketUrl = true,

Ruby:

    options.web_socket_url = true

JavaScript:

    Options().enableBidi();

Kotlin:

    options.setCapability("webSocketUrl", true);

This enables the WebSocket connection for bidirectional communication, unlocking the full potential of the WebDriver BiDi protocol.
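At the protocol level, each of the settings above asks for the same thing: a `webSocketUrl` capability set to `true` in the new-session request. In a session that supports BiDi, the returned capabilities then carry the WebSocket address as a string. The sketch below builds such a request body by hand to show its shape. The `build_new_session_payload` and `websocket_url_from` helpers, the browser name, and the sample URL are assumptions for illustration; in practice the browser options classes do this for you.

```python
import json


def build_new_session_payload(browser_name, enable_bidi=True):
    """Build a W3C new-session request body, optionally requesting BiDi."""
    always_match = {"browserName": browser_name}
    if enable_bidi:
        always_match["webSocketUrl"] = True
    return {"capabilities": {"alwaysMatch": always_match}}


def websocket_url_from(response_capabilities):
    """Return the BiDi WebSocket URL if the session exposes one, else None."""
    url = response_capabilities.get("webSocketUrl")
    return url if isinstance(url, str) else None


payload = build_new_session_payload("firefox")
print(json.dumps(payload))

# Hypothetical capabilities as a driver might return them.
returned = {"browserName": "firefox",
            "webSocketUrl": "ws://localhost:9222/session/abc123"}
print(websocket_url_from(returned))
```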

Note that Selenium is updating its entire implementation from WebDriver Classic to WebDriver BiDi (while maintaining backwards compatibility as much as possible), but this section of the documentation focuses on the new functionality that bidirectional communication allows. The low-level BiDi domains will be accessible in code to the end user, but the goal is to provide high-level APIs that map directly onto real-world use cases. As such, the low-level components will not be documented, and this section will focus only on the user-friendly features that we encourage users to take advantage of.

If there is additional functionality you’d like to see, please raise a feature request.

1 - WebDriver BiDi Logging Features

These features are related to logging. Because “logging” can refer to so many different things, these methods are made available via a “script” namespace.

Remember that to use WebDriver BiDi, you must enable it in Options. For more details, see Enabling BiDi.

Console Message Handlers

Record or take actions on console.log events.

Add Handler

Implementation Missing

    log_entries = []  # collects each console message as it arrives
    driver.script.add_console_message_handler(log_entries.append)
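The handler is any callable that accepts one argument, which is why a list's `append` method works for recording messages. The toy registry below imitates that pattern without a browser, so the add and remove steps can be followed end to end. `ConsoleHandlerRegistry`, its method names, and the string messages are hypothetical stand-ins, not Selenium classes; a real session would pass a richer log entry object rather than a plain string.

```python
import itertools


class ConsoleHandlerRegistry:
    """Toy stand-in for a namespace that dispatches console messages."""

    def __init__(self):
        self._handlers = {}
        self._ids = itertools.count(1)

    def add_handler(self, callback):
        """Register a callback and return an id that can later remove it."""
        handler_id = next(self._ids)
        self._handlers[handler_id] = callback
        return handler_id

    def remove_handler(self, handler_id):
        """Stop delivering messages to the callback registered under this id."""
        self._handlers.pop(handler_id, None)

    def emit(self, message):
        """Simulate the browser reporting a console message."""
        for callback in list(self._handlers.values()):
            callback(message)


registry = ConsoleHandlerRegistry()
log_entries = []

handler_id = registry.add_handler(log_entries.append)
registry.emit("first message")    # recorded by the handler
registry.remove_handler(handler_id)
registry.emit("second message")   # ignored: no handler is registered

print(log_entries)  # ['first message']
```

Returning an id from the add step keeps removal simple even when the same callable is registered more than once.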
