WebDriver

From AutoIt Wiki
Revision as of 08:12, 25 April 2024 by MLipok (talk | contribs) (→‎References: "au3webdriver-boilerplate")
Jump to navigation Jump to search

The W3C WebDriver API is a platform and language-neutral interface and wire protocol allowing programs or scripts to control the behavior of a web browser.

Introduction

WebDriver API

WebDriver enables developers to create automated tests that simulate user interaction. This is different from JavaScript unit tests because WebDriver has access to functionality and information that JavaScript running in the browser doesn't, and it can more accurately simulate user events or OS-level events. WebDriver can also manage testing across multiple windows, tabs and webpages in a single test session.

WebDriver UDF

The WebDriver UDF allows to interact with any browser that supports the W3C WebDriver specifications. Supporting multiple browsers via the same code base is now possible with just a few configuration settings.

Requirements

(Last modified: 2022/01/25)

The following UDFs need to be installed - independent of the Browser you try to automate:

One of the following Drivers needs to be installed - depending on the Browser type and version you try to automate:

Browser Download Link Comments
Chrome Google Follow this link to select the correct version depending on the Chrome version you run!
Edge Microsoft
Firefox GitHub Firefox version ≥ 60 is recommended

Note: You must still have the Microsoft Visual Studio redistributable runtime installed on your system for the binary to run. This is a known bug in version 0.26 which the authors weren't able fix for this release.

Opera GitHub The versioning of OperaDriver matches the Chromium version on which Opera browser is based on.

Limitations

(Last modified: 2020/01/28)
Not all WebDriver functions have been implemented by each browser. To check the status goto the corresponding website below:

  • Chrome
  • Edge
  • Firefox
  • Opera: "OperaChromiumDriver is a WebDriver implementation derived from ChromeDriver and adapted by Opera". That's why I think it has at least the same limitations as ChromeDriver.

Big Picture

How the browser independent and browser dependent parts fit together:

Big Picture - How everything fits together

Technical terms

(Last modified: 2022/02/12)

You will encounter the following technical terms when working with the WebDriver UDF.
These terms are not unique to the UDF, so you will find just the general description in this section. How to use these terms with the WebDriver UDF can be found in the FAQ.

CDP (Chrome DevTools Protocol)
Is a protocol that allows for tools to instrument, inspect, debug and profile Chromium, Chrome and other Blink-based browsers.
CSS Selector
Please see Selector.
Locator Strategy
A Locator strategy describes the method to use to search elements. This includes Linktext, Partial Linktext, Tag Name, CSS selector, XPath selector.
Marionette
Marionette is an automation driver for Mozilla’s Gecko engine. It can remotely control either the UI or the internal JavaScript of a Gecko platform, such as Firefox. It can control both the chrome (i.e. menus and functions) or the content (the webpage loaded inside the browsing context), giving a high level of control and ability to replicate user actions. In addition to performing actions on the browser, Marionette can also read the properties and attributes of the DOM.
Marionette consists of two parts: a server which takes requests and executes them in Gecko (the Marionette server ships with Firefox), and a client (the Marionette client ships with the GeckoDriver exe). The client sends commands to the server and the server executes the command inside the browser.
For details please visit this site.
Selector
Selectors are patterns used to select the element(s) you want to process. They are used by
CSS (see this link for Beginners or this link for Advanced) and
XPath (see this link for Beginners and this link for Advanced).
ShadowRoot
The ShadowRoot interface of the Shadow DOM API is the root node of a DOM sub tree that is rendered separately from a document's main DOM tree.
For details please visit this site.
XPath
XPath is a language for navigating in XML documents. XPath is a major element in the XSLT standard and includes over 200 built-in functions.
XQuery
XQuery is a language for querying XML documents.
For details please visit this site.
XSLT
XSLT is a language for transforming XML documents.
For details please visit this site.

Installation

(Last modified: 2022/02/02)

To automate your browser the following installation steps are needed:

  • Download the files listed in section "Requirements"
  • Move the UDFs to a directory where SciTE and Autoit can find them:
    • wd_Core.au3, wd_helper.au3 , wd_cdp.au3 , wd_capabilities.au3 from the WebDriver UDF
    • Json.au3 and BinaryCall.au3 from the JSON UDF
    • WinHttp.au3 and WinHttpConstants.au3 from the WinHttp UDF
  • Move the browser dependent WebDriver to the same directory (with WD_Demo.au3):
    • chromedriver.exe (Chrome)
    • geckodriver.exe (Firefox)
    • msedgedriver.exe (Edge - Chromium) or MicrosoftWebDriver.exe (Edge - EdgeHTML)
    • or use "Update" option by choosing one of ComboBox option in WD_Demo.au3
  • Run WD_Demo.au3 and select "DemoNavigation" to validate the installation.
    The result (for Firefox) displayed in the DOS window should be similar to the following:
1577745813519   geckodriver     DEBUG   Listening on 127.0.0.1:4444
1577745813744   webdriver::server       DEBUG   -> POST /session {"capabilities": {"alwaysMatch": {"browserName": "firefox", "acceptInsecureCerts":true}}}
1577745813746   geckodriver::capabilities       DEBUG   Trying to read firefox version from ini files
1577745813747   geckodriver::capabilities       DEBUG   Found version 71.0
1577745813757   mozrunner::runner       INFO    Running command: "C:\\Program Files\\Mozilla Firefox\\firefox.exe" "-marionette" "-foreground" "-no-remote" "-profile" "C:\\ ...
1577745813783   geckodriver::marionette DEBUG   Waiting 60s to connect to browser on 127.0.0.1:55184
1577745817392   geckodriver::marionette DEBUG   Connection to Marionette established on 127.0.0.1:55184.
1577745817464   webdriver::server       DEBUG   <- 200 OK {"value":{"sessionId":"925641bf-6c5d-4fe2-a985-02de9b1c7c74","capabilities":"acceptInsecureCerts":true,"browserName":"firefox", ...

Getting started example

(Last modified: 2023/09/09)

#include "wd_helper.au3"
#include "wd_capabilities.au3"

_Example()

Func _Example()
	# REMARK
	#   This is not functional script
	#   It only shows the concept how to use WebDriver UDF

	#Region ; initialize webdriver sesion

	; you should take care about download/update dirver
	If $IDYES = MsgBox($MB_YESNO + $MB_TOPMOST + $MB_ICONQUESTION + $MB_DEFBUTTON1, "Question", _
			"Do you want to download/update driver ?") Then
		_WD_UpdateDriver('chrome')
	EndIf

	; specify driver, port and other options
	_WD_Option('Driver', 'chromedriver.exe')
	_WD_Option('Port', 9515)
	_WD_Option('DriverParams', '--verbose --log-path="' & @ScriptDir & '\chrome.log"')

	; start the driver
	_WD_Startup()
	If @error Then Return SetError(@error, @extended, 0) ; always remember to check and handle error's

	; create capabilites for session
	_WD_CapabilitiesStartup()
	_WD_CapabilitiesAdd('alwaysMatch', 'chrome')
	_WD_CapabilitiesAdd('w3c', True)
	_WD_CapabilitiesAdd('excludeSwitches', 'enable-automation')
	Local $sCapabilities = _WD_CapabilitiesGet()

	; create session with given Capabilities
	Local $WD_SESSION = _WD_CreateSession($sCapabilities)
	If @error Then Return SetError(@error, @extended, 0) ; always remember to check and handle error's

	#EndRegion ; initialize webdriver sesion

	#Region ; do your's stuff

	; navigate to some website
	Local $sURL = '******'
	_WD_Navigate($WD_SESSION, $sURL)
	If @error Then Return SetError(@error, @extended, 0) ; always remember to check and handle error's

	; wait for loading process ends
	_WD_LoadWait($WD_SESSION, 1000)
	If @error Then Return SetError(@error, @extended, 0) ; always remember to check and handle error's

	; for example find element
	Local $sXPath = "*****"
	Local $sElement = _WD_FindElement($WD_SESSION, $_WD_LOCATOR_ByXPath, $sXPath)
	If @error Then Return SetError(@error, @extended, 0) ; always remember to check and handle error's

	; get element text
	Local $sText = _WD_ElementAction($WD_SESSION, $sElement, 'text')
	If @error Then Return SetError(@error, @extended, 0) ; always remember to check and handle error's
	ConsoleWrite($sText & @CRLF)

	; or click the element
	_WD_ElementAction($WD_SESSION, $sElement, 'click')
	If @error Then Return SetError(@error, @extended, 0) ; always remember to check and handle error's

	#EndRegion ; do your's stuff

	#Region ; Clean Up

	; on the end session should be deleted
	_WD_DeleteSession($WD_SESSION)
	If @error Then Return SetError(@error, @extended, 0) ; always remember to check and handle error's

	; and driver should be closed
	_WD_Shutdown()
	If @error Then Return SetError(@error, @extended, 0) ; always remember to check and handle error's

	#EndRegion ; Clean Up

EndFunc   ;==>_Example

Function reference

(Last modified: 2021/12/09)

The functions are now documented in the CHM help file that comes with the UDF.

Capabilities

WebDriver capabilities are used to communicate the features supported by a given implementation. The local end may use capabilities to define which features it requires the remote end to satisfy when creating a new session. Likewise, the remote end uses capabilities to describe the full feature set for a session.
Details can be found at WebDriver Capabilities (sub page).

Browser related functionality

Google Chrome

ChromeDriver supports "Chrome DevTools Protocol" (CDP) commands (for an explanation of the term and related links please see the Used Terms section).
Details can be found in the CHM Help file that comes with the UDF.

Translate IE UDF to WebDriver

This page is still a work in progress.

Internet Explorer is no longer supported by Microsoft. Therefore, it may be necessary to rewrite existing scripts for other browsers using the WebDriver UDF. Below you will find a mapping of the functions of the IE UDF to the functions of the WebDriver UDF.

IE function WebDriver function Comments
_IEAction _WD_ElementAction + _WD_ElementActionEx
_IEAttach _WD_Attach

_WD_Attach: Attach to existing browser tab.

Attaching to running browser instance/session is much more difficult. Examples can be found here.

_IEBodyReadHTML _WD_GetSource
_IEBodyReadText _WD_ExecuteScript($sSession, 'return document.body.innerText',Default,Default, $_WD_JSON_Value)
_IEBodyWriteHTML _WD_ExecuteScript($s_Session, "document.body.outerHTML='" & $sHTML & "'")
_IECreate _WD_CreateSession
_IECreateEmbedded N/A
_IEDocGetObj
_IEDocInsertHTML
_IEDocInsertText
_IEDocReadHTML _WD_GetSource
_IEDocWriteHTML _WD_ExecuteScript($s_Session, "document.open();document.write(" & $sHTML & ");document.close();") https://stackoverflow.com/a/11984907/5314940
_IEErrorNotify N/A you can set debug level for example: $_WD_DEBUG = $_WD_DEBUG_Full
_IEFormElementCheckBoxSelect _WD_ElementActionEx _WD_FindElement + _WD_ElementActionEx($sSession, $sElement, "check")
_IEFormElementGetCollection _WD_FindElement
_IEFormElementGetObjByName _WD_GetElementByName
_IEFormElementGetValue _WD_ElementAction _WD_FindElement + _WD_ElementAction($sSession, $sSelectElement, "value")
_IEFormElementOptionSelect _WD_ElementSelectAction + _WD_ElementOptionSelect
_IEFormElementRadioSelect _WD_ElementSelectAction
_IEFormElementSetValue _WD_SetElementValue
_IEFormGetCollection _WD_FindElement
_IEFormGetObjByName _WD_GetElementByName
_IEFormImageClick _WD_ElementAction _WD_FindElement + _WD_ElementAction($sSession, $sElement, "click")
_IEFormReset
_IEFormSubmit
_IEFrameGetCollection _WD_FrameList _WD_FrameList($sSession, True)
_IEFrameGetObjByName _WD_GetElementByName + _WD_FrameEnter
_IEGetObjById _WD_GetElementById
_IEGetObjByName _WD_GetElementByName
_IEHeadInsertEventScript
_IEImgClick _WD_ElementAction _WD_FindElement + _WD_ElementAction($sSession, $sElement, "click")
_IEImgGetCollection _WD_FindElement
_IEIsFrameSet
_IELinkClickByIndex _WD_ElementAction _WD_FindElement + _WD_ElementAction($sSession, $sElement, "click")
_IELinkClickByText _WD_LinkClickByText
_IELinkGetCollection _WD_FindElement
_IELoadWait _WD_LoadWait
_IELoadWaitTimeout _WD_SetTimeouts
_IENavigate _WD_Navigate
_IEPropertyGet _WD_ElementAction _WD_FindElement + _WD_ElementAction($sSession, $sElement, 'property', 'nodeName')
_IEPropertySet _WD_ExecuteScript _WD_FindElement + ......
_IEQuit _WD_DeleteSession
_IETableGetCollection _WD_FindElement
_IETableWriteToArray _WD_GetTable _WD_FindElement + _WD_GetTable( ......
_IETagNameAllGetCollection _WD_FindElement
_IETagNameGetCollection _WD_FindElement
_IE_Example N/A Take a look at wd_demo.au3 script
_IE_Introduction N/A Take a look at wd_demo.au3 script
_IE_VersionInfo _WD_Option('version')

Troubleshooting

Debug the WebDriver setup

(Last modified: 2020/07/06)

WinHTTP UDF

Make sure that you are running at least version 1.6.4.2 (currently unreleased, but can be obtained here).

Chrome

Chrome does not start and the DOS window for chromedriver does not get displayed

Firefox

Firefox does not start and the DOS window for geckodriver does not get displayed

Debug your Script

FAQ

(Last modified: 2022/03/24)

1. How to connect to a running browser instance
2. How to hide the webdriver console
3. How to utilize an existing user profile
4. How to specify location of browser executable
5. How to maximize the browser window
6. How to specify location of WebDriver executable
7. How to retrieve the values of a drop-down list
8. How to run the browser in headless mode (hidden mode)
9. How to configure the UDF to call a user-defined Sleep function
10. How to keep my WebDriver environment up-to-date
11. How to use "Locator strategy" and "Selectors"?
12. How to download PDF (or any other) file automatically?
13. How to setup session with proxy?
14. How to add capabilities options for "goog:chromeOptions" or "ms:edgeOptions" or "moz:firefoxOptions"?
15. How to change user-agent?
16. How to enable/disable logging into webdriver console?
17. How to use devtools in browsers?
18. How to troubleshoot when you can't locate the desired element using _WD_FindElement() ?

Tools

The following tools will help you to automate your browser:

  • ChroPath plugin: Makes finding an element by XPath, ID or CSS incredibly easy (Chrome, Firefox, Opera)
  • SelectorsHub plugin: Next Gen XPath tool to generate, write and verify the XPath and cssSelectors (All browsers)
  • Save Page WE plugin: helps to save web page content as single MHTML file (Save Page WE for Firefox, Save Page WE for Chrome)
  • Stylus Stylus is a fork of Stylish for Chrome, also compatible with Firefox as a WebExtension

References

(Last modified: 2022/02/12)

Further information sources: