Categorygithub.com/4ydx/chrome-protocol
modulepackage
1.0.3
Repository: https://github.com/4ydx/chrome-protocol.git
Documentation: pkg.go.dev

# README

About chrome-protocol

A relatively thin wrapper on top of code that is generated based on the chrome devtool protocol. Aims to provide a few of the basic commands that one would desire when automating actions in chrome or any other browser that supports the protocol.

This is still a work in progress.

  • Very fast.
  • No hidden errors.
  • No hidden sleep calls/timeouts.
  • Simple approach makes it easy to understand what is happening under the hood.

Examples

Look under github.com/4ydx/chrome-protocol/actions. The testing files are the examples. There is one example in the example folder.

  • Click
  • Fill
  • Focus
  • Navigation
  • Screenshot
  • As well as other actions (css style retrieval, javascript evaluation, etc).

I will be working on other actions as I need them for my own personal projects.

You can take the generated code in github.com/4ydx/cdp and create your own higher level actions for interacting with the browser. This will require understanding the Devtools Reference.

Navigation example:

package main

import (
	"github.com/4ydx/chrome-protocol"
	"github.com/4ydx/chrome-protocol/actions"
	"log"
	"time"
)

func main() {
	browser := cdp.NewBrowser("/usr/bin/google-chrome", 9222, "browser.log")

	frame := cdp.Start(browser, cdp.LogBasic)
	defer func() {
		// passing false prevents the browser from stopping immediately
		frame.Stop(false)

		// Give yourself time to view the final page in the browser.
		time.Sleep(3 * time.Second)
		browser.Stop()
	}()

	// Enable page events
	if err := actions.EnablePage(frame, time.Second*2); err != nil {
		panic(err)
	}

	// Navigate
	if _, err := actions.Navigate(frame, "https://google.com", time.Second*5); err != nil {
		panic(err)
	}

	log.Printf("\n-- All completed for %s --\n", frame.FrameID)
}

Creating your own Actions

Actions encapsulate everything you need in order to interact with a browser. An action contains commands and events.

When you construct an action, you need to fill in at least one command that consists of the struct representing the parameters that will be sent with the command, the struct that represents the reply to that command from the server, and the method name of the API call you are making.

It is possible to associate events that the server will send to the client with your action. By specifying events you can be sure that a given action has actually run its course and the browser state is where you would expect it to be.

API methods, command parameters, command responses, possible events, and types are all defined in the Devtools Reference.

This is a possible Navigation action that watches for the FrameStoppedLoadingReply event which helps to ensure that navigation to a url is fully completed.

// Navigate sends the browser to the given URL
func Navigate(frame *cdp.Frame, url string, timeout time.Duration) ([]cdp.Event, error) {
	events := GetNavigationEvents()
	action := cdp.NewAction(frame,
		events,
		[]cdp.Command{
			cdp.Command{ID: frame.RequestID.GetNext(), Method: page.CommandPageNavigate, Params: &page.NavigateArgs{URL: url}, Reply: &page.NavigateReply{}, Timeout: timeout},
		})
	if err := action.Run(); err != nil {
		log.Print(err)
		return events, err
	}
	return events, nil
}

Caveats

  • Concurrent actions are currently not supported.

# Packages

No description provided by the author
No description provided by the author

# Functions

GetWebsocket returns a websocket connection to the running browser.
NewAction returns a newly created action with any events that will be triggered by commands the action will take.
NewBrowser accepts the path to the browser's binary, the port, and any arguments that need to be passed to the binary.
Read reads replies from the server over the websocket.
SendClose closes the websocket.
Start prepares required resources to begin automation.
UpdateDOMEvent takes the event and, for a certain subset of events, makes sure that the current DOM object is updated.
Write writes requests to the server over the websocket.

# Constants

LogAll records everything.
LogBasic records outgoing commands, their replies, and any specified events.
LogDetails records additional details about the reply from the server for a given command/event.

# Variables

Wait is the default timeout taken as the action wait loop runs.

# Structs

Action represents a collection of json requests (commands) and any events that those requests might trigger that need to be tracked.
Browser contains information required to stop an exec'd browser at a later point in time.
Command represents a single json request sent to the server over the websocket.
Error error type that is apart of the Message struct.
Event holds the value returned by the server based on a matching MethodType name.
Frame stores the current FrameID.
Message is the chrome DevTools Protocol message sent/read over the websocket connection.
RequestID stores the last value used for chrome devtool protocal requests being sent to the server.

# Interfaces

CommandReply specifies required methods for handling the json encoded replies received from the server.

# Type aliases

LogLevelValue is the type for loglevel information.