playwright/docs/src/service-workers-experimental-network-events-js.md
Yury Semikhatsky d62baa005f
docs: hide experimental service worker api in language ports (#15722)
* Revert "docs: expose BrowserContext.serviceWorkers to Java/.NET (#15635)"

This reverts commit 43906d0f7b.

* Revert "docs: expose BrowserContext.serviceWorker to Java/.NET (#15616)"

This reverts commit cfcc35b9a6.

* Make Request.serviceWorker available only in js
* Make sw doc js specific
2022-07-15 10:57:18 -07:00

17 KiB

id title
service-workers-experimental (Experimental) Service Worker Network Events

:::warning If you're looking to do general network mocking, routing, and interception, please see the Network Guide first. Playwright provides built-in APIs for this use case that don't require the information below. However, if you're interested in requests made by Service Workers themselves, please read below. :::

Service Workers provide a browser-native method of handling requests made by a page with the native Fetch API (fetch) along with other network-requested assets (like scripts, css, and images).

They can act as a network proxy between the page and the external network to perform caching logic or can provide users with an offline experience if the Service Worker adds a FetchEvent listener.

Many sites that use Service Workers simply use them as a transparent optimization technique. While users might notice a faster experience, the app's implementation is unaware of their existence. Running the app with or without Service Workers enabled appears functionally equivalent.

How to Enable

Playwright's inspection and routing of requests made by Service Workers are experimental and disabled by default.

Set the PW_EXPERIMENTAL_SERVICE_WORKER_NETWORK_EVENTS environment variable to 1 (or any other value) to enable the feature. Only Chrome/Chromium are currently supported.

If you're using (or are interested in using this this feature), please comment on this issue letting us know your use case.

Service Worker Fetch

Accessing Service Workers and Waiting for Activation

You can use [method: BrowserContext.serviceWorkers] to list the Service [Worker]s, or specifically watch for the Service [Worker] if you anticipate a page will trigger its registration:

const [ serviceworker ] = await Promise.all([
  context.waitForEvent('serviceworker'),
  page.goto('/example-with-a-service-worker.html'),
]);
const [ serviceworker ] = await Promise.all([
  context.waitForEvent('serviceworker'),
  page.goto('/example-with-a-service-worker.html'),
]);
async with context.expect_event("serviceworker") as event_info:
    await page.goto('/example-with-a-service-worker.html')
serviceworker = await event_info.value
with context.expect_event("serviceworker") as event_info:
    page.goto('/example-with-a-service-worker.html')
serviceworker = event_info.value
var waitForServiceWorkerTask = page.WaitForServiceWorkerAsync();
await page.GotoAsync('/example-with-a-service-worker.html');
var serviceworker = await waitForServiceWorkerTask;
Worker serviceWorker = page.waitForRequest(() -> {
  page.navigate('/example-with-a-service-worker.html');
});

[event: BrowserContext.serviceWorker] is fired before the Service Worker's main script has been evaluated, so before calling service[method: Worker.evaluate] you should wait on its activation.

There are more iodiomatic methods of waiting for a Service Worker to be activated, but the following is an implementation agnostic method:

await page.evaluate(async () => {
  const registration = await window.navigator.serviceWorker.getRegistration();
  if (registration.active?.state === 'activated')
    return;
  await new Promise(res => window.navigator.serviceWorker.addEventListener('controllerchange', res));
});
await page.evaluate(async () => {
  const registration = await window.navigator.serviceWorker.getRegistration();
  if (registration.active?.state === 'activated')
    return;
  await new Promise(res => window.navigator.serviceWorker.addEventListener('controllerchange', res));
});
await page.evaluate("""async () => {
  const registration = await window.navigator.serviceWorker.getRegistration();
  if (registration.active?.state === 'activated')
    return;
  await new Promise(res => window.navigator.serviceWorker.addEventListener('controllerchange', res));
}""")
page.evaluate("""async () => {
  const registration = await window.navigator.serviceWorker.getRegistration();
  if (registration.active?.state === 'activated')
    return;
  await new Promise(res => window.navigator.serviceWorker.addEventListener('controllerchange', res));
}""")
await page.EvaluateAsync(@"async () => {
  const registration = await window.navigator.serviceWorker.getRegistration();
  if (registration.active?.state === 'activated')
    return;
  await new Promise(res => window.navigator.serviceWorker.addEventListener('controllerchange', res));
}");
page.evaluate(
  "async () => {"
  "  const registration = await window.navigator.serviceWorker.getRegistration();" +
  "  if (registration.active?.state === 'activated')" +
  "    return;" +
  "  await new Promise(res => window.navigator.serviceWorker.addEventListener('controllerchange', res));" +
  "}"
)

Network Events and Routing

Any network request made by the Service Worker will have:

  • [event: BrowserContext.request] and its correponding events ([event: BrowserContext.requestFinished] and [event: BrowserContext.response], or [event: BrowserContext.requestFailed])
  • [method: BrowserContext.route] will see the request
  • [method: Request.serviceWorker] will be set to the Service [Worker] instance, and [method: Request.frame] will throw
  • [method: Response.fromServiceWorker] will return false

Additionally, any network request made by the Page (including its sub-[Frame]s) will have:

  • [event: BrowserContext.request] and its correponding events ([event: BrowserContext.requestFinished] and [event: BrowserContext.response], or [event: BrowserContext.requestFailed])
  • [event: Page.request] and its correponding events ([event: Page.requestFinished] and [event: Page.response], or [event: Page.requestFailed])
  • [method: Page.route] and [method: Page.route] will not see the request (if a Service Worker's fetch handler was registered)
  • [method: Request.serviceWorker] will be set to null, and [method: Request.frame] will return the [Frame]
  • [method: Response.fromServiceWorker] will return true (if a Service Worker's fetch handler was registered)

Many Service Worker implementations simply execute the request from the page (possibly with some custom caching/offline logic omitted for simplicity):

// filename: transparent-service-worker.js
self.addEventListener("fetch", (event) => {
  // actually make the request
  const responsePromise = fetch(event.request); 
  // send it back to the page
  event.respondWith(responsePromise);
});

self.addEventListener("activate", (event) => {
  event.waitUntil(clients.claim());
});

If a page registers the above Service Worker:

<!-- filename: index.html -->
<script>
  window.registrationPromise = navigator.serviceWorker.register('/transparent-service-worker.js');
</script>

On the first visit to the page via [method: Page.goto], the following Request/Response events would be emitted (along with the corresponding network lifecycle events):

Event Owner URL Routed [method: Response.fromServiceWorker]
[event: BrowserContext.request] [Frame] index.html Yes
[event: Page.request] [Frame] index.html Yes
[event: BrowserContext.request] Service [Worker] transparent-service-worker.js Yes
[event: BrowserContext.request] Service [Worker] data.json Yes
[event: BrowserContext.request] [Frame] data.json Yes
[event: Page.request] [Frame] data.json Yes

Since the example Service Worker just acts a basic transparent "proxy":

  • There's 2 [event: BrowserContext.request] events for data.json; one [Frame]-owned, the other Service [Worker]-owned.
  • Only the Service [Worker]-owned request for the resource was routable via [method: BrowserContext.route]; the [Frame]-owned events for data.json are not routeable, as they would not have even had the possibility to hit the external network since the Service Worker has a fetch handler registered.

:::caution It's important to note: calling [method: Request.frame] or [method: Response.frame] will throw an exception, if called on a [Request]/[Response] that has a non-null [method: Request.serviceWorker]. :::

Advanced Example

When a Service Worker handles a page's request, the Service Worker can make 0 to n requests to the external network. The Service Worker might respond directly from a cache, generate a reponse in memory, rewrite the request, make two requests and then combine into 1, etc.

Consider the code snippets below to understand Playwright's view into the Request/Responses and how it impacts routing in some of these cases.

// filename: complex-service-worker.js
self.addEventListener("install", function (event) {
  event.waitUntil(
    caches.open("v1").then(function (cache) {
      // 1. Pre-fetches and caches /addressbook.json
      return cache.add("/addressbook.json");
    })
  );
});

// Opt to handle FetchEvent's from the page
self.addEventListener("fetch", (event) => {
  event.respondWith(
    (async () => {
      // 1. Try to first serve directly from caches
      let response = await caches.match(event.request);
      if (response) return response;

      // 2. Re-write request for /foo to /bar
      if (event.request.url.endsWith("foo")) return fetch("./bar");

      // 3. Prevent tracker.js from being retrieved, and returns a placeholder response
      if (event.request.url.endsWith("tracker.js"))
        return new Response('conosole.log("no trackers!")', {
          status: 200,
          headers: { "Content-Type": "text/javascript" },
        });

      // 4. Otherwise, fallthrough, perform the fetch and respond
      return fetch(event.request);
    })()
  );
});

self.addEventListener("activate", (event) => {
  event.waitUntil(clients.claim());
});

And a page that simply registers the Service Worker:

<!-- filename: index.html -->
<script>
  window.registrationPromise = navigator.serviceWorker.register('/complex-service-worker.js');
</script>

On the first visit to the page via [method: Page.goto], the following Request/Response events would be emitted:

Event Owner URL Routed [method: Response.fromServiceWorker]
[event: BrowserContext.request] [Frame] index.html Yes
[event: Page.request] [Frame] index.html Yes
[event: BrowserContext.request] Service [Worker] complex-service-worker.js Yes
[event: BrowserContext.request] Service [Worker] addressbook.json Yes

It's important to note that cache.add caused the Service Worker to make a request (Service [Worker]-owned), even before addressbook.json was asked for in the page.

Once the Service Worker is activated and handling FetchEvents, if the page makes the following requests:

await page.evaluate(() => fetch('/addressbook.json'));
await page.evaluate(() => fetch('/foo'));
await page.evaluate(() => fetch('/tracker.js'));
await page.evaluate(() => fetch('/fallthrough.txt'));
await page.evaluate(() => fetch('/addressbook.json'));
await page.evaluate(() => fetch('/foo'));
await page.evaluate(() => fetch('/tracker.js'));
await page.evaluate(() => fetch('/fallthrough.txt'));
await page.evaluate("fetch('/addressbook.json')")
await page.evaluate("fetch('/foo')")
await page.evaluate("fetch('/tracker.js')")
await page.evaluate("fetch('/fallthrough.txt')")
page.evaluate("fetch('/addressbook.json')")
page.evaluate("fetch('/foo')")
page.evaluate("fetch('/tracker.js')")
page.evaluate("fetch('/fallthrough.txt')")
await page.EvaluateAsync("fetch('/addressbook.json')");
await page.EvaluateAsync("fetch('/foo')");
await page.EvaluateAsync("fetch('/tracker.js')");
await page.EvaluateAsync("fetch('/fallthrough.txt')");
page.evaluate("fetch('/addressbook.json')")
page.evaluate("fetch('/foo')")
page.evaluate("fetch('/tracker.js')")
page.evaluate("fetch('/fallthrough.txt')")

The following Request/Response events would be emitted:

Event Owner URL Routed [method: Response.fromServiceWorker]
[event: BrowserContext.request] [Frame] addressbook.json Yes
[event: Page.request] [Frame] addressbook.json Yes
[event: BrowserContext.request] Service [Worker] bar Yes
[event: BrowserContext.request] [Frame] foo Yes
[event: Page.request] [Frame] foo Yes
[event: BrowserContext.request] [Frame] tracker.js Yes
[event: Page.request] [Frame] tracker.js Yes
[event: BrowserContext.request] Service [Worker] fallthrough.txt Yes
[event: BrowserContext.request] [Frame] fallthrough.txt Yes
[event: Page.request] [Frame] fallthrough.txt Yes

It's important to note:

  • The page requested /foo, but the Service Worker requested /bar, so there are only [Frame]-owned events for /foo, but not /bar.
  • Likewise, the Service Worker never hit the network for tracker.js, so ony [Frame]-owned events were emitted for that request.

Routing Service Worker Requests Only

await context.route('**', async route => {
  if (route.request().serviceWorker()) {
    // NB: calling route.request().frame() here would THROW
    return route.fulfill({
      contentType: 'text/plain',
      status: 200,
      body: 'from sw',
    });
  } else {
    return route.continue();
  }
});
await context.route('**', async route => {
  if (route.request().serviceWorker()) {
    // NB: calling route.request().frame() here would THROW
    return route.fulfill({
      contentType: 'text/plain',
      status: 200,
      body: 'from sw',
    });
  } else {
    return route.continue();
  }
});
async def handle(route: Route):
  if route.request.service_worker:
    # NB: calling route.request.frame here would THROW
    await route.fulfill(content_type='text/plain', status=200, body='from sw');
  else:
    await route.continue_()
await context.route('**', handle)
def handle(route: Route):
  if route.request.service_worker:
    # NB: calling route.request.frame here would THROW
    route.fulfill(content_type='text/plain', status=200, body='from sw');
  else:
    route.continue_()
context.route('**', handle)
await context.RouteAsync("**", async route => {
  if (route.request().serviceWorker() != null) {
    // NB: calling route.request.frame here would THROW
    await route.FulfillAsync(
      contentType: "text/plain",
      status: 200,
      body: "from sw"
    );
  } else {
    await route.Continue()Async();
  }
});
browserContext.route("**", route -> {
  if (route.request()) {
    // calling route.request().frame() here would THROW
    route.fulfill(new Route.FulfillOptions()
      .setStatus(200)
      .setContentType("text/plain")
      .setBody("from sw"));
  } else {
    route.resume();
  }
});

Known Limitations

Requests for updated Service Worker main script code currently cannot be routed (https://github.com/microsoft/playwright/issues/14711).