Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Performance control of embedded content #911

Open
wants to merge 37 commits into
base: main
Choose a base branch
from
Open
Changes from 6 commits
Commits
Show all changes
37 commits
Select commit Hold shift + click to select a range
9918ead
Create explainer.md
nishitha-burman Nov 21, 2024
e8fe860
Merge branch 'MicrosoftEdge:main' into performance-control-of-embedde…
nishitha-burman Nov 21, 2024
360c2c5
Nits
lflores-ms Nov 22, 2024
db0d5ff
Migrate and expand on sections 5.1, 5.2, 5.3
lflores-ms Nov 22, 2024
57ee232
Add acknowledgements section
nishitha-burman Nov 22, 2024
d7ee3d0
Nit
nishitha-burman Nov 22, 2024
bcad7e9
Fixed typo
nishitha-burman Nov 26, 2024
b9110f2
Fix typo
lflores-ms Dec 7, 2024
2878ba2
Re-order non-goals section
lflores-ms Dec 9, 2024
797c0e0
Add 'remove burden' to goals
lflores-ms Dec 9, 2024
820e6a3
Re-arrange sections
lflores-ms Dec 9, 2024
6ca66f1
Category list spacing
lflores-ms Dec 17, 2024
240131b
Category list spacing
lflores-ms Dec 17, 2024
94ed1f8
Category list formatting
lflores-ms Dec 17, 2024
89e1266
add internal section link: negotiation
lflores-ms Dec 17, 2024
13ae7b8
Add link to header docs
lflores-ms Dec 17, 2024
08db464
Add single category example
lflores-ms Dec 17, 2024
8e92d81
Grammar fixes
lflores-ms Dec 19, 2024
0938264
Re-word alternatives considered
lflores-ms Dec 19, 2024
a366a0a
Grammar and formatting edits.
nishitha-burman Dec 30, 2024
1042bbb
Acknowledgements & spelling
nishitha-burman Dec 30, 2024
79c8fe9
Expand on use cases and scenarios section
nishitha-burman Dec 31, 2024
904b816
Remove concrete numbers for limits & fix typos in proposal table
nishitha-burman Dec 31, 2024
235ed21
Add more context about what to do with reports in the intro section
nishitha-burman Dec 31, 2024
34ccf2d
Add note in proposed solution about source of proposal
nishitha-burman Dec 31, 2024
f239784
Clean up description of basic category
nishitha-burman Jan 1, 2025
6d94c66
Add context to "what should be standardized" section
nishitha-burman Jan 1, 2025
055b3cf
Clarify non-goals section
nishitha-burman Jan 1, 2025
d1462c6
Clarify the different script categories
nishitha-burman Jan 1, 2025
da9c878
Fix formatting
nishitha-burman Jan 1, 2025
5775239
Added open questions
nishitha-burman Jan 1, 2025
972aade
Nit
nishitha-burman Jan 7, 2025
f53e432
Nit
nishitha-burman Jan 7, 2025
2889db6
Re-work 'best practices' section
lflores-ms Jan 7, 2025
48385a8
Merge Proposed Solution/API Design Details
lflores-ms Jan 7, 2025
64c98ed
Small edit in goals section
nishitha-burman Jan 7, 2025
d29de03
Merge branch 'performance-control-of-embedded-content' of https://git…
nishitha-burman Jan 7, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
157 changes: 157 additions & 0 deletions Performance control of embedded content/explainer.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,157 @@
# Performance control of embedded content

## Authors
- [Nishitha Burman Dey](https://github.com/nishitha-burman)
- [Luis Flores](https://github.com/lflores-ms)
- [Andy Luhrs](https://github.com/aluhrs13)
- [Alex Russell](https://github.com/slightlyoff)

## Introduction
This document proposes platform functionality to give embedders (browsers, websites, hosting applications) the ability to put constraints on resources allocated to embedees (iframes, browsers, tabs, WebViews) to minimize the performance impact that embedded web content can have on an user’s device. Additionally, violations of the constraints will be reported to the embedder to inform and improve the ecosystem.

Embedder developers can do this by enabling various categories of criteria that constrain performance impacting features on the embedee.

## Goals
With global web usage continuing to rise and more companies relying on the web as a primary platform to deliver their applications, performance has become a critical factor for success. As more users access websites through mobile devices and lower-powered hardware, the need for fast responsive web experiences is non-negotiable[^1],[^2].
When it comes to optimizing performance, websites and apps are limited by the performance of the external content they embed, these can be 3rd party sites, 3rd party apps, and even content from other organizations within a company. As a result, being able to control the performance of embedded content is crucial to improving the overall performance of a site or app.
nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved
nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved
This proposal has two primary goals:
nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved
1. Improve users’ satisfaction with their OS, browser, and applications via formalizing methods of constraining the resources available to web content.
2. Provide information to help developers improve the performance of web sites and apps through reporting when performance is negatively impacting end-users and/or applications hosting the site in a frame.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I might swap the order of these two points, since the second goal is required to be achieved for the first goal to be achieved.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have it in that order to lead with and highlight the main driving factor, which is to improve end user experience. And you are right that the second goal is necessary to achieve the first. But I wanted to put emphasis on the end user impact.

nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved

### Scenarios
nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved
* Embedded widgets: Weather forecast, stock tickets, etc.
* Embedded Ads: Embedded ads from networks like Google AdSense or Bing Ads.
nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved
* Embedded calendars: Embedding calendars from services like Outlook Calendar, Google Calendar, etc.

## Proposed Solution
There are four categories (A, B, C, D) of performance impacting criteria that developers can enforce on embedded content. Based on the scenarios, the app can enable all or some of the categories.
nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved

| **Perf. Category** | **Criteria** | **Handling violations** |
| ------------- | -------- | ------------------- |
| **A: Basic**<br>**Description**: Basic web development best practices that are scenario-agnostic. | **- Text resources must be compressed** (HTML, CSS, JS, JSON).<br> **- Oversized unzipped assets are flagged:**<br>* Assets larger than 100KB embedded via `data:...` URLs.<br>* Image files larger than 500KB served in last generation formats.<br> * Web fonts that are larger than 300KB. | - Reporting violations via Reporting API.<br> - Assets not rendered.<br> UI indicator to block out images that are too large. |
nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved
| **B: Early-script**<br>**Description**: JavaScript constraints to enhance performance and minimize impact on user experience before interaction begins. | **- JS limits:**<br>* Total limits on JS served prior to user interaction: 2MB<br>* Scripts must contain `content-length` headers<br>* No non-compositor thread animations (e.g. animated SVGs, loading spinners, etc.). | - Report violations via Reporting API.<br> - Loading of scripts that violate the limit are paused/blocked.<br> - Pause/disconnect animations that are not visible, interacted with. |
| **C: Globals**<br>**Description:** Overall media and system resource usage constraints. | **- Cumulative resource consumption limits per interaction:**<br>* Caps on total media usage. No more than a total of X.<br>* Limits on iframe count. No more than a total of 10.<br>* Limits of iframe depth. No mroe than a depth of 10.<br>* CPU usage before first interaction: XMB. | - Report violations via Reporting API.<br>- Do not load media at all. <br>- Do not load iframes that surpass the depth. |
nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved
| **D: Script**<br>**Description:** Strict JavaScript restrictions. | **-Additional JS limits:**<br>* Long tasks in the main thread.<br>* High CPU usage.<br>* Workers with long tasks that exceed Xms.<br> | - Report violations via Reporting API.<br>- Stopping JavaScript if [in the background]. |

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should this imply early-script, as well?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Early-script refers to constraints during load vs. Script refers to constraints while running/post-load. Will add to explainer to clarify this.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sounds good, perhaps more specific naming could help in creating clearer scoping of these, since script generically feels like a superset of Early-script based on the naming alone

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, the specific names need to be worked on at some point. We're looking to start the discussion with the problem and solution we're proposing, and work the details like names as we get a better picture/direction. Do you think that's reasonable? Or should we find better names now to make the read easier?


### Discussion of different categories

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The table above was a bit hard to follow. I think the text in this section is a bit clearer to start with, so I might swap those sections

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Agreed

**A: Basic – Basic web development best practices that are scenario-agnostic:** This category covers fundamental web development best practices to ensure that websites are optimized for performance across all environments. This includes compressing text resources such as HTML, CSS JavaScript, and JSON to reduce load times and bandwidth usage, and compressing assets larger than 100KB that are embedded via `data: URLs` as they can slow down page rendering and increase resource consumption. Additionally, images should be served in modern, efficient formats, with any image files exceeding 500KB considered oversized and requiring optimization. Web fonts must also be kept under 300KB to avoid unnecessarily delaying page rendering.

**B: Early-script – JavaScript constraints to enhance performance and minimize impact on user experience before interaction begins:** This category focuses on JavaScript development best practices that can be done to minimize performance issues before user interaction begins. This includes capping JavaScript resources loaded initially to avoid overwhelming devices with limited processing power or bandwidth, and serving JavaScript with constrained content-length headers to ensure predictable resource delivery and prevent bloated downloads. Additionally, animations that don’t run on the compositor thread should be avoided, as they can trigger costly layout recalculations and choppy user experiences, especially during page load or scroll events.

**C: Globals – Overall media and system resource usage constraints:** This category entails imposing limits on overall media and system resource usage during interactions to help prevent websites from over-consuming resources and degrading user experiences. This includes capping total media usage and iframe count/nesting to avoid excessive memory consumption and rendering issues because it can slow down the page and make it unresponsive, particularly on lower-end devices or in resource-constrained environments.

**D: Script – Strict JavaScript restrictions:** This category enforces restrictions on more complex JavaScript to further enhance performance. This includes limiting long tasks running on the main thread as they block the event loop and degrade interactivity leading to slow response times, and capping high CPU usage tasks, particularly those involving workers that exceed certain execution times, to ensure they don’t monopolize system resources. These restrictions ensure that JavaScript execution remains lightweight and efficient, preventing detrimental performance impacts on the user experience.

## What should be standardized?
| **Layer of configuration** | **Standardize?** | **Notes** |
| -------------------------- | ---------------- | --------- |
| **Different categorizations of features:** Currently there are four and can expand in the future with new categories. | Yes | There needs to be alignment within the web community on what the key factors are that we want to allow restrictions for. This allows site developers to be on the same page and make tradeoffs accordingly. The definition for each category and number of categories need to be standardized. Standardizing this gives site developers an opportunity to optimize their performance regardless of the browser their end users are on. |
| **Mechanism to set restrictions:** How site embedder set constraints. | Yes | Related to the different categories, the mechanism should be the same across browsers so that sites work agnostic of the browser. |
| **Criteria for each category** | Yes | Currently, the criteria for each category is determined from observations and learnings from customer engagements. This may change or evolve with time. This should be standardized so that developers know what the expectations are across all browsers and the web platform. |
| **Limits for each criteria** | Yes | Some of the limits have been determined based on observations, use cases, etc. This should also be browser agnostic so embedee developers know what the expectations are. |
| **Reporting violations** | Yes | Similar to mechanism for setting restrictions, embedder developers should have the same expectations on getting violations across all browsers their site/app runs on. |
| **How violations are handled** | No | Embedder developers should be able to opt into default behavior when restrictions are violated. There is a plethora of things that can happen when restrictions are violated. Different levels of standardization can happen here. The web platform can provide a default option for how violations are handled e.g. standard can be “some UI indicator is shown when violations are made” but it doesn’t have to be a standard what exactly is the UI. |

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I wonder if we need this table at all, and can just move this one to a non-goal?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The purpose of this table to to ensure alignment on what should be standardized for the various layers of configuration. This may influence the API. Any feedback on how that could be better conveyed?


## Non-goals
nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved
The key factor of this solution is there are categories of focused, perf impacting features that developers can choose to enforce restrictions on their apps. The threshold and limits and specific criteria within a category may evolve over time and is determined by the platform. **The developer will not have control over granular values of each limit or individual criteria within a category. This is determined by the platform.**

## Proposed API Solution
Introduce [Document Policy](https://github.com/WICG/document-policy/blob/main/document-policy-explainer.md) configuration points, one for each of the categories above:
A: basic
nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved
B: early-script
C: globals
D: script

> **Note:** Names here are only monikers and expected to change.
nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved

This enables each document to:

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should this be the user agent as opposed to the document?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The intention is to enable documents to set restrictions on embedded content. Each document would be able to choose which policies to apply to which content, through Document Policy mechanism. While this could be leveraged by the User Agent in some "mode", we're focusing on exposing the capability as a Web Platform API for this explainer.

* Self-impose performance constraints.
* Negotiate constraints (see discussion section) for each subresource.
nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved

**Example**
A feeds app embeds content from different sources, through iframes. To cap the performance impact of the embedded content, the host application aligns with its producers on guidelines and best practices for the embeddees to be loaded into the experience, requiring the content to be served with an agreed upon subset of policies (categories above).
The host app serves its main document with Document Policy directives to enforce on embedded content:<br>
```
Require-Document-Policy: basic, early-script, globals, script

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we have a keyword to target all of them? Is that a use case we think will be common?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'd prefer not to have a keyword for all as the categories might change as the API evolves. We still need to get a better understanding of how that would work.

```

Alternatively, the app can set any policy subset to individual frames:<br>
```
<iframe policy=”basic”>
<iframe policy=”basic, early-script”>
```

As a result, requests to embedded content will be sent with `Sec-Required-Document-Policy` header matching the top-level document’s requested configuration. Per Document Policy design, embeddees receiving this request header must opt in for their content to be loaded, with the option to specify a reporting endpoint:
nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved

```
Document-Policy: basic, early-script, globals, script, *; report-to=endpoint
Reporting-Endpoints: endpoint="https://example.com/reports"
```

### Discussion
nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved
#### Using multiple Document Policy configuration points

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Since the example above includes multiple, I wonder if the example should only include one, and show how to use multiple in this section instead

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added a single-category example to the general API portion. This section is intended to provide the reasoning behind some of the choices shown in the proposed API shape.

Document Policy allows for a single value for each configuration point. Since the categories proposed by this solution are independent of each other, this necessitates multiple configuration points.

#### Opt-in and policy negotiation
Document Policy requires that each document opts in to the policies to be applied on such document. While this is a limitation to amount of control the embedder can have over the embeddee's performance, we consider this behavior from Document Policy as necessary due to the following reasons:
nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved
nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved

* **Control flow alteration.** Direct enforcement would allow for a document to impose changes to the control flow of unwitting embedded third parties.
* **Potential for side-channel attacks.** See more details in Security and Privacy Considerations.

Despite this limitation, Document Policy allows policy negotation which would allow the embedder to require the relevant policies in this proposal. Through this mechanism, the embedder can still require conformance from the embedee to be loaded, while keeping the embeddee in charge of its own control flow.

#### Negotiation vs enforcement

Document Policy proposes a mechanism for policy negotiation. An embeddee which doesn’t agree to the embedder’s policies will not be loaded. This document makes a distinction between this _negotiation_ (which might result in an embedee failing to load), and _enforcement_. Enforcement of the policy (what will happen when a violation occurs) is to be defined by each aspect of the proposed configuration points.

#### Per-document constraints
Document Policy directives are effective on a per-document basis. This generally makes it harder for constraints to have the desired impact to improve performance. In this document, we propose capping the overall complexity and resources through frame depth and count restrictions.
nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved

**Open question:** what would be the appropriate frame count and depth limit? See related discussion in Security and Privacy Considerations.
nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved

#### Open question: how to evolve best practices?
Performance categories introduced in this proposal are based on the idea of taking the burden of determining performance best practices off of individual site and app developers. However, best practices evolve with time, and for these policies to achieve their goal, the criteria needs to evolve with them. Changing the criteria for what constitutes a policy violation would introduce compatibility issues for anyone opting-in, as things which are allowed today might not be allowed in the future. We need to define a mechanism in which this evolution can happen in a controlled manner, or decide whether it's a reasonable trade off for developers opting in to be expected to keep up with the platform as best practices evolve.

#### Open question: reporting 3pp violations to embedder
nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved
Embedders are best equipped to influence change in the performance when they are aware of where the issues are. While Document Policy provides a Reporting API integration, this only reports violations to the endpoint of the document where the violation occurs. Embedders do not receive reports that the embedded content has incurred policy violations, which is a limitation. Currently under consideration: send a minimal report to the embedder when a violation occurs in the embedded document.
nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved

#### Open question: interaction with Heavy Ad Interventions

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Curious why this one doesn't have more details like the others?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We know we need to think through any interactions but haven't done it yet, so it's currently more a placeholder.


#### Open question: required policy and report-only mode
It is unclear from the Document Policy explainer whether a report-only header in an embedded document satifies the requirements set by `Sec-Required-Document-Policy` header.

### Security and Privacy Considerations

#### Global budgets and side-channel attacks
The criteria proposed in this document includes budgets which are shared globally across documents. This could allow for documents to learn things about cross-origin documents, as described in the [Never Slow Mode explainer](https://github.com/slightlyoff/never_slow_mode?tab=readme-ov-file#global-limits). We consider the same alternatives as NSM as viable for this proposal:
nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved
* Require CORS

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Might be worth explaining these in more detail

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These are explained in the linked NSM document. Should we describe the gist of it here, or more explicitly ask to read through the linked doc for the details?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Agree with Alison - I don't think we should be referencing the NSM doc like this, we should take the contents we care about from it instead.

* Fuzzy budgeting

#### Frame depth
As called out in the [Never Slow Mode explainer](https://github.com/slightlyoff/never_slow_mode?tab=readme-ov-file#global-limits), a limit greater than 2 would expose the depth in the treee of a document.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Might be worth explaining why this is a problem and/or how this is solved if so.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also brought in from NSM explainer. For the solution, we're looking for input on whether the 2-depth limit would be a reasonable constraint or if we'd need to find some sort of trade off.


### Dependencies on non-stable features
* Document Policy ([explainer](https://wicg.github.io/document-policy/))

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This section could use a bit more clarity and detail

- general: currently, only implemented by Chromium-based browsers.
- Document Policy negotiation: disabled by default.
- `policy` attribute: currently unimplemented.

### Alternatives considered

#### Custom attributes and headers
Using Document Policy for this proposal has limitations and challenges, including 3pp violation reporting, opt-in requirement, budget-based state leaks. A custom mechanism was briefly considered, but the following was determined for such approach:

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"the following was determined for such approach" -> "the following outline the reasons for moving away from such an approach"

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Resolved.

nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved

* Re-defines a mechanism for a problem already in the scope of Document Policy.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Missing subject to the sentence

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Resolved.

* The challenges in Document Policy are still applicable with a custom mechanism.
nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved
* Changes to iframe HTML element represent additional standards work.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"to the iframe"

Also should we update this to <iframe>?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Removed item from the list.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not sure additional standards work is a good reason to go with one approach and not another, so I'm not sure if that is worth listing

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The reasoning here was that even if we wanted to go with the custom mechanism option, we'd need to re-do a lot of the work that Document Policy already does. In particular, it seems like HTML spec changes are more challenging to get through, so defining our own would add that while not providing anything fundamentally better compared to Document Policy. I can remove this item.


#### Levels vs categories
nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved
It was considere to have a single configuration point based on “levels” which would restrictions on top of each other, but this was discarded due to increased difficulty to introduce new values in the future.
nishitha-burman marked this conversation as resolved.
Show resolved Hide resolved

## Acknowledgements
Many thanks for the valuable feedback and advice from:
* [Limin Zhu](https://github.com/liminzhu)


[^1]: https://infrequently.org/2023/02/the-market-for-lemons/
[^2]: https://www.thinkwithgoogle.com/marketing-strategies/app-and-mobile/page-load-time-statistics/