AI Preferences L. Peiyuan
Internet-Draft Condé Nast
Intended status: Experimental 20 April 2025
Expires: 22 October 2025
Protocol Extension for Automation Control
draft-liao-aipref-autoctl-ext-01
Abstract
This document specifies extensions to [CORE-SPEC], providing a wider
range of controls for server-side automation permissions. It
supports rate limiting, automation technology restrictions, API
permissions, session requirements, and HTML asset annotations. These
extensions enable service owners to exercise more granular control
over automated interactions.
About This Document
This note is to be removed before publishing as an RFC.
The latest revision of this draft can be found at
https://datatracker.ietf.org/doc/draft-liao-aipref-autoctl-ext/.
Status information for this document may be found at
https://datatracker.ietf.org/doc/draft-liao-aipref-autoctl-ext/.
Discussion of this document takes place on the AI Preferences Working
Group mailing list (mailto:ai-control@ietf.org), which is archived at
https://mailarchive.ietf.org/arch/browse/ai-control/. Subscribe at
https://www.ietf.org/mailman/listinfo/ai-control/.
Status of This Memo
This Internet-Draft is submitted in full conformance with the
provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF). Note that other groups may also distribute
working documents as Internet-Drafts. The list of current Internet-
Drafts is at https://datatracker.ietf.org/drafts/current/.
Internet-Drafts are draft documents valid for a maximum of six months
and may be updated, replaced, or obsoleted by other documents at any
time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as "work in progress."
This Internet-Draft will expire on 22 October 2025.
Peiyuan Expires 22 October 2025 [Page 1]
Internet-Draft aipref-autoctl-ext April 2025
Copyright Notice
Copyright (c) 2025 IETF Trust and the persons identified as the
document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal
Provisions Relating to IETF Documents (https://trustee.ietf.org/
license-info) in effect on the date of publication of this document.
Please review these documents carefully, as they describe your rights
and restrictions with respect to this document. Code Components
extracted from this document must include Revised BSD License text as
described in Section 4.e of the Trust Legal Provisions and are
provided without warranty as described in the Revised BSD License.
Table of Contents
1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . 2
1.1. Relationship to Core Specification . . . . . . . . . . . 3
2. Conventions and Terminology . . . . . . . . . . . . . . . . . 3
3. Extended Protocol Specification . . . . . . . . . . . . . . . 4
3.1. Rate Limiting . . . . . . . . . . . . . . . . . . . . . . 4
3.2. Automation Technology Restrictions . . . . . . . . . . . 4
3.2.1. Protocol Tokens . . . . . . . . . . . . . . . . . . . 5
3.2.2. Runtime Tokens . . . . . . . . . . . . . . . . . . . 6
3.3. API and XHR Permissions . . . . . . . . . . . . . . . . . 6
3.4. Session Requirements . . . . . . . . . . . . . . . . . . 7
3.5. HTML Asset Annotation . . . . . . . . . . . . . . . . . . 8
4. Formal Syntax . . . . . . . . . . . . . . . . . . . . . . . . 9
5. Backward Compatibility . . . . . . . . . . . . . . . . . . . 11
6. Implementation and Enforcement . . . . . . . . . . . . . . . 12
7. Security Considerations . . . . . . . . . . . . . . . . . . . 13
8. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 13
9. References . . . . . . . . . . . . . . . . . . . . . . . . . 13
9.1. Normative References . . . . . . . . . . . . . . . . . . 13
9.2. Informative References . . . . . . . . . . . . . . . . . 13
Sample Extended automation-preferences.txt File . . . . . . . . . 14
Author's Address . . . . . . . . . . . . . . . . . . . . . . . . 15
1. Introduction
Often, the behaviors of automated systems may exhibit a certain
pattern that is hard to characterize with only the HTTP method and
user-agent used by [CORE-SPEC]. This document introduces a set of
extensions for service owners to specify more complex and granular
automation policies involving rate limiting, particularly prevalent
automation technologies, API permissions, session requirements, and
annotations on HTML assets.
Peiyuan Expires 22 October 2025 [Page 2]
Internet-Draft aipref-autoctl-ext April 2025
1.1. Relationship to Core Specification
All directives and mechanisms defined in the core specification
remain valid and are not redefined here. This document assumes
familiarity with [CORE-SPEC] and uses its terminology and concepts
throughout.
The extensions defined in this document are OPTIONAL for both servers
and clients. Implementations that support only the core
specification are considered compliant with the automation-
preferences.txt protocol, though they will not benefit from the
granular controls defined here.
When both core and extended directives are present in an automation-
preferences.txt file, parsers that do not support the extensions
defined in this document MUST ignore the unrecognized directives, as
specified in the core specification's extension-mechanism.
2. Conventions and Terminology
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and
"OPTIONAL" in this document are to be interpreted as described in BCP
14 [RFC2119] [RFC8174] when, and only when, they appear in all
capitals, as shown here.
This document uses the terminology defined in [CORE-SPEC]. The
following additional terms are introduced:
* Rate limiting: Constraints on the frequency or concurrency of
automated requests.
* Automation technology: Specific tools or frameworks used for
automation, such as headless browsers or browser automation
protocols.
* XHR/Fetch: XMLHttpRequest or Fetch API calls performed
programmatically.
* Session validation: Mechanisms to verify that automated requests
are part of a legitimate user session.
* Asset annotation: Metadata embedded within HTML documents to
specify automation policies for individual content elements.
Peiyuan Expires 22 October 2025 [Page 3]
Internet-Draft aipref-autoctl-ext April 2025
3. Extended Protocol Specification
This section defines additional directives that extend the
automation-preferences.txt protocol. These directives may be used
alongside the core directives in any group within the automation-
preferences.txt file.
Unless stated otherwise, the empty form of any comma-separated list
means "explicitly none", distinct from omitting the directive.
3.1. Rate Limiting
The following directives are defined:
* request-limit: Specifies the maximum number of requests allowed
within a time period, expressed as a count followed by a time unit
(e.g., "60/minute"). Supported time units are "second", "minute",
"hour", and "day".
* concurrent-limit: Specifies the maximum number of concurrent
connections allowed from a single client.
Example:
request-limit: 60/minute
concurrent-limit: 5
Figure 1
Rate limiting directives apply to all requests within the scope of
the group, regardless of HTTP method. If no rate limiting directives
are specified, clients SHOULD NOT assume any specific rate limits.
3.2. Automation Technology Restrictions
Automation technology directives specify whether particular
automation capabilities are permitted. Automation mechanisms fall
into two orthogonal layers: _protocol_ (the wire format exchanged
with the browser) and _runtime_ (whether the browser runs headless or
headed). Tokens defined in Section 3.2.1 and Section 3.2.2 MAY be
mixed in any order.
The following directive is defined:
Peiyuan Expires 22 October 2025 [Page 4]
Internet-Draft aipref-autoctl-ext April 2025
* allowed-automations: A comma-separated list of tokens representing
permitted automation mechanisms. If this directive is present but
empty, it implies that no standard tokens are explicitly
permitted. If the directive is absent, clients SHOULD NOT assume
any mechanism is allowed.
Default behavior: when the allowed-automations directive is:
present and empty no standard tokens are permitted;
absent the server makes no statement, and clients SHOULD assume the
most restrictive stance (i.e., that no automation mechanism is
authorised).
This aligns the silent default with that of api-automation, allow-
xhr, and disallow-fetch-from, ensuring a single, predictable "fail-
closed" rule across all extension directives.
A list-valued directive that appears as name: followed only by
whitespace or a comment MUST be parsed as an empty list, not a syntax
error. This applies to allowed-automations and all other comma-
separated lists in the core or extension specs.
Example (allowing WebDriver traffic in headless mode):
allowed-automations: webdriver, headless
Figure 2
If a token is not listed in a present allowed-automations directive,
clients SHOULD assume that the use of that mechanism is not permitted
within the specified scope. Implementations SHOULD respect these
directives when applicable, even though detection techniques may
differ.
The directive accepts any string token, but the following identifiers
are suggested for interoperability.
This list is not exhaustive. Future specifications MAY define
additional standard tokens.
3.2.1. Protocol Tokens
* webdriver: Automation using the W3C WebDriver wire protocol
[WEBDRIVER].
* cdp: Automation using the Chrome DevTools Protocol [CDP].
Peiyuan Expires 22 October 2025 [Page 5]
Internet-Draft aipref-autoctl-ext April 2025
* webkit-remote: Automation using the WebKit Remote Debugging
Protocol [WKREMOTE].
* firefox-remote: Automation using the Firefox Remote Debug Protocol
via GeckoDriver [FFREMOTE].
3.2.2. Runtime Tokens
* headless: Browser runs with no visible UI [HEADLESS].
* headed: Browser UI is presented to the user.
3.3. API and XHR Permissions
API and XHR permission directives specify rules for API usage and
automated use of XMLHttpRequest, Fetch, or AJAX. The following
directives are defined:
* api-automation: Indicates how API endpoints may be accessed by
automated clients. Valid values are:
- _none_: No API automation is permitted.
- _with-key-only_: API automation is permitted only with proper
authentication.
- _open_: API automation is generally permitted.
* allow-xhr: Indicates how XMLHttpRequest or Fetch API may be used
by automated clients. Valid values are:
- _none_: No XHR/Fetch automation is permitted.
- _read-only_: Only GET requests are permitted via XHR/Fetch.
- _open_: XHR/Fetch automation is generally permitted.
* disallow-fetch-from: Comma-separated list of URL patterns from
which automated XHR/Fetch requests are prohibited. Wildcards MAY
be used.
Example:
api-automation: with-key-only
allow-xhr: read-only
disallow-fetch-from: /account/*, /checkout/*, /admin/*
Figure 3
Peiyuan Expires 22 October 2025 [Page 6]
Internet-Draft aipref-autoctl-ext April 2025
If API and XHR permission directives are not specified, clients
SHOULD assume the most restrictive value (i.e., "none" for api-
automation and allow-xhr).
3.4. Session Requirements
Session requirement directives specify whether automated requests
must be part of a legitimate user session. The following directives
are defined:
* require-human-initiated-session: Boolean value indicating whether
automated requests must be part of a session that was initiated by
a human user.
* session-validation: Specifies the method used to validate
sessions. Valid values are:
- _cookie-based_: Sessions are validated using HTTP cookies.
- _token-based_: Sessions are validated using authentication
tokens.
- _oauth_: Sessions are validated using OAuth.
- _none_: No session validation is required.
* session-ttl: Specifies the maximum time-to-live for a session,
expressed as a duration (e.g., "30m", "2h", "1d").
A duration literal consists of a sequence of decimal digits
immediately followed by a single unit character. Units are mapped
as follows:
- s - seconds (1 <= value <= 86400)
- m - minutes (1 <= value <= 1440)
- h - hours (1 <= value <= 168)
- d - days (1 <= value <= 365)
Implementations MUST reject values outside these ranges with a
client or server error appropriate to their role.
Example:
Peiyuan Expires 22 October 2025 [Page 7]
Internet-Draft aipref-autoctl-ext April 2025
require-human-initiated-session: true
session-validation: cookie-based
session-ttl: 1h
Figure 4
If session requirement directives are not specified, clients SHOULD
NOT assume any specific session requirements, but SHOULD include a
valid User-Agent header in all requests.
3.5. HTML Asset Annotation
Automation preferences MAY be embedded directly within HTML documents
to annotate individual assets. Authors SHOULD use structured data
markup using JSON-LD [JSON-LD11] embedded in a
Figure 5
When both an automation-preferences.txt file and HTML asset
annotations are present, the more specific rule (typically the HTML
annotation) SHALL be applied to the corresponding content asset.
Clients supporting HTML asset annotations SHOULD parse and respect
these annotations when present.
The annotation schema MAY include any directives defined in the core
or extension specifications. Fields in the annotation SHOULD use
camelCase naming to align with JSON-LD conventions (e.g.,
allowedAutomations for allowed-automations, requestLimit for request-
limit), while maintaining semantic equivalence to the corresponding
directives in the automation-preferences.txt file.
Peiyuan Expires 22 October 2025 [Page 8]
Internet-Draft aipref-autoctl-ext April 2025
4. Formal Syntax
Below is an Augmented Backus-Naur Form (ABNF) description,
incorporating the extensions defined in this document. It extends
the grammar defined in the core specification Part formal-syntax of
[CORE-SPEC].
Rules inherited or unmodified from the core specification are
included for completeness. Rules added or modified by this extension
are indicated with comments (`; Extension directive` or `; Extended
rule`).
automation-preferences = *( group )
group = 1*scope-directive ; at least one
*( directive / emptyline )
1*emptyline ; blank line terminates group
directive = scope-directive / host-directive /
user-agent-directive /
method-directive / purpose-directive /
; Extended rule: Added extension directives
request-limit-directive / concurrent-limit-directive /
allowed-automations-directive / api-automation-directive /
allow-xhr-directive / disallow-fetch-from-directive /
require-human-session-directive / session-validation-directive /
session-ttl-directive
; --- Core directives (from CORE-SPEC) ---------------------
scope-directive = *WS "scope" *WS ":" *WS url-pattern EOL
host-directive = *WS "host" *WS ":" *WS host-pattern EOL
method-directive = *WS "allowed-methods" *WS ":" *WS method-list EOL
purpose-directive = *WS "allowed-purposes" *WS ":" *WS purpose-list EOL
user-agent-directive = *WS "user-agent" *WS ":" *WS product-token
*( *WS "," *WS product-token ) EOL
; --- Extension directives ---------------------------------
request-limit-directive = *WS "request-limit" *WS ":" *WS rate-spec EOL
concurrent-limit-directive = *WS "concurrent-limit" *WS ":" *WS count EOL
allowed-automations-directive = *WS "allowed-automations" *WS ":" *WS automation-list EOL
api-automation-directive = *WS "api-automation" *WS ":" *WS api-automation-value EOL
allow-xhr-directive = *WS "allow-xhr" *WS ":" *WS allow-xhr-value EOL
disallow-fetch-from-directive = *WS "disallow-fetch-from" *WS ":" *WS url-pattern-list EOL
require-human-session-directive = *WS "require-human-initiated-session" *WS ":" *WS boolean EOL
session-validation-directive = *WS "session-validation" *WS ":" *WS session-validation-value EOL
session-ttl-directive = *WS "session-ttl" *WS ":" *WS duration EOL
Peiyuan Expires 22 October 2025 [Page 9]
Internet-Draft aipref-autoctl-ext April 2025
; --- Directive value syntax (Core) ------------------------
url-pattern = 1*( VCHAR / UTF8-char-noctl )
host-pattern = 1*( ALPHA / DIGIT / "-" / "." / UTF8-char-noctl )
method-list = method *( *WS "," *WS method )
method = "GET" / "HEAD" / "POST" / "PUT" /
"DELETE" / "PATCH" / "OPTIONS" /
"TRACE" / "CONNECT"
purpose-list = purpose-token *( *WS "," *WS purpose-token )
purpose-token = 1*VCHAR ; placeholder for future vocabulary
product-token = identifier / "*"
identifier = 1*( %x2D / %x41-5A / %x5F / %x61-7A )
; --- Directive value syntax (Extension) -------------------
rate-spec = count "/" time-unit
count = 1*DIGIT
time-unit = "second" / "minute" / "hour" / "day"
automation-list = *( automation-token *( *WS "," *WS automation-token ) )
automation-token= 1*VCHAR ; e.g., "cdp", "headless", "selenium"
api-automation-value = "none" / "with-key-only" / "open"
allow-xhr-value = "none" / "read-only" / "open"
url-pattern-list= url-pattern *( *WS "," *WS url-pattern )
boolean = "true" / "false"
session-validation-value = "cookie-based" / "token-based" / "oauth" / "none"
duration = 1*DIGIT time-unit-char
time-unit-char = "s" / "m" / "h" / "d" ; s=second, m=minute,
; h=hour, d=day
; --- Lexical primitives (from CORE-SPEC) ------------------
comment = "#" *( UTF8-char-noctl / WS / "#" )
emptyline = *WS [comment] EOL
EOL = *WS [comment] NL
NL = CRLF / LF / CR
CRLF = CR LF
CR = %x0D
LF = %x0A
WS = SP / HTAB
SP = %x20
Peiyuan Expires 22 October 2025 [Page 10]
Internet-Draft aipref-autoctl-ext April 2025
HTAB = %x09
; --- Core ABNF terminals (RFC 5234) -----------------------
ALPHA = %x41-5A / %x61-7A
DIGIT = %x30-39
VCHAR = %x21-7E
; --- UTF-8 (derived from RFC 3629) ------------------------
UTF8-char-noctl = UTF8-1-noctl / UTF8-2 / UTF8-3 / UTF8-4
UTF8-1-noctl = %x21 / %x22 / %x24-7F
UTF8-2 = %xC2-DF UTF8-tail
UTF8-3 = %xE0 %xA0-BF UTF8-tail
/ %xE1-EC UTF8-tail-2
/ %xED %x80-9F UTF8-tail
/ %xEE-EF UTF8-tail-2
UTF8-4 = %xF0 %x90-BF UTF8-tail-2
/ %xF1-F3 UTF8-tail-3
/ %xF4 %x80-8F UTF8-tail-2
UTF8-tail = %x80-BF
UTF8-tail-2 = UTF8-tail UTF8-tail
UTF8-tail-3 = UTF8-tail UTF8-tail UTF8-tail
5. Backward Compatibility
All directives defined in this document are OPTIONAL.
Implementations that support only the core specification can safely
ignore these directives.
Implementations supporting these extensions SHOULD degrade gracefully
when interacting with servers or clients that support only the core
specification.
* Servers supporting extensions SHOULD still process all core
directives correctly, even if extended directives are also
present.
* Clients supporting extensions SHOULD still honor all core
directives, even if they do not recognize extended directives in a
file.
* When HTML asset annotations are not supported by a client, the
client SHOULD fall back to the site-level automation-
preferences.txt file for guidance.
Peiyuan Expires 22 October 2025 [Page 11]
Internet-Draft aipref-autoctl-ext April 2025
6. Implementation and Enforcement
Servers implementing the extensions defined in this document SHOULD:
* Employ detection mechanisms (e.g., CDP fingerprinting, headless
browser detection) to identify automated clients using specific
technologies.
* Implement rate limiting according to the specified directives.
* Validate sessions as required by the session requirement
directives.
* Process HTML asset annotations when interpreting automation
policies for specific content.
* Respond with appropriate HTTP status codes [RFC9110] for non-
compliant requests, such as:
- 429 Too Many Requests for rate limit violations.
- 403 Forbidden for unauthorized automation technology use.
- 401 Unauthorized for missing or invalid authentication.
Clients supporting these extensions SHOULD:
* Honor rate limiting directives by self-throttling requests.
* Respect automation technology restrictions by only using tools
listed in the allowed-automations directive, if present.
* Adhere to API and XHR permissions as specified (api-automation,
allow-xhr, disallow-fetch-from).
* Establish and maintain valid sessions when required (require-
human-initiated-session, session-validation, session-ttl).
* Parse and respect HTML asset annotations when present.
Both servers and clients MAY implement additional detection and
enforcement mechanisms beyond those explicitly described in this
document, as long as they maintain compatibility with the specified
directives.
Peiyuan Expires 22 October 2025 [Page 12]
Internet-Draft aipref-autoctl-ext April 2025
7. Security Considerations
In addition to the security considerations mentioned in [CORE-SPEC],
the extensions defined in this document introduce the following
considerations:
* Rate Limiting: Implementations of rate limiting SHOULD use secure
methods to track request counts and prevent circumvention through
IP spoofing or other means.
* Technology Detection: Methods used to detect specific automation
technologies can be circumvented by sophisticated clients.
Servers SHOULD employ multiple detection approaches and adapt to
evolving evasion techniques.
* Session Validation: Session validation mechanisms SHOULD be
resistant to replay attacks and session hijacking attempts.
* HTML Asset Annotations: Parsing of JSON-LD annotations MUST be
performed securely to prevent injection attacks or denial-of-
service through malformed input.
8. IANA Considerations
This document has no IANA actions.
9. References
9.1. Normative References
[CORE-SPEC]
Liao, P., "Protocol for Automation Control", Work in
Progress, Internet-Draft, draft-liao-aipref-autoctl-core-
01, April 2025, .
[RFC2119] Bradner, S., "Key words for use in RFCs to Indicate
Requirement Levels", BCP 14, RFC 2119,
DOI 10.17487/RFC2119, March 1997,
.
[RFC8174] Leiba, B., "Ambiguity of Uppercase vs Lowercase in RFC
2119 Key Words", BCP 14, RFC 8174, DOI 10.17487/RFC8174,
May 2017, .
9.2. Informative References
Peiyuan Expires 22 October 2025 [Page 13]
Internet-Draft aipref-autoctl-ext April 2025
[CDP] Google, "Chrome DevTools Protocol", 2025,
.
[FFREMOTE] Mozilla, "GeckoDriver Remote Protocol", 2025,
.
[HEADLESS] Google, "Chrome Headless Mode", 2024,
.
[JSON-LD11]
Kellogg, G., Champin, P.-A., and D. Longley, "JSON-LD
1.1", W3C Recommendation REC-json-ld11-20200716, 16 July
2020, .
[RFC9110] Fielding, R., Ed., Nottingham, M., Ed., and J. Reschke,
Ed., "HTTP Semantics", STD 97, RFC 9110,
DOI 10.17487/RFC9110, June 2022,
.
[WEBDRIVER]
W3C, "WebDriver", 2018,
.
[WKREMOTE] WebKit Project, "WebKit Remote Debugging Protocol", 2015,
.
Sample Extended automation-preferences.txt File
The following is an example of an automation-preferences.txt file
that includes both core and extended directives:
Peiyuan Expires 22 October 2025 [Page 14]
Internet-Draft aipref-autoctl-ext April 2025
user-agent: *
host: example.com
scope: /
allowed-methods: GET, HEAD
allowed-purposes: PLACEHOLDER_PURPOSE1, PLACEHOLDER_PURPOSE2
request-limit: 60/minute
concurrent-limit: 5
allowed-automations:
api-automation: with-key-only
require-human-initiated-session: true
session-validation: cookie-based
session-ttl: 1h
user-agent: ExampleBot
host: example.com
scope: /admin/
allowed-methods: GET
allowed-purposes: PLACEHOLDER_PURPOSE1
request-limit: 10/minute
concurrent-limit: 2
require-human-initiated-session: true
session-validation: token-based
session-ttl: 30m
user-agent: *
host: example.com
scope: /admin/
allowed-methods: GET
Figure 6
Author's Address
Peiyuan Expires 22 October 2025 [Page 15]
Internet-Draft aipref-autoctl-ext April 2025
Liao Peiyuan
Condé Nast
United States of America
Email: peiyuan_liao@condenast.com
Peiyuan Expires 22 October 2025 [Page 16]