OpenAI Breaks Robots.txt Pact: Website Content Access Permanently Changed

OpenAI has removed robots.txt compliance for ChatGPT User, changing how live ChatGPT browsing interacts with websites and limiting traditional control for site owners.


OpenAI has implemented a significant policy change that redefines how websites can control access by ChatGPT. The core of the update is the removal of the robots.txt compliance requirement for the ChatGPT User agent—the agent responsible for user-initiated browsing inside ChatGPT, Custom GPTs, and GPT Actions.

This quiet, technical change positions ChatGPT User as a proxy for human browsing, explicitly stating that traditional robots.txt rules "may not apply." This marks a critical evolution in how AI systems retrieve and interpret live web content.


The Core Technical Update

Previously, all major OpenAI user agents were implicitly or explicitly documented as adhering to robots.txt. The new documentation draws a definitive line between autonomous crawling and user-driven access.

Agents That Maintain Robots.txt Compliance

Website owners retain full control over these agents via robots.txt:

  • GPTBot: Primarily used for collecting data for training future AI models.
  • OAI-SearchBot: Used for discovery and indexing related to AI search features.

The Agent That Bypasses Robots.txt

  • ChatGPT User: This agent is now functionally defined as a technical extension of the user. It performs real-time browsing based on user prompts, is integrated into Custom GPTs, and executes GPT Actions.

OpenAI’s rationale is that because these actions are user-initiated, the agent is not an autonomous indexing or scraping system, and therefore falls outside the rules intended for general-purpose crawlers.
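
This traffic is still visible in server logs. OpenAI publishes user-agent strings for its agents, and requests from user-initiated browsing carry a ChatGPT-User token in the User-Agent header. Below is a minimal Python sketch for flagging such requests in an access log; the exact token substring and the combined log format are assumptions to verify against OpenAI's current documentation and your server's configuration.

# Minimal sketch: scan an access log for requests whose User-Agent
# mentions ChatGPT-User. Assumes a combined-format log where the
# user agent is the final quoted field; the exact token string is
# published by OpenAI and may change between versions.
import re

UA_TOKEN = "ChatGPT-User"  # assumed substring; check OpenAI's published UA strings

def chatgpt_user_hits(log_path: str) -> list[str]:
    hits = []
    with open(log_path) as f:
        for line in f:
            # In the combined log format, the user agent is the last quoted field.
            fields = re.findall(r'"([^"]*)"', line)
            if fields and UA_TOKEN in fields[-1]:
                hits.append(line.rstrip())
    return hits

if __name__ == "__main__":
    for hit in chatgpt_user_hits("access.log"):
        print(hit)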


The Critical Impact on Content Control

This policy has a direct and practical consequence: you can no longer use robots.txt alone to prevent interactive ChatGPT browsing on your site.

The Key Difference:

Even if you implement a restrictive robots.txt block designed to stop all AI training and indexing:

User-agent: GPTBot
Disallow: /

User-agent: OAI-SearchBot
Disallow: /

The ChatGPT User agent can still access your site in real-time whenever a user or a Custom GPT triggers a browsing prompt. Your content can be loaded, read, and summarized during live ChatGPT sessions, even with a strict opt-out policy against crawling.
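
The gap is easy to verify mechanically. The sketch below feeds the rules above to Python's standard urllib.robotparser: ChatGPT-User matches neither group, so even a fully compliant client would be allowed through; and per OpenAI's documentation, an explicit ChatGPT-User rule "may not apply" anyway.

# Sketch: parse the restrictive robots.txt above and check what a
# rules-respecting client would conclude for each agent.
from urllib.robotparser import RobotFileParser

rules = """\
User-agent: GPTBot
Disallow: /

User-agent: OAI-SearchBot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(rules.splitlines())

for agent in ("GPTBot", "OAI-SearchBot", "ChatGPT-User"):
    print(agent, "allowed:", parser.can_fetch(agent, "https://example.com/page"))

# GPTBot and OAI-SearchBot are blocked; ChatGPT-User falls through to the
# implicit allow-all default. And per OpenAI, an explicit ChatGPT-User
# rule "may not apply" either.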


Why This Matters for Website Owners and SEO

The removal of robots.txt compliance for this interactive agent fundamentally breaks the long-standing model of web governance.

1. Robots.txt is No Longer a Universal AI Control

For decades, robots.txt has served as the universal mechanism for expressing crawler access and indexing preferences. That mechanism now fails for a primary, interactive access path.

  • Robots.txt still controls training data collection (GPTBot) and AI search indexing (OAI-SearchBot).
  • It no longer controls interactive content fetching by ChatGPT User.

2. Unpredictable Traffic and Content Flow

From a server perspective, traffic generated by the ChatGPT User agent is challenging to manage. It originates from OpenAI's infrastructure, does not behave like typical browser traffic, and can spike rapidly when many users query the same source. It also enables:

  • Real-Time Summarization: Your content is fetched and interpreted instantaneously for the user.
  • Indirect Content Reuse: While not used for long-term training, your content is actively repurposed and recontextualized within the user's ChatGPT session.
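
Because this traffic is bursty and user-driven rather than scheduled, per-source throttling is a sensible first line of defense. Below is a minimal sketch of a sliding-window rate limiter keyed on client IP; the window and cap values are illustrative, not recommendations.

# Sketch: a sliding-window rate limiter keyed on client IP, the kind of
# per-source throttle that can absorb bursts of agent-driven fetches.
import time
from collections import defaultdict, deque

WINDOW_SECONDS = 60
MAX_REQUESTS = 30  # illustrative cap per source per window

_history: dict[str, deque] = defaultdict(deque)

def allow_request(source_ip: str) -> bool:
    now = time.monotonic()
    window = _history[source_ip]
    # Drop timestamps that have slid out of the window.
    while window and now - window[0] > WINDOW_SECONDS:
        window.popleft()
    if len(window) >= MAX_REQUESTS:
        return False  # over the cap: reject or defer this request
    window.append(now)
    return True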

3. Shift from Policy to Technical Enforcement

The change dictates that website owners must now manage two entirely separate control layers to enforce a comprehensive AI access policy:

Control Layer | Purpose | Required Tool
------------- | ------- | -------------
Robots.txt | Governing training and AI search indexing | File-level policy
Server-side controls | Governing interactive access (ChatGPT User) | User-agent filtering, IP rate limiting, WAF/CDN rules

To restrict the ChatGPT User agent, owners must now implement more sophisticated systems, such as IP-level blocking or WAF-based filtering, since robots.txt is insufficient.
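
As an illustration of the user-agent filtering layer, here is a minimal WSGI middleware sketch that refuses requests whose User-Agent contains the ChatGPT-User token. The token substring is an assumption based on OpenAI's published agent names, and headers can be spoofed, so IP-range or WAF/CDN rules remain the stronger enforcement point.

# Sketch: WSGI middleware that returns 403 for requests whose
# User-Agent header contains the ChatGPT-User token. User-agent
# filtering is best-effort (headers can be spoofed), so pair it
# with IP-range or WAF/CDN rules for real enforcement.
BLOCKED_UA_TOKENS = ("ChatGPT-User",)  # assumed token; verify against OpenAI's docs

class BlockAgentMiddleware:
    def __init__(self, app):
        self.app = app

    def __call__(self, environ, start_response):
        ua = environ.get("HTTP_USER_AGENT", "")
        if any(token in ua for token in BLOCKED_UA_TOKENS):
            start_response("403 Forbidden", [("Content-Type", "text/plain")])
            return [b"Interactive AI agent access is not permitted.\n"]
        return self.app(environ, start_response)

# Usage, e.g. wrapping a Flask app: app.wsgi_app = BlockAgentMiddleware(app.wsgi_app)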


The Signal for the Future of AI Access

OpenAI's update signals a broader technological shift in the industry:

  • Move to On-Demand Retrieval: AI systems are moving away from the classic model of bulk background crawling towards real-time, on-demand content fetching.
  • Blurred Boundaries: The line between human browsing and sophisticated agent browsing is intentionally being blurred to facilitate deeper integration of LLMs with live web data.

Robots.txt was not designed for this era of interactive, tool-using LLMs. This policy change explicitly confirms that the established model of web governance is evolving rapidly to accommodate new forms of delegation and retrieval.


OpenAI’s removal of robots.txt compliance for ChatGPT User is a technical access-model change that fundamentally shifts the balance of control. Website owners must now operate a dual control system: robots.txt for training and indexing, and server-side rules for real-time interaction. The traditional robots.txt file alone can no longer express a complete content access policy for AI systems.