
Google Shares More Information On Googlebot Crawl Limits

Google’s Gary Illyes and Martin Splitt discussed Googlebot’s crawl limits, offering additional details about why the limits exist and revealing new information about how these limits can be adjusted upward or dialed down depending on needs and what’s being done.

Details About Googlebot Limits

Gary Illyes shared details about what goes on behind the scenes at Google to drive the various crawl limits, starting with Googlebot’s 15 megabyte limit.

He said that any crawler within Google has a 15 megabyte limit and explicitly stated that this limit can be overridden or switched off. In fact, he said that teams within Google regularly override that limit. He used the example of Google Search, which overrides that limit by dialing it down to 2 megabytes.

Illyes explained:

“I mean, there’s a bunch of things that are for our own protection or our infrastructure’s protection. Like for example, the infamous 15 megabyte default limit that’s set on the infrastructure level.

And basically any crawler that doesn’t override that setting is going to have a 15 megabyte limit. Basically it starts fetching the bytes from the server or whatever the server is sending. And then there’s an internal counter. And then when it reached 15 megabytes, then it basically stops receiving the bytes.

I don’t know if it closes the connection or not. I think it doesn’t close the connection. It just sends a response to the server that, OK, you can stop now. I’m good.

But then individual teams can override that. And that happens. It happens quite a bit. And for example, for Google Search, specifically for Google Search, the limit is overridden to 2 megabytes.”
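The mechanism he describes amounts to a byte-capped fetch: read the response in chunks, keep an internal counter, and stop receiving once a configurable limit is hit. Below is a minimal sketch of that idea in Python; the 15 megabyte default and the 2 megabyte Search override come from the episode, while the function name and the use of the requests library are illustrative assumptions, not Google’s actual implementation.

```python
# Minimal sketch of a byte-capped fetch, as Illyes describes it:
# count bytes with an internal counter and stop receiving at a limit.
import requests

DEFAULT_LIMIT = 15 * 1024 * 1024  # infrastructure-level default: 15 MB


def capped_fetch(url: str, byte_limit: int = DEFAULT_LIMIT) -> bytes:
    """Fetch a URL but keep at most byte_limit bytes of the body."""
    received = bytearray()
    with requests.get(url, stream=True, timeout=30) as resp:
        for chunk in resp.iter_content(chunk_size=64 * 1024):
            received.extend(chunk)
            if len(received) >= byte_limit:
                # The counter hit the limit: stop receiving further
                # bytes and truncate rather than fail the fetch.
                break
    return bytes(received[:byte_limit])


# A team overriding the default, as Search does (dialed down to 2 MB):
html = capped_fetch("https://example.com/", byte_limit=2 * 1024 * 1024)
```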

Limits On Googlebot Are For Infrastructure Protection

Illyes next shared an example where the 15 megabyte limit is overridden to increase the crawl limit, in this case for PDFs. This is where he discusses Googlebot limits in the context of protecting Google’s infrastructure from being overwhelmed by too much data.

He provided more details:

“Well, basically everything. Like, for example, for PDFs, it’s, I don’t know, 64 or whatever. Because PDFs can, like the HTTP standard, if you export it as PDF, I think you said that, if you export it as PDF, then it’s 96 megabytes or something.

But that means that it would overwhelm our infrastructure if we fetch the whole thing and then convert it to HTML, blah, blah, and then start processing it. It’s just like, it’s overwhelming because it’s so much data.

And same goes for HTML. It’s the HTML living standard. Like if you have like 14 megabytes, we’re not going to fetch that. We’re going to fetch the individual pages because fortunately, they also had enough brain power to have individual pages for individual features of HTML. We can fetch those pages, but we’re not going to get anything useful out of the 14 megabyte one-pager of the HTML standard.”

Other Google Crawlers Have Different Limits

At this point, Illyes revealed that other Google crawlers have different limits and that the documented limits aren’t hard limits across all of Google’s crawlers.

He continued:

“So yeah, and other crawlers, I never worked on other crawlers, but other crawlers I’m sure have different settings. I could imagine, for example, even in individual projects, it might have different settings for the same thing.

Like, for example, I can imagine that if we need to index something very fast, then the truncation limit could be one megabyte, for example. I don’t know if that’s the case, but I could imagine that to be the case. Because if you need to push something through the indexing pipeline within seconds, then it’s easier to deal with little data.”
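Taken together, this describes a shared infrastructure default that individual teams and projects override per use case. Below is a hedged sketch of how such overrides might be modeled; the class and field names are hypothetical, and the PDF and fast-indexing numbers are the speculative figures floated in the episode.

```python
# Hypothetical model of per-crawler overrides of a shared default.
from dataclasses import dataclass

INFRA_DEFAULT_LIMIT = 15 * 1024 * 1024  # infrastructure-wide 15 MB default


@dataclass(frozen=True)
class CrawlerConfig:
    name: str
    fetch_limit: int = INFRA_DEFAULT_LIMIT  # applies unless a team overrides it


# Overrides mentioned or speculated about in the episode:
search_html = CrawlerConfig("search-html", fetch_limit=2 * 1024 * 1024)   # dialed down
search_pdf = CrawlerConfig("search-pdf", fetch_limit=64 * 1024 * 1024)    # dialed up
fast_index = CrawlerConfig("fast-indexing", fetch_limit=1 * 1024 * 1024)  # Illyes’ guess
other_team = CrawlerConfig("some-other-product")  # inherits the 15 MB default
```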

Google’s Crawling Infrastructure Is Not Monolithic

This part of the Search Off The Record episode came to a close with Martin Splitt affirming that Google’s crawling infrastructure is flexible and far more diverse than what’s described in Google’s documentation, saying that it’s not monolithic. Monolithic literally means a large single stone and is used to describe something that is unchanging and consistent. By saying that Google’s crawlers aren’t monolithic, Splitt is affirming that they’re flexible in terms of fetch limits and other configurations.

He also zeroed in on describing Google’s crawling infrastructure as software as a service.

Splitt summarized the takeaways:

“That’s true. That’s true. I think fundamentally, it’s useful to have cleared up this idea of crawling just being like a monolithic kind of thing. It’s more like a software as a service that Search, or web search specifically, is one client to and not like a monolithic kind of thing.

And as you said, like configuration can change. It can even change within, let’s say, Googlebot. If I’m looking for an image, we probably allow images to be larger than 2 megabytes, I guess, because images just are larger than 2 megabytes. PDFs, allow 64. Whatever is documented, we’ll link the documentation. But I think that makes perfect sense.

And if you think about it as in, it’s a service we call with a bunch of parameters, then it makes a lot more sense to see, OK, so there’s different configuration. And this configuration can change on request level, not necessarily just on like, Googlebot is always the same.”
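Splitt’s framing of crawling as a service called with parameters suggests that limits travel with each request rather than living in one global Googlebot setting. The sketch below illustrates that idea under stated assumptions; every name in it is hypothetical, only the 2 megabyte HTML and 64 megabyte PDF figures come from the quotes above, and the image limit is invented for illustration.

```python
# Illustrative request-level configuration: the same crawling service
# applies a different limit depending on what a particular fetch is for.
from dataclasses import dataclass


@dataclass(frozen=True)
class FetchRequest:
    url: str
    content_type: str  # "html", "image", or "pdf" in this sketch
    byte_limit: int    # chosen per request, not fixed per crawler


def plan_fetch(url: str, content_type: str) -> FetchRequest:
    limits = {
        "html": 2 * 1024 * 1024,    # Search HTML, per the episode
        "image": 20 * 1024 * 1024,  # hypothetical: images allowed to be larger
        "pdf": 64 * 1024 * 1024,    # the PDF figure mentioned above
    }
    return FetchRequest(url, content_type, limits[content_type])


# Two calls to the same service can carry different configurations:
page = plan_fetch("https://example.com/", "html")
photo = plan_fetch("https://example.com/hero.jpg", "image")
```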

Listen to the Search Off The Record episode from the 20-minute mark:

Featured Image by Shutterstock/BestForBest
