Search used to be a one-way conversation. You built a site, optimized it, and waited for a crawler to show up. The rules were clear.
It doesn’t work like that anymore.
In 2026, Technical SEO sits at the intersection of three overlapping systems: traditional search engines, AI Overviews, and large language models that crawl, interpret, and cite the web on their own terms.
I’ve seen sites rank on page one of Google while ChatGPT can’t find them. I’ve watched pages pass every Core Web Vital and still never surface in an AI-generated answer.
The infrastructure that got you here may not keep you visible from the same angles you’ve been using.
This page compiles technical SEO statistics across site health, structured data, crawlability, schema as AI infrastructure, mobile, security, and the business impact of technical performance. Every stat links to its original publisher. The info will be updated on a monthly basis.
Key Takeaways

LocalBusiness Schema Usage Statistics
- 85% of websites return a valid robots.txt file, up from 84% in 2024, and the mobile-desktop gap has effectively disappeared. (Web Almanac, 2025)
- 98.6% of desktop pages and 98.5% of mobile pages have a title tag, while only 67.7% of desktop and 67.2% of mobile pages have a meta description. (Web Almanac, 2025)
- 56% of desktop sites pass Core Web Vitals, up from 48% in 2023. Only 48% of mobile sites pass. (Web Almanac, 2025)
- WebSite schema leads structured data adoption at 12.73% of mobile pages, followed by Organization (7.16%) and LocalBusiness (3.97%). (Web Almanac, 2024)
- Adding JSON-LD schema to 1,885 pages produced no statistically significant uplift in AI Overview, AI Mode, or ChatGPT citations. (Ahrefs, May 2026)
- Only 38% of URLs cited in AI Overviews still rank in Google’s top 10 organic results as of March 2026, down from 76% in July 2025. The sharp drop signals that AI citation logic is decoupling from traditional rankings faster than the industry expected. (Ahrefs, March 2026)
- AI-crawled sites generate 320% more human traffic, 270% more form submissions, and 250% more click-to-call events than non-crawled sites. (Duda, April 2026)
- 45% of consumers now use ChatGPT or other generative AI tools for local business recommendations, up from 6% in 2025. (BrightLocal, 2026)
- Visibility in local recommendations by ChatGPT is up to 30x harder to achieve than ranking in Google’s local search results. (SOCi Local Visibility Index, cited by BrightLocal, 2026)
- A one-second delay in page load time can decrease conversions by up to 7%. (Search Engine Land, 2025)
- Pages loading within 2 seconds have a 9% bounce rate. Pages taking 5 seconds have a 38% bounce rate. (ALM Corp, 2026)
The State of Technical SEO
How many sites pass the technical basics?

SEO | 2025 | The Web Almanac by HTTP Archive
The 2025 Web Almanac, published January 2026, draws on HTTP Archive crawl data, Lighthouse reports, and the Chrome User Experience Report across millions of websites. It paints a picture of a web that is technically more stable than ever but still uneven in real-world quality.
- 85% of websites return a valid robots.txt 200 status code, up from 84% in 2024. The 404 error rate fell to 13% from 14%, meaning more sites are explicitly managing crawler access. (Web Almanac, 2025)
- 1.8% of desktop and 1.7% of mobile sites serve completely empty robots.txt files, slightly up from 2024. Empty files create inconsistent handling across minor crawlers. (Web Almanac, 2025)
- 98% of robots.txt files are under 100KB, within Google’s 500KB parsing limit. (Web Almanac, 2025)
- Invalid head elements dropped to 10.1% on desktop and 10.3% on mobile, down from 10.6% and 10.9% in 2024. (Search Engine Land analysis of Web Almanac, 2025 data, April 2026)
- Meta robots usage crept up to 46.2% in 2025 from 45.5% in 2024, reflecting growing awareness of indexation control. (Web Almanac, April 2026)
- 98.6% of desktop pages and 98.5% of mobile pages have a title tag. Only 67.7% of desktop and 67.2% of mobile pages have a meta description. (Web Almanac, 2025)
- WordPress powers 43% of the web with a 60.5% CMS market share as of the 2025 State of the Word. Among the top 1,000 websites, WordPress’s share climbed to 49.4%, up 2.3% from the prior year. (WordPress.org, State of the Word 2025)
- Canonical tag usage rose to 68% on desktop and 67% on mobile in 2025, up from 65% on both in 2024, a steady climb reflecting growing awareness of duplicate content management. (Web Almanac, 2025)
- Nofollow appears on 32.3% of pages in 2025, nearly identical to 2024 levels. The more specific sponsored and ugc attributes remain stuck at 0.5% each, suggesting webmasters see little value in finer-grained link classification. (Web Almanac, 2025)
- 6.40% of desktop pages and 5.68% of mobile pages include a video element in 2025, up slightly from 5.87% and 5.13% in 2024. (Web Almanac, 2025)
Global vs. local site health benchmarks
- 56% of desktop sites pass Core Web Vitals in 2025, up from 55% in 2024 and 48% in 2023. Mobile lags at 48%, a consistent gap that tracks with the higher latency and less stable connection environments of mobile browsing. (Web Almanac, 2025 Performance Chapter)
- 52% of mobile websites fail at least one Core Web Vital in real field data, even when they score well in lab-based Lighthouse tests. The gap between lab and field performance is a persistent measurement problem. (Web Almanac, 2025)
- Sites with poor Core Web Vitals rarely appear in AI-generated search responses. As AI Overviews take up more SERP real estate, slow sites lose both organic and AI-cited traffic simultaneously. (Ideafueled, April 2026)
- Duda leads all major website platforms with an 82% Core Web Vitals pass rate as of May 2025. Wix achieves 71 to 75%. WordPress.org passes at 43%. (Duda, May 2025)
- E-commerce sites optimizing to good Core Web Vital thresholds report 15 to 30% conversion rate improvements and 12 to 20% organic traffic increases. (ALM Corp, 2026)
Schema and Structured Data

Structured data | 2024 | The Web Almanac by HTTP Archive
How widely is structured data adopted?
Structured data adoption has matured steadily since Google began rewarding it with rich results. The 2024 Web Almanac, the most recent edition with a dedicated structured data chapter, shows adoption patterns that reflect both the SEO community’s response to Google’s rich result incentives and the growing awareness of schema as an entity signal for AI systems.
- WebSite schema leads JSON-LD adoption at 12.73% of mobile pages, followed by Organization at 7.16% and LocalBusiness at 3.97%. (Web Almanac, 2024)
- BreadcrumbList appears on 5.66% of pages, showing notable growth as sites prioritize structured navigation data. (Web Almanac, 2024)
- ItemList schema appears on 2.44% of pages, reflecting increased use of structured listing data. (Web Almanac, 2024)
- Product schema appears on 0.77% of pages. BlogPosting appears on 1.40% and Article on 0.18%. (Web Almanac, 2024)
- Specialized business types in JSON-LD: Restaurant (0.19%), AutoDealer (1.09%), Store (0.17%), demonstrating growing industry-specific markup adoption. (Web Almanac, 2024)
- JSON-LD is used on 41% of pages as of 2024, up from 37% in 2022, making it the dominant implementation format for schema markup, ahead of Microdata and RDFa. (Web Almanac, 2024)
- Google conducted a significant Knowledge Graph cleanup in early 2025, removing many entries. Previously valid kgmid values may no longer resolve, requiring schema audits for sites using entity references. (Ahrefs, May 2026)
- AI Overviews appear on 20.5% of all SERPs analyzed in Ahrefs’s study of 146 million searches, but appear significantly less for branded queries, local queries, and shorter search queries. (Ahrefs, March 2026)
- AI Overviews had over 1.5 billion users a month in Q1 2025, 18.3% of the global population, or 26.6% of all internet users. (Ahrefs, August 2025)
- 26% of brands have zero mentions in AI Overviews. The top 50 brands account for 28.9% of all citations. (Ahrefs, May 2025)
- 57.9% of all question queries trigger an AI Overview, versus 15.5% of non-question queries, confirming that question-format content and FAQ structure significantly increase AI citation exposure. (Ahrefs, March 2026)
LocalBusiness and Service Area schema

- LocalBusiness schema is live on approximately 6.2 million websites globally as of 2026, with an additional 6.3 million sites that have used it historically. 3.7 million are US-based sites. (BuiltWith Trends, 2026)
- Local businesses that implement LocalBusiness schema can appear in Local Search results and on Google Maps as a direct result of the structured markup. (Google Search documentation via Ahrefs)
- Each blog post added to a local business site is associated with a 7% increase in AI crawler visits, signaling fresh, relevant content. Each additional page is associated with a 4% increase in crawler visits. (Duda, April 2026)
- Implementing local schema including address and opening hours helps AI crawlers understand and surface local businesses. Synchronizing a Google Business Profile ensures consistent local business information across the web, compounding the schema signal. (Duda, April 2026)
- Local search experts in the 2026 Local Search Ranking Factors survey cite citations as making up approximately 7% of the top ranking factors for both Local Pack/Finder and Local Organic results. (Whitespark, November 2025)
Reviews and ratings schema

Local Consumer Review Survey 2026: Star Ratings Keep Rising, Old Reviews Don’t Cut It
- 97% of consumers read online reviews before making a purchase decision. In 2026, 41% of consumers say they always read reviews when browsing for businesses, a jump from 29% in 2025. (BrightLocal, 2026)
- 31% of consumers will only use businesses with a rating of 4.5 stars or higher. (BrightLocal, 2026)
- 54% of consumers visit a business website after reading positive reviews. (BrightLocal, 2026)
- Consumers use an average of six review sites in 2026, up from prior years, making review schema across multiple platforms an increasingly important signal. (BrightLocal Survey 2026)
- Recommended local businesses have an average of 4.3 stars on ChatGPT, 4.1 on Perplexity, and 3.9 on Gemini, confirming that AI systems apply star rating signals when surfacing local recommendations. (SOCi Local Visibility Index, 2026)
Crawlability and Indexing

ChatGPT Now Crawls 3.6x More Than Googlebot: What 24M Requests Reveal
Citation crawlability vs. ranking lift (global)
Crawlability is the prerequisite for everything else. A page that cannot be crawled cannot be indexed. A page that cannot be indexed cannot rank. And in 2026, it cannot be cited by AI systems either.
- 85% of websites serve a valid robots.txt file. The remaining 13% return 404 errors, defaulting to unrestricted crawling. (Web Almanac, 2025)
- In late 2024, the IETF introduced a working draft known as REPext, which builds on the Robots Exclusion Protocol RFC 9309 by exploring page-level crawl controls through response headers and HTML meta tags, enabling more granular future implementations. (Web Almanac, 2025)
- llms.txt adoption is emerging as a new crawl management signal, with a small but growing share of sites implementing the file to guide AI crawler behavior in 2025. (Web Almanac, 2025)
- AI crawlers are now explicitly named in robots.txt files across a measurable share of websites, a new pattern in 2025 reflecting the industry’s awareness of LLM bot traffic. (Web Almanac, 2025)
- OpenAI’s ChatGPT-User crawler now makes 3.6x more requests than Googlebot, based on 24.4 million requests across 69 sites between January and March 2026. This is a new category of infrastructure pressure, distinct from traditional search crawler load. (Search Engine Journal, April 2026)
- Only 38% of URLs cited in AI Overviews still rank in Google’s top 10 organic results as of March 2026, down from 76% in July 2025. The sharp drop signals that AI citation logic is decoupling from traditional rankings faster than the industry expected. (Ahrefs, March 2026)
- AI-cited content is 25.7% fresher than organic Google results, based on analysis of 17 million citations, meaning sites with stale or infrequently crawled content are systematically disadvantaged in AI citation. (Ahrefs, December 2025)
- 80% of LLM citations do not rank in Google’s top 100 for the original query, confirming that AI systems have their own crawl and citation logic independent of traditional organic rankings. (Ahrefs, August 2025)
Citation crawlability vs. ranking lift (local)
- AI-crawled SMB sites generate 320% more human traffic, 270% more form submissions, and 250% more click-to-call events than sites not crawled by AI systems, based on analysis of 69 million AI crawler visits across 850,000 websites in February 2026. (Duda, April 2026)
- More than 99% of US businesses are small to medium sized and rely heavily on local SEO to drive customers. With AI search projected to overtake traditional search by 2028, AEO is becoming as operationally critical as traditional SEO. (Duda, April 2026)
- Less than half of businesses that lead in Google local search results also appear in AI local recommendations, exposing a structural gap between traditional local SEO performance and AI visibility. (SOCi Local Visibility Index, cited by BrightLocal, 2026)
- ChatGPT Search shows business websites for 58% of its local search sources, followed by business mentions (27%) and online directories (15%). (BrightLocal, Local SEO Statistics, 2026)
- According to local SEO experts, AI search visibility factors are most influenced by: presence on expert-curated Best Of lists, dedicated pages for each service, and prominence on key industry-relevant domains. (Whitespark Local Search Ranking Factors 2026, cited by BrightLocal)
Schema as AI-Citation Infrastructure

We Tracked 1,885 Pages Adding Schema. AI Citations Barely Moved.
How structured data drives AI Overviews, ChatGPT, and Perplexity
The relationship between schema markup and AI citation is more nuanced in 2026 than the SEO industry assumed. The data challenges some assumptions about schema as a direct AI citation driver, while confirming its role as a foundational crawlability and entity signal.
- Ahrefs tracked 1,885 web pages that added JSON-LD schema between August 2025 and March 2026, matched against 4,000 control pages. Adding schema produced no major uplift in citations on any platform, Google AI Overviews, AI Mode, or ChatGPT. (Ahrefs, May 2026)
- A separate experiment by searchVIU tested whether five major AI systems including ChatGPT, Claude, Perplexity, Gemini, and Google AI Mode used schema markup when fetching a page in real time. None of them did. During direct retrieval, every system extracted only visible HTML content. (Ahrefs, May 2026)
- For pages already being crawled and cited by AI systems, schema adds no incremental lift. For pages not yet in AI consideration sets, schema may still play a role in initial crawlability and indexation. (Ahrefs, May 2026)
- Being mentioned on highly linked pages has a strong correlation with visibility in AI Overviews, suggesting that authority and link equity remain more powerful AI citation signals than structured markup alone. (Ahrefs, 2025)
- Brands in the top 25% for web mentions earn over 10x more AI Overview mentions than brands in the next quartile, confirming that entity prominence drives AI citation far more than technical markup. (Ahrefs, 2025)
- FAQPage schema has been growing steadily and is interpreted as a leading indicator of AI search optimization strategies changing the structure of the web, even without confirmed direct ranking or citation impact. (Search Engine Land, April 2026)
- AI only cites HTML pages and ignores Markdown (.md) pages, a technical constraint with direct implications for how AI-built sites and documentation should be structured for discoverability. (OtterlyAI, April 2026)
Local business data in AI overview generation
- 45% of consumers now use ChatGPT or other generative AI tools for local business recommendations, up from just 6% in 2025, a dramatic acceleration in local AI search behavior. (BrightLocal, 2026)
- Visibility in local recommendations by ChatGPT is up to 30x harder to achieve than ranking in Google’s local search results, reflecting the far more selective citation behavior of LLMs compared to traditional search. (SOCi, 2026)
- 37% of US consumers use Instagram to find local business reviews, and 29% use TikTok as alternative local business discovery platforms, expanding the definition of what local citation infrastructure means beyond traditional directories. (BrightLocal, 2026)
- Dynamically generating content from data, such as pages for individual services or locations, encourages deeper AI crawling by providing specific information in a more structured and machine-readable way. (Duda, April 2026)
- Synchronizing a Google Business Profile ensures crawlers see consistent and accurate local business information across the web, compounding the impact of on-site local schema. (Duda, April 2026)
- Local SEO experts in the 2026 Whitespark survey confirmed for the first time that AI search visibility factors are being tracked as a distinct ranking signal category, separate from traditional local pack factors. (Whitespark, 2026)
Mobile, HTTPS, and Security

SEO | 2025 | The Web Almanac by HTTP Archive
Mobile-first indexing and local pack visibility
Mobile-first indexing has been Google’s default since 2023. The 2025 data shows the web has largely adapted, but inconsistencies remain.
- According to the official data recorded, the industry-wide move toward responsive design has effectively closed the gap between mobile and desktop configurations, standardizing around universal codebases that uniformly deploy viewport meta tags across platforms. (Web Almanac, 2025)
- The mobile-desktop gap in valid robots.txt has effectively disappeared, with mobile now holding just a 0.06% lead, reflecting the industry shift away from separate m-dot sites toward unified responsive configurations. (Web Almanac, 2025)
- 48% of mobile sites pass Core Web Vitals in 2025, versus 56% for desktop, a persistent 8-point gap that reflects the structural performance disadvantages of mobile networks and devices. (Web Almanac, 2025)
- 53% of mobile site visits are abandoned if pages take more than 3 seconds to load. (Search Engine Land)
- A 10-second load time increases the probability of a bounce by 123% compared to a 1-second load time. (ALM Corp, 2026)
- More than 63% of web traffic comes from mobile devices as of 2024, the primary driver behind Google’s pivot to mobile-first indexing and a clear signal for where technical SEO priority should lie. (Search Engine Land)
- Mobile-first behavior is still growing in 2025, and Google continues to prioritize personalized, location-based results, making mobile technical performance directly correlated with local pack visibility. (Search Engine Land, 2025)
Security signals across global and regional markets
- HTTPS adoption is now nearly universal across the web, described by the Web Almanac, 2025 as a baseline expectation rather than a differentiator. (Web Almanac, 2025)
- X-Content-Type-Options leads security header adoption at nearly 50% of sites as of 2025. X-Frame-Options and Strict-Transport-Security follow at approximately 35% of sites. (Web Almanac, 2025)
- More modern security headers such as Cross-Origin-Opener-Policy and Permissions-Policy remain below 10% adoption, indicating significant security infrastructure gaps on most of the web. (Web Almanac, 2025)
- Cross-Origin-Resource-Policy grew from approximately 1.75% in 2023 to over 2.25% by 2025. Cross-Origin-Opener-Policy grew the most significantly among the new cross-origin security headers. (Web Almanac, 2025)
- The number of pages with cryptomining scripts declined 42% in one year and 83% since September 2022, falling to just 37 pages on mobile, an indicator of improving security hygiene at the page level. (Web Almanac, 2025)
- The EU AI Act becomes fully applicable August 2, 2026, establishing risk-based obligations for high-impact AI systems and affecting any site using AI-driven personalization, recommendation, or content generation in European markets. (Secure Privacy, 2026)
The Business Impact of Technical SEO

Update: AI Overviews Reduce Clicks by 58%
Traffic and conversion: global sites vs. location-dependent
Technical SEO is not an abstract hygiene exercise. The performance data shows direct, measurable impact on traffic, conversion, and revenue, and the impact is larger for location-dependent businesses than the aggregate figures suggest.
- A one-second delay in page load time can decrease conversions by up to 7%. (Search Engine Land)
- Pages loading within 2 seconds have a 9% bounce rate. Pages taking 5 seconds have a 38% bounce rate, a 422% increase in bounce driven by a 3-second performance gap. (ALM Corp, 2026)
- Websites with responsive design see 11% higher conversion rates and 20% more user engagement compared to non-responsive counterparts. (ALM Corp, 2026)
- E-commerce sites that optimize to good Core Web Vital thresholds report 15 to 30% conversion rate improvements and 12 to 20% organic traffic increases. (ALM Corp, 2026)
- AI search visitors spend 8 seconds more on site than other visitors, but are 5.4% more likely to bounce, their sessions are longer in duration but shallower in depth, suggesting strong initial intent but lower secondary engagement. (Ahrefs, 2025)
- Websites with more organic search traffic also receive more mentions in AI responses, confirming a reinforcing relationship between traditional SEO performance and AI visibility. (Ahrefs, 2025)
- LLM traffic converts 4.4x better than traditional organic search visitors at the point of conversion, based on Semrush’s analysis of 500-plus digital marketing and SEO topics, the category where AI adoption runs highest. The commercial impact is outsized given that AI still accounts for only about 0.14% of total web visits. (Semrush, 2025)
- Organic search declined in 13 of 17 industries analyzed across 2025, while AI referral traffic grew 66%. Sites that maintain technical SEO foundations are best positioned to benefit from both channels simultaneously. (Semrush, April 2026)
- AI Overviews reduce organic click-through rates for the top-ranking page by 58% as of December 2025, up from a 34.5% reduction measured in April 2025, the gap is accelerating, not stabilizing. (Ahrefs, February 2026)
- 7 in 10 searchers never read past the first third of an AI Overview, making the structural positioning of content within a page a direct determinant of AI citation visibility. (Ahrefs)
- AI visitors visit 1.2 fewer pages per session than search visitors and 1.5 fewer than the average visitor overall, despite spending 8 seconds longer on site. (Ahrefs, 2025)
- The median unsized images per page is 2, rising sharply to 25 on desktop and 22 on mobile at the 90th percentile, unsized images increase layout shift risk and directly affect Core Web Vitals CLS scores. (Web Almanac, 2025)
Cross-location technical consistency and performance spread
- Less than half of businesses leading in Google local search also appear in AI local recommendations, creating a technical performance divergence between traditional and AI local visibility that requires separate optimization strategies. (SOCi, 2026)
- AI-crawled local business sites generate 320% more human traffic, a performance spread that compounds across multi-location businesses that may have inconsistent schema, crawlability, and GBP synchronization across locations. (Duda, April 2026)
- Just 35% of SMBs have a Google Business Profile, the foundational local technical signal that underpins both traditional and AI local search visibility. (SMB Marketing Report 2025)
- 82% of consumers search for something online at least daily, with two in five consumers estimating that at least 41% of their searches are dedicated to finding local business information. (BrightLocal, April 2025)
- For multi-location businesses, cross-location technical consistency, uniform schema implementation, synchronized GBP data, and consistent NAP (name, address, phone) across citations, is cited by local SEO experts as a direct ranking driver for both local pack and AI recommendation visibility. (Whitespark, 2026)
Methodology and Sources
This page compiles statistics from primary research reports, platform studies, government data, and industry surveys published between 2022 and 2026. Every stat links to its original publisher. Where data originates from an annual industry survey (Whitespark Local Search Ranking Factors, BrightLocal Local Consumer Review Survey), the survey page is the primary citation. Web Almanac data is cited at the HTTP Archive chapter level. Updated monthly.
Sources:
Web Almanac, 2025 SEO ·Web Almanac, 2025 Performance ·Web Almanac, 2025 Security ·Web Almanac, 2024 Structured Data ·Web Almanac, 2022 Structured Data ·WordPress State of the Word 2025 ·Ahrefs Schema AI Citations ·Ahrefs AI Overview Citations Top 10 ·Ahrefs AI Overview Citations July 2025 ·Ahrefs Fresh Content ·Ahrefs AI Search Overlap ·Ahrefs AI SEO Statistics ·Ahrefs Schema Markup Guide ·Semrush AI Visibility ·Semrush Traffic Channel Mix ·Duda AEO Study ·Duda Core Web Vitals ·BrightLocal Local Consumer Review Survey 2026 ·BrightLocal AI Trust Research ·BrightLocal Local SEO Statistics ·BrightLocal Consumer Search Behavior ·Whitespark Local Search Ranking Factors 2026 ·BuiltWith LocalBusiness Schema ·Search Engine Land Technical SEO Guide ·Search Engine Land Page Speed ·Search Engine Land Web Almanac, Analysis 2026 ·Search Engine Land Bounce Rate ·Search Engine Journal LLM Crawl Data ·ALM Corp Core Web Vitals 2026 ·Ideafueled Core Web Vitals 2026 ·OtterlyAI HTML vs Markdown ·Secure Privacy EU AI Act ·Google AI Mode Insights