Structured Data, llms.txt & FAQ: Why AI Can't Find Your Website
Your website looks great. Your SEO is decent. Your Google ranking is fine. But when someone asks ChatGPT "best digital agency in Vienna" โ your name never comes up.
Why? Because AI doesn't read your website the way Google does. It reads structured data. And if you don't have any, you're invisible.
This guide explains the four technical elements that determine whether AI assistants can find, understand, and recommend your business โ and exactly how to implement each one.
1. Schema.org Structured Data โ The Language AI Speaks
What Is Schema.org?
Schema.org is a standardized format for telling search engines and AI models what your business is, what you do, and where you are. It's a block of code (JSON-LD) that sits in your website's HTML โ invisible to visitors, but crystal clear to machines.
Think of it like this: your website is a shop window. Schema.org is the business card you hand to every AI assistant that visits.
Why AI Models Love Schema.org
When ChatGPT, Gemini, or Perplexity need to recommend a business, they don't "read" your website like a human. They look for structured signals:
- What does this business do? โ
Serviceschema - Where is it located? โ
LocalBusinessschema - Who runs it? โ
Organizationschema - What do customers ask? โ
FAQPageschema - What do customers say? โ
Reviewschema
Without these, AI is basically trying to guess your business details from unstructured text. Sometimes it guesses wrong. Sometimes it doesn't guess at all.
The 5 Schema Types Every Business Needs
| Schema Type | What It Tells AI | Priority |
|---|---|---|
Organization | Company name, logo, contact info, social profiles | ๐ด Critical |
LocalBusiness | Address, phone, opening hours, geo coordinates | ๐ด Critical |
Service | What services you offer, prices, areas served | ๐ด Critical |
FAQPage | Common questions and answers about your business | ๐ก High |
Review | Customer ratings, review count, star ratings | ๐ก High |
How to Implement Schema.org
Here's a minimal LocalBusiness schema for a digital agency:
<script type="application/ld+json">
{
"@context": "https://schema.org",
"@type": "LocalBusiness",
"name": "Your Agency Name",
"description": "Digital marketing agency specializing in SEO, GEO, and AI visibility.",
"url": "https://youragency.com",
"telephone": "+43-1-234-5678",
"address": {
"@type": "PostalAddress",
"streetAddress": "Mariahilfer Straรe 123",
"addressLocality": "Vienna",
"addressCountry": "AT"
},
"geo": {
"@type": "GeoCoordinates",
"latitude": 48.2082,
"longitude": 16.3738
},
"openingHours": "Mo-Fr 09:00-18:00",
"priceRange": "โฌโฌ",
"sameAs": [
"https://linkedin.com/company/youragency",
"https://facebook.com/youragency"
]
}
</script>
๐ก Pro tip: Use Google's Rich Results Test to validate your Schema.org implementation. Fix any errors before AI crawlers find them.
2. llms.txt โ The AI Welcome Mat
What Is llms.txt?
llms.txt is a simple text file you place at the root of your website (like robots.txt) that tells AI models how to read your site. It was proposed in 2024 and is now being adopted by major AI systems.
Where robots.txt tells crawlers what NOT to read, llms.txt tells AI what TO read โ and how to understand it.
Why It Matters
When an AI crawler visits your site, it needs to quickly understand:
- What is this business?
- What pages are most important?
- What services are offered?
- How should the AI describe this business to users?
Without llms.txt, the AI has to figure this out by crawling every page and guessing. With it, you control the narrative.
How to Create llms.txt
Create a file called llms.txt and place it at your website's root (https://yoursite.com/llms.txt):
# llms.txt โ AI-readable business summary
## About
We are [Agency Name], a digital marketing agency based in [City, Country].
We specialize in [services: SEO, GEO, web design, AI chatbots].
## Key Pages
- Services: /services
- About Us: /about
- Portfolio: /portfolio
- Contact: /contact
- Blog: /blog
## Business Info
- Founded: 2020
- Team size: 5-10
- Languages: English, German, Slovenian
- Industries served: SaaS, E-commerce, Hospitality
## Contact
- Email: hello@youragency.com
- Phone: +43-1-234-5678
- Address: Mariahilfer Straรe 123, Vienna, Austria
โ
Impact: Websites with llms.txt are 3x more likely to be accurately described by AI assistants. You're telling the AI exactly what to say about you.
llms-full.txt โ The Extended Version
For more complex businesses, you can also create llms-full.txt with detailed information about your services, pricing, team, and FAQ. This is especially useful for agencies with multiple service offerings.
3. FAQ Structured Data โ The Questions AI Answers
Why FAQ Data Is Gold for AI
When someone asks ChatGPT "How much does a website cost in Slovenia?", the AI looks for pages that directly answer that question. If your website has a FAQPage schema with that exact question and answer, you're 5x more likely to be cited.
FAQ structured data does two things:
- Tells AI what questions you can answer
- Provides the exact answer to use
How to Implement FAQ Schema
<script type="application/ld+json">
{
"@context": "https://schema.org",
"@type": "FAQPage",
"mainEntity": [
{
"@type": "Question",
"name": "How much does a website cost in Slovenia?",
"acceptedAnswer": {
"@type": "Answer",
"text": "A basic website in Slovenia costs โฌ890+VAT. A business website with custom design starts at โฌ1,990+VAT. Premium e-commerce solutions range from โฌ3,490+VAT."
}
},
{
"@type": "Question",
"name": "What is GEO optimization?",
"acceptedAnswer": {
"@type": "Answer",
"text": "GEO (Generative Engine Optimization) is the practice of optimizing your website so AI assistants like ChatGPT, Gemini, and Perplexity can find, understand, and recommend your business."
}
},
{
"@type": "Question",
"name": "How do I make ChatGPT recommend my business?",
"acceptedAnswer": {
"@type": "Answer",
"text": "Add Schema.org structured data, create an llms.txt file, implement FAQ markup, and ensure your business information is consistent across directories and review sites."
}
}
]
}
</script>
FAQ Best Practices for GEO
- Use real questions your customers ask (check Google Search Console, support tickets, sales calls)
- Include location-specific questions ("best agency in Vienna," "web design prices Austria")
- Keep answers concise โ 2-3 sentences that AI can directly quote
- Add new questions regularly โ freshness matters for AI ranking
4. Why AI Can't Find You โ The Complete Diagnostic
The 4 Invisible Signals
If your website is missing these elements, AI assistants literally cannot recommend you:
| Signal | What Happens Without It | Impact |
|---|---|---|
Schema.org | AI doesn't know what you do or where you are | Critical |
llms.txt | AI has to guess your business details | Critical |
FAQPage | You don't appear in Q&A-style searches | High |
OpenGraph tags | AI can't generate rich previews of your content | Medium |
The AI Visibility Score
We've built a tool that checks all four elements and gives you a score from 0-100:
Average score for most websites: 23/100
Here's what a typical failing score looks like:
| Check | Status |
|---|---|
| Schema.org structured data | โ Not found |
| llms.txt file | โ Not found |
| FAQPage schema | โ Not found |
| OpenGraph tags | โ Present |
| HTTPS enabled | โ Valid |
| Meta descriptions | โ Present |
โ ๏ธ Reality check: 73% of business websites have no Schema.org markup. That means 73% of businesses are invisible to AI recommendations. This is your competitive advantage.
How to Fix It โ Step by Step
- Audit your current state โ Use BoostSuite's GEO Check to see your score
- Add Organization + LocalBusiness schema โ This is the foundation
- Create llms.txt โ 15 minutes of work, massive impact
- Implement FAQ schema โ Start with your top 5 customer questions
- Add OpenGraph tags โ For rich previews in AI responses
- Test everything โ Use Google's Rich Results Test + BoostSuite GEO Check
- Monitor and update โ Check your score monthly, add new FAQ questions
The Bottom Line
GEO isn't about writing better content or building more backlinks. It's about giving AI models the technical signals they need to understand and recommend your business.
Schema.org tells AI what you do. llms.txt tells AI how to describe you. FAQ data tells AI what questions you can answer. Without these three elements, you're hoping AI guesses correctly about your business.
And hope is not a strategy.
Check Your AI Visibility Score โ Free
See exactly what ChatGPT, Gemini, and Perplexity say about your business. Takes 30 seconds.
Run Free GEO Check โFrequently Asked Questions
How long does it take for AI to notice my Schema.org markup?
AI models crawl websites continuously. After implementing Schema.org, you can expect AI systems to pick up your structured data within 1-4 weeks. However, the impact on AI recommendations may take 2-3 months as models re-index your site.
Is llms.txt required for AI visibility?
It's not technically required, but it's the fastest way to control how AI describes your business. Without it, AI models rely on crawling your entire site and guessing โ which often leads to inaccurate descriptions.
Can I implement Schema.org myself?
Yes! Basic Schema.org is a single JSON-LD block you can add to your website's HTML. For more complex implementations (multiple services, locations, team members), you might want to use a plugin or hire a developer.
Does this replace traditional SEO?
No. GEO complements SEO. Traditional SEO helps you rank on Google. GEO helps you get recommended by AI assistants. You need both โ but GEO is becoming increasingly important as AI search grows.
What's the difference between Schema.org and OpenGraph?
Schema.org is for search engines and AI models. OpenGraph is for social media and link previews. Both are important, but Schema.org has a much bigger impact on AI visibility.
This article was written by Darko Herceg, founder of hd-webdesign.si โ a web design and AI services company from Slovenia. We build tools that help businesses get found by AI.