doc/15-SEO.md

# SEO Implementation Guide

**Project:** CV Interactive Website
**Last Updated:** 2025-11-30
**Status:** Production Ready

---

## Overview

This document describes the comprehensive SEO (Search Engine Optimization) implementation for the CV website, including traditional search engine optimization and modern AI-era optimizations for LLM crawlers and AI Overviews.

---

## SEO Architecture

### 1. Traditional SEO Elements

#### Meta Tags (`templates/index.html`)

```html
<!-- Primary Meta Tags -->
<title>{{.CV.Personal.Name}} - {{.CV.SEO.PageTitle}}</title>
<meta name="title" content="...">
<meta name="description" content="...">
<meta name="keywords" content="...">
<meta name="author" content="...">
<meta name="robots" content="index, follow">
<link rel="canonical" href="{{.CanonicalURL}}">
```

#### International SEO (Hreflang)

```html
<link rel="alternate" hreflang="en" href="{{.AlternateEN}}">
<link rel="alternate" hreflang="es" href="{{.AlternateES}}">
<link rel="alternate" hreflang="x-default" href="https://juan.andres.morenorub.io/?lang=en">
```

#### Social Media Integration

| Platform | Meta Type | Implementation |
|----------|-----------|----------------|
| Facebook | Open Graph | `og:type`, `og:title`, `og:description`, `og:image` |
| Twitter/X | Twitter Cards | `twitter:card`, `twitter:title`, `twitter:description` |
| LinkedIn | Open Graph | Uses same `og:*` tags |

---

### 2. Structured Data (JSON-LD)

The site implements multiple Schema.org types for comprehensive semantic understanding:

#### Person Schema (Primary)
```json
{
  "@type": "Person",
  "@id": "{{.CV.Personal.Website}}/#person",
  "name": "...",
  "jobTitle": "...",
  "description": "...",
  "knowsAbout": [...],
  "knowsLanguage": [...],
  "worksFor": [...],
  "hasOccupation": [...]
}
```

**Fields included:**
- Basic info: name, givenName, familyName, jobTitle
- Contact: email, telephone, url
- Demographics: birthDate, birthPlace, nationality
- Location: address with locality and country
- Social: sameAs (LinkedIn, GitHub, Domestika)
- Education: alumniOf
- Skills: knowsAbout (array of expertise areas)
- Languages: knowsLanguage (with Language type)
- Employment: worksFor (multiple organizations)
- Occupations: hasOccupation (dynamically generated from experience)

#### WebSite Schema
```json
{
  "@type": "WebSite",
  "name": "... - Professional CV",
  "url": "...",
  "inLanguage": ["en", "es"],
  "potentialAction": { "@type": "SearchAction", ... }
}
```

#### BreadcrumbList Schema
```json
{
  "@type": "BreadcrumbList",
  "itemListElement": [
    { "position": 1, "name": "Home", "item": "..." },
    { "position": 2, "name": "CV (English/Español)", "item": ".../?lang=..." }
  ]
}
```

#### ProfilePage Schema
```json
{
  "@type": "ProfilePage",
  "mainEntity": { "@id": ".../#person" },
  "dateCreated": "...",
  "dateModified": "...",
  "inLanguage": "..."
}
```

#### EducationalOccupationalCredential Schema
Generated dynamically for each education entry:
```json
{
  "@type": "EducationalOccupationalCredential",
  "name": "{{.Degree}}",
  "description": "{{.Field}}",
  "educationalLevel": "Bachelor's Degree",
  "credentialCategory": "degree",
  "recognizedBy": { "@type": "CollegeOrUniversity", ... }
}
```

#### Course Schema
Generated dynamically for each course/certification:
```json
{
  "@type": "Course",
  "name": "{{.Title}}",
  "description": "...",
  "provider": { "@type": "Organization", ... },
  "hasCourseInstance": { "@type": "CourseInstance", ... },
  "timeRequired": "{{.Duration}}"
}
```

---

### 3. AI-Era SEO Optimizations

#### llms.txt File (`static/llms.txt`)

A dedicated file for AI crawlers following the [llmstxt.org](https://llmstxt.org/) standard:

```
# llms.txt - AI Crawler Information
name: Juan Andrés Moreno Rubio - Professional CV
description: Interactive curriculum vitae...

## Professional Summary
- Senior Technical Consultant...

## Key Expertise
- SAP Customer Data Cloud...

## Contact
- Website: ...
- LinkedIn: ...
```

**Purpose:** Provides AI systems (ChatGPT, Claude, Perplexity, etc.) with structured, human-readable information about the site content.

#### robots.txt AI Bot Rules (`static/robots.txt`)

Explicit permissions for AI crawlers:

| Bot | Service | Status |
|-----|---------|--------|
| GPTBot | OpenAI/ChatGPT | Allowed |
| ChatGPT-User | OpenAI | Allowed |
| ClaudeBot | Anthropic | Allowed |
| Claude-Web | Anthropic | Allowed |
| anthropic-ai | Anthropic | Allowed |
| Google-Extended | Google AI/Gemini | Allowed |
| PerplexityBot | Perplexity AI | Allowed |
| cohere-ai | Cohere | Allowed |
| CCBot | Common Crawl | Allowed |
| Amazonbot | Amazon/Alexa | Allowed |
| Applebot | Apple/Siri | Allowed |
| Copilot | Microsoft | Allowed |
| YouBot | You.com | Allowed |
| BraveBot | Brave Search | Allowed |

---

## E-E-A-T Signals

The implementation supports Google's E-E-A-T (Experience, Expertise, Authority, Trust) framework:

### Experience
- Detailed work history with responsibilities
- Real project descriptions
- Duration and dates for credibility

### Expertise
- Skills categorized by domain
- Technologies listed per job
- Certifications and courses

### Authority
- Links to LinkedIn, GitHub, portfolio
- Company associations (SAP, Olympic Broadcasting)
- Client count and project metrics in summary

### Trust
- Canonical URLs prevent duplicate content
- HTTPS enforced
- Clear contact information
- Privacy-respecting analytics (Matomo)

---

## Files Overview

| File | Purpose |
|------|---------|
| `templates/index.html` | Meta tags, JSON-LD schemas |
| `static/robots.txt` | Search engine and AI bot directives |
| `static/llms.txt` | AI crawler information file |
| `static/sitemap.xml` | XML sitemap for search engines |
| `data/cv-en.json` | SEO fields (pageTitle, metaTitle, etc.) |
| `data/cv-es.json` | Spanish SEO fields |

---

## SEO Data Model

The SEO-specific fields in `data/cv-{lang}.json`:

```json
{
  "seo": {
    "pageTitle": "Curriculum Vitae",
    "metaTitle": "Professional CV",
    "metaDescription": "18 years of experience in...",
    "ogDescription": "Senior Technical Consultant...",
    "keywords": "CV, Resume, FullStack Developer, SAP CDC..."
  }
}
```

---

## Validation & Testing

### Schema Validation
Test structured data at:
- [Google Rich Results Test](https://search.google.com/test/rich-results)
- [Schema.org Validator](https://validator.schema.org/)

### Expected Schema Count
The site generates **12+ JSON-LD blocks**:
- 1 Person schema
- 1 WebSite schema
- 1 BreadcrumbList schema
- 1 ProfilePage schema
- N EducationalOccupationalCredential schemas (1 per education)
- N Course schemas (1 per course)

### robots.txt Validation
Test at: [Google Robots.txt Tester](https://www.google.com/webmasters/tools/robots-testing-tool)

---

## Best Practices Implemented

### Content Structure
- [ ] Clear H1-H6 heading hierarchy
- [x] Semantic HTML5 elements (article, section, nav)
- [x] Alt text for images
- [x] Descriptive link text

### Technical SEO
- [x] Mobile-responsive design
- [x] Fast page load (bundled CSS, preload fonts)
- [x] Canonical URLs
- [x] Hreflang for multilingual
- [x] Sitemap.xml
- [x] robots.txt with AI bot rules

### Modern SEO (AI-Era)
- [x] llms.txt file
- [x] Comprehensive JSON-LD schemas
- [x] AI bot permissions in robots.txt
- [x] Clear, parseable content structure

---

## Maintenance

### When to Update

1. **Content changes**: Update `data/cv-{lang}.json` SEO fields
2. **New sections**: Add corresponding Schema.org types
3. **New AI bots**: Add to `robots.txt`
4. **Annual review**: Update `llms.txt` with current info

### Monitoring

- Google Search Console for traditional SEO
- Matomo Analytics for traffic patterns
- Manual testing in AI chat interfaces (ChatGPT, Claude, Perplexity)

---

## References

- [Schema.org](https://schema.org/)
- [Google Search Central](https://developers.google.com/search)
- [llmstxt.org Standard](https://llmstxt.org/)
- [WPBeginner SEO Guide 2025](https://www.wpbeginner.com/opinion/does-seo-still-work/)
feat: Implement comprehensive AI-era SEO optimizations 2025-11-30 13:23:22 +00:00			`# SEO Implementation Guide`

			`Project: CV Interactive Website`
			`Last Updated: 2025-11-30`
			`Status: Production Ready`

			`---`

			`## Overview`

			`This document describes the comprehensive SEO (Search Engine Optimization) implementation for the CV website, including traditional search engine optimization and modern AI-era optimizations for LLM crawlers and AI Overviews.`

			`---`

			`## SEO Architecture`

			`### 1. Traditional SEO Elements`

			#### Meta Tags (`templates/index.html`)

			```html
			`<!-- Primary Meta Tags -->`
			`<title>{{.CV.Personal.Name}} - {{.CV.SEO.PageTitle}}</title>`
			`<meta name="title" content="...">`
			`<meta name="description" content="...">`
			`<meta name="keywords" content="...">`
			`<meta name="author" content="...">`
			`<meta name="robots" content="index, follow">`
			`<link rel="canonical" href="{{.CanonicalURL}}">`
			```

			`#### International SEO (Hreflang)`

			```html
			`<link rel="alternate" hreflang="en" href="{{.AlternateEN}}">`
			`<link rel="alternate" hreflang="es" href="{{.AlternateES}}">`
			`<link rel="alternate" hreflang="x-default" href="https://juan.andres.morenorub.io/?lang=en">`
			```

			`#### Social Media Integration`

			`\| Platform \| Meta Type \| Implementation \|`
			`\|----------\|-----------\|----------------\|`
			\| Facebook \| Open Graph \| `og:type`, `og:title`, `og:description`, `og:image` \|
			\| Twitter/X \| Twitter Cards \| `twitter:card`, `twitter:title`, `twitter:description` \|
			\| LinkedIn \| Open Graph \| Uses same `og:*` tags \|

			`---`

			`### 2. Structured Data (JSON-LD)`

			`The site implements multiple Schema.org types for comprehensive semantic understanding:`

			`#### Person Schema (Primary)`
			```json
			`{`
			`"@type": "Person",`
			`"@id": "{{.CV.Personal.Website}}/#person",`
			`"name": "...",`
			`"jobTitle": "...",`
			`"description": "...",`
			`"knowsAbout": [...],`
			`"knowsLanguage": [...],`
			`"worksFor": [...],`
			`"hasOccupation": [...]`
			`}`
			```

			`Fields included:`
			`- Basic info: name, givenName, familyName, jobTitle`
			`- Contact: email, telephone, url`
			`- Demographics: birthDate, birthPlace, nationality`
			`- Location: address with locality and country`
			`- Social: sameAs (LinkedIn, GitHub, Domestika)`
			`- Education: alumniOf`
			`- Skills: knowsAbout (array of expertise areas)`
			`- Languages: knowsLanguage (with Language type)`
			`- Employment: worksFor (multiple organizations)`
			`- Occupations: hasOccupation (dynamically generated from experience)`

			`#### WebSite Schema`
			```json
			`{`
			`"@type": "WebSite",`
			`"name": "... - Professional CV",`
			`"url": "...",`
			`"inLanguage": ["en", "es"],`
			`"potentialAction": { "@type": "SearchAction", ... }`
			`}`
			```

			`#### BreadcrumbList Schema`
			```json
			`{`
			`"@type": "BreadcrumbList",`
			`"itemListElement": [`
			`{ "position": 1, "name": "Home", "item": "..." },`
			`{ "position": 2, "name": "CV (English/Español)", "item": ".../?lang=..." }`
			`]`
			`}`
			```

			`#### ProfilePage Schema`
			```json
			`{`
			`"@type": "ProfilePage",`
			`"mainEntity": { "@id": ".../#person" },`
			`"dateCreated": "...",`
			`"dateModified": "...",`
			`"inLanguage": "..."`
			`}`
			```

			`#### EducationalOccupationalCredential Schema`
			`Generated dynamically for each education entry:`
			```json
			`{`
			`"@type": "EducationalOccupationalCredential",`
			`"name": "{{.Degree}}",`
			`"description": "{{.Field}}",`
			`"educationalLevel": "Bachelor's Degree",`
			`"credentialCategory": "degree",`
			`"recognizedBy": { "@type": "CollegeOrUniversity", ... }`
			`}`
			```

			`#### Course Schema`
			`Generated dynamically for each course/certification:`
			```json
			`{`
			`"@type": "Course",`
			`"name": "{{.Title}}",`
			`"description": "...",`
			`"provider": { "@type": "Organization", ... },`
			`"hasCourseInstance": { "@type": "CourseInstance", ... },`
			`"timeRequired": "{{.Duration}}"`
			`}`
			```

			`---`

			`### 3. AI-Era SEO Optimizations`

			#### llms.txt File (`static/llms.txt`)

			`A dedicated file for AI crawlers following the [llmstxt.org](https://llmstxt.org/) standard:`

			```
			`# llms.txt - AI Crawler Information`
			`name: Juan Andrés Moreno Rubio - Professional CV`
			`description: Interactive curriculum vitae...`

			`## Professional Summary`
			`- Senior Technical Consultant...`

			`## Key Expertise`
			`- SAP Customer Data Cloud...`

			`## Contact`
			`- Website: ...`
			`- LinkedIn: ...`
			```

			`Purpose: Provides AI systems (ChatGPT, Claude, Perplexity, etc.) with structured, human-readable information about the site content.`

			#### robots.txt AI Bot Rules (`static/robots.txt`)

			`Explicit permissions for AI crawlers:`

			`\| Bot \| Service \| Status \|`
			`\|-----\|---------\|--------\|`
			`\| GPTBot \| OpenAI/ChatGPT \| Allowed \|`
			`\| ChatGPT-User \| OpenAI \| Allowed \|`
			`\| ClaudeBot \| Anthropic \| Allowed \|`
			`\| Claude-Web \| Anthropic \| Allowed \|`
			`\| anthropic-ai \| Anthropic \| Allowed \|`
			`\| Google-Extended \| Google AI/Gemini \| Allowed \|`
			`\| PerplexityBot \| Perplexity AI \| Allowed \|`
			`\| cohere-ai \| Cohere \| Allowed \|`
			`\| CCBot \| Common Crawl \| Allowed \|`
			`\| Amazonbot \| Amazon/Alexa \| Allowed \|`
			`\| Applebot \| Apple/Siri \| Allowed \|`
			`\| Copilot \| Microsoft \| Allowed \|`
			`\| YouBot \| You.com \| Allowed \|`
			`\| BraveBot \| Brave Search \| Allowed \|`

			`---`

			`## E-E-A-T Signals`

			`The implementation supports Google's E-E-A-T (Experience, Expertise, Authority, Trust) framework:`

			`### Experience`
			`- Detailed work history with responsibilities`
			`- Real project descriptions`
			`- Duration and dates for credibility`

			`### Expertise`
			`- Skills categorized by domain`
			`- Technologies listed per job`
			`- Certifications and courses`

			`### Authority`
			`- Links to LinkedIn, GitHub, portfolio`
			`- Company associations (SAP, Olympic Broadcasting)`
			`- Client count and project metrics in summary`

			`### Trust`
			`- Canonical URLs prevent duplicate content`
			`- HTTPS enforced`
			`- Clear contact information`
			`- Privacy-respecting analytics (Matomo)`

			`---`

			`## Files Overview`

			`\| File \| Purpose \|`
			`\|------\|---------\|`
			\| `templates/index.html` \| Meta tags, JSON-LD schemas \|
			\| `static/robots.txt` \| Search engine and AI bot directives \|
			\| `static/llms.txt` \| AI crawler information file \|
			\| `static/sitemap.xml` \| XML sitemap for search engines \|
			\| `data/cv-en.json` \| SEO fields (pageTitle, metaTitle, etc.) \|
			\| `data/cv-es.json` \| Spanish SEO fields \|

			`---`

			`## SEO Data Model`

			The SEO-specific fields in `data/cv-{lang}.json`:

			```json
			`{`
			`"seo": {`
			`"pageTitle": "Curriculum Vitae",`
			`"metaTitle": "Professional CV",`
			`"metaDescription": "18 years of experience in...",`
			`"ogDescription": "Senior Technical Consultant...",`
			`"keywords": "CV, Resume, FullStack Developer, SAP CDC..."`
			`}`
			`}`
			```

			`---`

			`## Validation & Testing`

			`### Schema Validation`
			`Test structured data at:`
			`- [Google Rich Results Test](https://search.google.com/test/rich-results)`
			`- [Schema.org Validator](https://validator.schema.org/)`

			`### Expected Schema Count`
			`The site generates 12+ JSON-LD blocks:`
			`- 1 Person schema`
			`- 1 WebSite schema`
			`- 1 BreadcrumbList schema`
			`- 1 ProfilePage schema`
			`- N EducationalOccupationalCredential schemas (1 per education)`
			`- N Course schemas (1 per course)`

			`### robots.txt Validation`
			`Test at: [Google Robots.txt Tester](https://www.google.com/webmasters/tools/robots-testing-tool)`

			`---`

			`## Best Practices Implemented`

			`### Content Structure`
			`- [ ] Clear H1-H6 heading hierarchy`
			`- [x] Semantic HTML5 elements (article, section, nav)`
			`- [x] Alt text for images`
			`- [x] Descriptive link text`

			`### Technical SEO`
			`- [x] Mobile-responsive design`
			`- [x] Fast page load (bundled CSS, preload fonts)`
			`- [x] Canonical URLs`
			`- [x] Hreflang for multilingual`
			`- [x] Sitemap.xml`
			`- [x] robots.txt with AI bot rules`

			`### Modern SEO (AI-Era)`
			`- [x] llms.txt file`
			`- [x] Comprehensive JSON-LD schemas`
			`- [x] AI bot permissions in robots.txt`
			`- [x] Clear, parseable content structure`

			`---`

			`## Maintenance`

			`### When to Update`

			1. Content changes: Update `data/cv-{lang}.json` SEO fields
			`2. New sections: Add corresponding Schema.org types`
			3. New AI bots: Add to `robots.txt`
			4. Annual review: Update `llms.txt` with current info

			`### Monitoring`

			`- Google Search Console for traditional SEO`
			`- Matomo Analytics for traffic patterns`
			`- Manual testing in AI chat interfaces (ChatGPT, Claude, Perplexity)`

			`---`

			`## References`

			`- [Schema.org](https://schema.org/)`
			`- [Google Search Central](https://developers.google.com/search)`
			`- [llmstxt.org Standard](https://llmstxt.org/)`
			`- [WPBeginner SEO Guide 2025](https://www.wpbeginner.com/opinion/does-seo-still-work/)`