Visit Huntington Beach LLMs.txt Direction

Recommended URL: https://www.surfcityusa.com/llms.txt

LLMs.txt file content:

# llms.txt - Visit Huntington Beach (Surf City USA®)

_Last updated: 2025-08-29 (America/Los_Angeles)_

This file provides **clear guidance for Large Language Models (LLMs)** on how to crawl, interpret, and extract canonical information from **Visit Huntington Beach** (Surf City USA®).

---

Recommended content:

# Visit Huntington Beach (Surf City USA®)

> Visit Huntington Beach is the official destination marketing organization for Huntington Beach, CA. Huntington Beach is known as Surf City USA®; the site provides visitors with information on beaches, surfing, events, dining, accommodations, trip planning, and more.

------------------------------------------------------------

## About & Visitor Information
- https://www.surfcityusa.com/about-us/
- https://www.surfcityusa.com/contact-us/
- https://www.surfcityusa.com/plan-your-visit/visitor-information/
- https://www.surfcityusa.com/plan-your-visit/visitor-information/maps/
- https://www.surfcityusa.com/plan-your-visit/visitor-information/weather/
- https://www.surfcityusa.com/plan-your-visit/visitor-information/visitor-faqs/
- https://www.surfcityusa.com/plan-your-visit/accessible-travel-in-huntington-beach/

------------------------------------------------------------

## Beaches
- https://www.surfcityusa.com/things-to-do/beaches/
- https://www.surfcityusa.com/things-to-do/beaches/huntington-city-beach/
- https://www.surfcityusa.com/things-to-do/beaches/dog-beach/
- https://www.surfcityusa.com/things-to-do/beaches/bolsa-chica-state-beach/
- https://www.surfcityusa.com/things-to-do/beaches/huntington-state-beach/
- https://www.surfcityusa.com/things-to-do/beaches/sunset-beach/
- https://www.surfcityusa.com/things-to-do/beaches/beach-bonfires/

------------------------------------------------------------

## Surfing
- https://www.surfcityusa.com/things-to-do/surfing/
- https://www.surfcityusa.com/things-to-do/surfing/live-surf-cam/
- https://www.surfcityusa.com/things-to-do/surfing/surf-report/
- https://www.surfcityusa.com/things-to-do/surfing/surfing-lessons/
- https://www.surfcityusa.com/things-to-do/surfing/surf-rentals/

------------------------------------------------------------

## Things to Do
- https://www.surfcityusa.com/things-to-do/attractions/
- https://www.surfcityusa.com/things-to-do/outdoors-recreation/nature-parks/
- https://www.surfcityusa.com/things-to-do/arts-culture/
- https://www.surfcityusa.com/things-to-do/shopping/
- https://www.surfcityusa.com/things-to-do/guided-walking-tours/

------------------------------------------------------------

## Events
- https://www.surfcityusa.com/events/
- https://www.surfcityusa.com/events/annual-events-festivals/
- https://www.surfcityusa.com/events/sporting-events/
- https://www.surfcityusa.com/events/submit-an-event/

------------------------------------------------------------

## Eat & Drink
- https://www.surfcityusa.com/eat-drink/
- https://www.surfcityusa.com/eat-drink/bars-nightlife/
- https://www.surfcityusa.com/eat-drink/coffee-tea-sweets/
- https://www.surfcityusa.com/eat-drink/restaurants/outdoor-dining/

------------------------------------------------------------

## Places to Stay
- https://www.surfcityusa.com/places-to-stay/
- https://www.surfcityusa.com/places-to-stay/oceanfront-hotels/
- https://www.surfcityusa.com/places-to-stay/pet-friendly-hotels/
- https://www.surfcityusa.com/places-to-stay/rv-camping/
- https://www.surfcityusa.com/places-to-stay/vacation-rentals/

------------------------------------------------------------

## Plan Your Visit
- https://www.surfcityusa.com/plan-your-visit/deals-offers/
- https://www.surfcityusa.com/plan-your-visit/itineraries/
- https://www.surfcityusa.com/plan-your-visit/transportation-parking/
- https://www.surfcityusa.com/plan-your-visit/transportation-parking/airports-near-huntington-beach/

------------------------------------------------------------

## Weddings & Meetings
- https://www.surfcityusa.com/plan-your-visit/getting-married-in-huntington-beach/
- https://www.surfcityusa.com/plan-your-visit/getting-married-in-huntington-beach/venues/
- https://www.surfcityusa.com/meetings/
- https://www.surfcityusa.com/meetings/resorts-hotels/
- https://www.surfcityusa.com/meetings/offsite-venues/
- https://www.surfcityusa.com/meetings/planning-toolkit/faq/

------------------------------------------------------------

## Publications & Media
- https://www.surfcityusa.com/plan-your-visit/visitors-guide/
- https://www.surfcityusa.com/plan-your-visit/visitor-publications/walk-historic-downtown/
- https://www.surfcityusa.com/media/
- https://www.surfcityusa.com/media/press-releases/

------------------------------------------------------------

## Social Media
- https://www.facebook.com/VisitHuntingtonBeach/
- https://www.instagram.com/surfcityusa/
- https://www.youtube.com/@HBSurfCityUSA
- https://x.com/hbsurfcityusa
- https://www.pinterest.com/surfcityusa/
- https://www.tiktok.com/@surfcityusa

---

## 1) Scope & Canonical Domain
- **Organization:** Visit Huntington Beach (VHB), the official DMO for Huntington Beach, CA.
- **Canonical domain (HTTPS only):** `https://www.surfcityusa.com/`
- **Primary language:** en-US
- **Timezone for dates/times:** America/Los_Angeles
- **Content focus:** Beaches, surfing, events, restaurants & bars, accommodations, itineraries, accessibility, transportation, wedding & meeting planning, visitor information.

---

## 2) Sitemaps (use these first)
- **XML sitemap (machine-readable):** `https://www.surfcityusa.com/sitemap.xml`
- **HTML sitemap (human-readable overview):** `https://www.surfcityusa.com/sitemap/`

**Guidance:** Default to the **XML sitemap** to discover URLs and freshness signals. Use the HTML sitemap as a taxonomy reference for site sections and category hierarchies.
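
A minimal sketch of the sitemap-first discovery described above, assuming the XML sitemap is a plain `urlset` document in the sitemaps.org 0.9 namespace (it may instead be a sitemap index, which this sketch does not follow). The function name `parse_sitemap` and the sample entries, including the `lastmod` date, are illustrative and not taken from the live site.

```python
import xml.etree.ElementTree as ET

# Namespace used by standard sitemap files (sitemaps.org protocol 0.9).
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}

# Made-up sample; a crawler would download sitemap.xml instead.
SAMPLE = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url>
    <loc>https://www.surfcityusa.com/events/</loc>
    <lastmod>2025-08-20</lastmod>
  </url>
  <url>
    <loc>https://www.surfcityusa.com/things-to-do/beaches/</loc>
  </url>
</urlset>"""


def parse_sitemap(xml_text):
    """Return (loc, lastmod) pairs from a urlset document; lastmod may be None."""
    root = ET.fromstring(xml_text)
    entries = []
    for url_el in root.findall("sm:url", SITEMAP_NS):
        loc = url_el.findtext("sm:loc", default="", namespaces=SITEMAP_NS).strip()
        lastmod = url_el.findtext("sm:lastmod", default=None, namespaces=SITEMAP_NS)
        if loc:
            entries.append((loc, lastmod.strip() if lastmod else None))
    return entries


if __name__ == "__main__":
    for loc, lastmod in parse_sitemap(SAMPLE):
        print(loc, lastmod)
```

Entries without a `lastmod` value come back as `None`, so a crawler can fall back to HTTP headers for freshness.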

---

## 3) Crawl Etiquette & Access
- **Respect `robots.txt`** and standard robots directives.
- **Recommended rate limit:** ≤ 1 request/second, burst ≤ 5 concurrent; back off on 429/503.
- **Crawl windows:** Prefer 02:00–06:00 America/Los_Angeles to minimize impact.
- **Do not crawl** external ticketing/commerce systems aggressively (e.g., Eventbrite, Shopify store) linked from the site.

---

## 4) URL Handling & Canonicalization
Normalize all URLs before de-duplication:
- **HTTPS only.** Treat HTTP → HTTPS as canonical.
- **Trailing slash:** Honor the site’s canonical form (most pages end with `/`). Normalize to the version returned by `<link rel="canonical">` if present.
- **Lowercase path segments** when the site is case-insensitive; otherwise preserve site case.
- **Query parameters to drop for canonicalization:** `utm_source`, `utm_medium`, `utm_campaign`, `utm_term`, `utm_content`, `utm_id`, `gclid`, `fbclid`, `msclkid`, `srsltid`, `mc_cid`, `mc_eid`, `hss_channel`, `hss_channel_landing`.
- **Fragment identifiers** (`#...`) are non-canonical and should be ignored for de-duplication.
- **Listings canonical key:** For pages under `/listing/{slug}/{numeric-id}/`, use the **numeric ID** as the unique key for de-duplication across categories/tags.

---

## 5) Extraction Priorities (LLM guidance)
Focus extraction on user-facing, evergreen, and high-intent travel content. Use JSON-LD (Schema.org) when present; otherwise extract from visible page content.
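
As an illustration of the "JSON-LD first" rule, the sketch below collects `application/ld+json` blocks from a page using only the Python standard library. The names `JsonLdCollector` and `extract_jsonld` and the sample HTML are hypothetical; how the site actually embeds structured data is an assumption here.

```python
import json
from html.parser import HTMLParser


class JsonLdCollector(HTMLParser):
    """Collects the raw text of every <script type="application/ld+json"> element."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self.blocks.append("".join(self._buffer))
            self._in_jsonld = False


def extract_jsonld(html):
    """Return a flat list of JSON-LD objects; malformed blocks are skipped."""
    collector = JsonLdCollector()
    collector.feed(html)
    items = []
    for raw in collector.blocks:
        try:
            data = json.loads(raw)
        except json.JSONDecodeError:
            continue  # caller falls back to visible page content
        if isinstance(data, dict) and isinstance(data.get("@graph"), list):
            data = data["@graph"]  # unwrap a top-level @graph container
        candidates = data if isinstance(data, list) else [data]
        items.extend(item for item in candidates if isinstance(item, dict))
    return items


# Made-up page for illustration only.
SAMPLE_HTML = """<html><head><title>Sample Event</title>
<script type="application/ld+json">
{"@context": "https://schema.org", "@type": "Event",
 "name": "Sample Event", "startDate": "2025-10-03"}
</script></head><body><h1>Sample Event</h1></body></html>"""

if __name__ == "__main__":
    print(extract_jsonld(SAMPLE_HTML))
```

An empty result signals that no usable structured data was found, at which point extraction would proceed from the visible content.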

### 5.1 Global fields (extract from **every** content page)
- `url`, `canonical_url`
- `title` (H1 preferred; fall back to `<title>` if necessary)
- `meta_description`
- `section` (top-level site section, e.g., “Things to Do”, “Events”)
- `breadcrumbs` (text + URLs)
- `summary` (2–4 sentence abstract distilled from body)
- `body_html` (sanitized), `body_text` (plain text)
- `primary_image` (URL), `gallery_images` (URLs + captions/alt)
- `lastmod` (from sitemap or HTTP headers), `date_published` / `date_updated` (when available)
- `internal_links` (same-domain, deduped)
- `external_links` (ticketing, partner, official sites)
- `tags` (site categories, cuisine types, amenities, etc.)

### 5.2 Entity types & required fields

#### A) **Beaches** (e.g., Huntington City Beach, Dog Beach, Bolsa Chica State Beach, Huntington State Beach, Sunset Beach)
- `entity_type: "Beach"`
- `name`, `aka` (if any)
- `location` (address/neighborhood; coordinates if present)
- `amenities` (bonfire pits, restrooms, showers, lifeguards, ADA mats, rentals nearby)
- `rules` (dogs, fires, alcohol, drones; link to **Beach Safety Tips** page)
- `accessibility_features`
- `parking` (lots, fees, hours)
- `related_links` (live surf cam, surf report, nearby attractions)
- `best_for` (families, surfers, dogs, sunsets)
- `photos` (with captions/credits when available)

#### B) **Surfing** (hub + subpages: live cam, surf report, rentals, lessons, shops, history/honors)
- `entity_type: "SurfResource"`
- `resource_type` (e.g., Live Cam, Report, Rental, Lesson, Shop, Hall of Fame)
- `name`, `url`, `hours` (if relevant), `contact`, `pricing` (if listed)
- `board_rentals`, `wetsuit_rentals`, `lessons_available`
- `nearby_beach`

#### C) **Attractions, Parks & Nature** (e.g., Huntington Beach Pier, International Surfing Museum, Bolsa Chica Ecological Reserve)
- `entity_type: "Attraction"`
- `name`, `description_short`, `highlights` (bulleted)
- `address`, `hours`, `admission`, `contact` (phone/email/site)
- `accessibility_features`, `family_friendly` (boolean)
- `category` (museum, nature reserve, pier, art center)

#### D) **Events** (event detail pages and the event calendar)
- `entity_type: "Event"`
- `title`, `description_short`
- **Dates:** `start_date`, `end_date`, `start_time`, `end_time`, `timezone: "America/Los_Angeles"`
- `recurrence` (RRULE or human-readable; e.g., “weekly on Thursday”)
- `venue_name`, `venue_address`, `neighborhood`
- `cost` (free/paid + price), `age_restrictions`
- `organizer` (name/URL), `ticket_url` (external OK)
- `categories` (festival, sports, live music, community)
- `images`
- `last_verified` (crawl time)

_If calendar list pages summarize events, follow the event detail link and prefer that as canonical._

#### E) **Restaurants, Bars, Coffee & Sweets**
- `entity_type: "Restaurant" | "Bar" | "Cafe" | "Dessert"`
- `name`, `cuisine` (American, Asian, Italian, Mexican, Seafood, etc.)
- `features` (brunch, healthy, dog-friendly, outdoor dining, ocean view)
- `address`, `phone`, `website`, `hours`, `reservations_link` (if any)
- `price_range` (if provided), `dietary` (veg/gluten-free)
- `amenities` (parking, patio, live music)
- `neighborhood` / `shopping_center` (e.g., Pacific City, Bella Terra)

#### F) **Places to Stay** (Hotels, Motels, Oceanfront Resorts, Pet-Friendly, Vacation Rentals, RV Camping)
- `entity_type: "Lodging"`
- `lodging_type` (hotel, resort, motel, vacation rental, RV campground)
- `name`, `brand`, `address`, `phone`, `website`
- `amenities` (pool, spa, beachfront, ocean view, pet friendly, parking, accessibility)
- `check_in`, `check_out`, `resort_fee` (if disclosed)
- `packages` / `specials` (title + validity dates)
- `notes` (e.g., seasonal operating windows for RV campgrounds; reservation windows)

#### G) **Itineraries & Trip Ideas**
- `entity_type: "Itinerary"`
- `audience` (family, outdoor adventure, pet-friendly, historical, staycation)
- `duration` (half-day, full-day, weekend, multi-day)
- `stops` (ordered list of POIs with URLs), `map_link` (if present)
- `tips` (parking, timing, gear)

#### H) **Visitor Info & Logistics**
- `entity_type: "VisitorInfo"`
- `subtype` (Maps, Weather, FAQs, Transportation & Parking, Airports, Directions, Ground Transportation, Accessible Travel)
- `content_summary`
- `important_links`

#### I) **Weddings & Venues**
- `entity_type: "WeddingVenue"`
- `name`, `setting` (beach, resort, golf club, harbor)
- `capacity`, `ceremony_locations`, `reception_locations`
- `preferred_vendors` (if listed), `permit_requirements`

#### J) **Publications & Guides**
- `entity_type: "Publication"`
- `name` (Visitors Guide, Walking Tours)
- `format` (PDF, web), `download_url`, `print_request_url` (if provided)

### 5.3 Page-type heuristics by path (recognition)
- **Events:** `/events/` (calendar, annual events, event detail pages)
- **Listings (businesses/venues):** `/listing/{slug}/{id}/` (use numeric `{id}` as canonical key)
- **Beaches:** `/things-to-do/beaches/` and children (individual beach pages)
- **Surfing:** `/things-to-do/surfing/` and children (live cam, report, rentals, shops, lessons)
- **Eat & Drink:** `/eat-drink/` (find-a-restaurant, cuisines, bars & nightlife, coffee & sweets)
- **Places to Stay:** `/places-to-stay/` (hotels, motels, oceanfront, pet friendly, RV, vacation rentals)
- **Plan Your Visit:** `/plan-your-visit/` (deals, itineraries, visitor info, transportation & parking, accessible travel)
- **Weddings:** `/plan-your-visit/getting-married.../venues/...`
- **About/Contact/Policies:** `/about-us`, `/contact-us`, `/privacy-policy`, `/web-accessibility`

### 5.4 De-duplication & consolidation
- Consolidate entities that appear in multiple categories (e.g., a restaurant that is both “Seafood” and “Ocean View”).
- Prefer the **listing detail page** as the single source of truth; merge tags from category pages.
- If structured data and prose disagree, prefer **Schema.org JSON-LD** fields; keep prose as supplemental notes.
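
The consolidation rules above, combined with the tracking-parameter list and listing key from section 4, can be sketched as follows. This is a simplified illustration, not a definitive implementation: the helper names (`canonicalize`, `dedup_key`, `consolidate`), the slug `sample-cafe`, and the ID `123` are hypothetical, and trailing-slash and path-case normalization are deliberately left out. Keeping the first-seen record's fields is also an assumption; a real pipeline would prefer the listing detail page.

```python
import re
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Tracking parameters listed in section 4.
TRACKING_PARAMS = {
    "utm_source", "utm_medium", "utm_campaign", "utm_term", "utm_content",
    "utm_id", "gclid", "fbclid", "msclkid", "srsltid", "mc_cid", "mc_eid",
    "hss_channel", "hss_channel_landing",
}
# Matches /listing/{slug}/{numeric-id}/ and captures the numeric ID.
LISTING_RE = re.compile(r"^/listing/[^/]+/(\d+)/?$")


def canonicalize(url):
    """Force HTTPS, drop tracking parameters, and strip the fragment."""
    parts = urlsplit(url)
    query = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key.lower() not in TRACKING_PARAMS
    ]
    return urlunsplit(("https", parts.netloc.lower(), parts.path, urlencode(query), ""))


def dedup_key(url):
    """Use the numeric listing ID when available, else the canonical URL."""
    canonical = canonicalize(url)
    match = LISTING_RE.match(urlsplit(canonical).path)
    return f"listing:{match.group(1)}" if match else canonical


def consolidate(records):
    """Merge records that share a key, taking the union of their tags."""
    merged = {}
    for rec in records:
        key = dedup_key(rec["url"])
        entry = merged.setdefault(
            key, {**rec, "canonical_url": canonicalize(rec["url"]), "tags": []}
        )
        for tag in rec.get("tags", []):
            if tag not in entry["tags"]:
                entry["tags"].append(tag)
    return merged


# Made-up records: the same listing reached via two category pages.
RECORDS = [
    {"url": "http://www.surfcityusa.com/listing/sample-cafe/123/?utm_source=news#map",
     "name": "Sample Cafe", "tags": ["Seafood"]},
    {"url": "https://www.surfcityusa.com/listing/sample-cafe/123/",
     "name": "Sample Cafe", "tags": ["Ocean View", "Seafood"]},
]

if __name__ == "__main__":
    print(consolidate(RECORDS))
```

Both sample records collapse into a single entry keyed by the numeric ID, with the tags from each category page merged in first-seen order.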

### 5.5 Dates, times & recurrence
- Parse all dates/times in **America/Los_Angeles**.
- Normalize recurrence into RRULE when possible (e.g., “weekly on Thursday” → `FREQ=WEEKLY;BYDAY=TH`).
- Store both machine fields and human summary for display.

### 5.6 Media handling
- Capture `primary_image` and `gallery_images` with captions/credits when present.
- Do not hotlink images for redistribution; store only URLs and metadata.

### 5.7 Quality & safety checks
- Discard obvious scraped UI chrome (e.g., repeated “Visitors Guide” widgets) from summaries.
- Avoid copying user-generated review text that may be syndicated; link instead.
- Flag prices/hours as **`unverified`** if not explicitly stated on-page.

---

## 6) Exclusions & De-prioritizations
- **External commerce/ticketing:** e.g., Eventbrite pages linked from Tours/Events (extract URL only).
- **Shopify store:** `/online-store/` (do not crawl product/catalog in depth).
- **Search results pages** and parameterized sort/filter URLs (avoid crawling `?query=`, `?sort=`, etc.).
- **Duplicate print views** or pages differing only by tracking parameters.

---

## 7) Preferred Structured Data (when available)
- **Use Schema.org JSON-LD** for `Event`, `Place`, `LocalBusiness`, `TouristAttraction`, `LodgingBusiness`, `Restaurant`, `Organization`.
- On conflict: **JSON-LD > microdata > visible prose**.
- Keep `@id` and `sameAs` links.

---

## 8) Output Suggestions (non-binding)
LLMs may emit normalized JSON for each entity.

Example (Event):

```json
{
  "entity_type": "Event",
  "title": "Pacific Airshow",
  "start_date": "2025-10-03",
  "end_date": "2025-10-05",
  "timezone": "America/Los_Angeles",
  "venue_name": "Huntington City Beach",
  "venue_address": "Huntington Beach, CA",
  "cost": "Free",
  "categories": ["Annual Event", "Airshow"],
  "canonical_url": "https://www.surfcityusa.com/events/...",
  "images": ["https://.../airshow.jpg"],
  "last_verified": "2025-09-04T10:00:00-07:00"
}
```

---

## 9) Organization, Contact & Provenance
**Visit Huntington Beach (VHB)**
VHB Visitor Center, 155 Fifth Street – Suite 111, Huntington Beach, CA 92648-5171
Phone: +1 (714) 969-3492 | +1 (800) 729-6232
Email: info@surfcityusa.com

- **Official site:** `https://www.surfcityusa.com/`
- **Sitemaps:** `https://www.surfcityusa.com/sitemap.xml` (XML), `https://www.surfcityusa.com/sitemap/` (HTML)

**Content rights:** © Visit Huntington Beach. Use of content is subject to VHB terms and applicable law. For reuse or licensing, contact VHB.

---

## 10) Change Log (this file)
- **2025-08-29:** Initial comprehensive LLM guidance published.