Alee
← All resources
SiteGPT & alternatives · 12 min read

What Is SiteGPT? A Complete Guide to AI Chatbots Trained on Your Website

What is SiteGPT? Learn how AI chatbots trained on your website work, what they cost, where they fall short, and how to choose the right tool.

Your website already contains the answers to most of the questions your visitors ask. They're buried in your docs, your pricing page, your FAQ, that one blog post from eighteen months ago. The problem isn't that the answers don't exist — it's that a person on a phone at 11pm doesn't want to dig through six pages to find them. They want to type a question and get a straight answer.

That gap is exactly what SiteGPT was built to close. It's one of the better-known tools in a category that has exploded over the past couple of years: AI chatbots you train on your own content, so they answer in your voice, about your business, with information you actually published.

This guide explains what SiteGPT is, how this kind of chatbot works under the hood, what it's genuinely good at, where the limits are, and how to think about choosing one. We'll be specific and fair — including about where competitors and alternatives like Alee fit in — so you can make a real decision instead of just collecting marketing claims.

What is SiteGPT, in plain terms

SiteGPT is a software-as-a-service tool that lets you build an AI chatbot trained on your website's content. You point it at your site (or upload documents), it reads and indexes that content, and you get a chat widget you can embed on your pages. When a visitor asks a question, the bot answers using your material rather than guessing from generic internet knowledge.

The core promise is simple: instead of writing hundreds of scripted question-and-answer pairs by hand — the way old-school chatbots worked — you let the AI learn from content you've already created. A few minutes of setup replaces what used to be weeks of decision-tree building.

SiteGPT sits in a broader category often called custom AI chatbots, RAG chatbots, or AI chatbots trained on your website. SiteGPT was an early, popular entrant, which is why so many people search for it by name. But the underlying approach is now offered by a wide range of tools, and understanding the approach matters more than understanding any single product.

What it's typically used for

People reach for a website-trained chatbot to handle a handful of recurring jobs:

  • Customer support deflection — answering common questions so fewer tickets reach a human.
  • Pre-sales questions — helping prospects understand pricing, features, and fit before they buy.
  • Lead capture — collecting an email or booking a call when the bot can't fully resolve a question.
  • Internal knowledge lookup — letting employees query internal docs in plain language.
  • Documentation help — turning a static help center into something you can actually ask.

How an AI chatbot trained on your website actually works

The phrase "trained on your website" is a little misleading, and it's worth understanding what's really happening — because it explains both why these bots are so good and why they sometimes fail.

Most of these tools do not retrain a large language model on your data. Training a model from scratch is enormously expensive and slow. Instead, they use a technique called retrieval-augmented generation, or RAG. Here's the flow, step by step.

Step 1: Ingestion

You give the tool your sources — your website URL, a sitemap, a help center, uploaded PDFs, or sometimes a connected knowledge base. The tool crawls and pulls the text from each page or document.

Step 2: Chunking and embedding

The raw text is split into smaller passages, or "chunks." Each chunk is converted into a numerical representation called an embedding — essentially a list of numbers that captures the meaning of that text. Chunks with similar meaning end up close together in this mathematical space. These embeddings are stored in a vector database.

Step 3: Retrieval

When a visitor asks a question, that question is also turned into an embedding. The system searches the vector database for the chunks whose embeddings are closest in meaning to the question. This is semantic search — it can match "How much does it cost?" to a pricing paragraph even if the word "cost" never appears in it.

Step 4: Generation

The most relevant chunks are bundled together with the visitor's question and sent to a large language model (an LLM like GPT-4-class or Claude-class models) along with an instruction such as: "Answer the question using only the context below. If the answer isn't there, say you don't know." The model writes a natural-language answer grounded in your content.

Why this matters

This architecture has three practical consequences you should keep in mind:

  • The bot is only as good as your content. RAG can't invent a refund policy you never wrote down. If your site is thin or contradictory, the bot will be too.
  • Grounding reduces but doesn't eliminate hallucination. Because the model is told to answer from your content, it's far less likely to make things up than a raw chatbot. But it can still misread, overreach, or fill gaps awkwardly, especially when retrieval surfaces the wrong chunk.
  • Updates are fast. Since nothing is being "retrained," you can re-crawl your site and the bot reflects new content quickly — usually a re-index rather than a multi-hour job.

What SiteGPT does well

SiteGPT earned its reputation for good reasons. If you're evaluating it specifically, here's where it tends to shine.

  • Fast setup. Point it at a URL, let it crawl, and you have a working bot in minutes. The time-to-first-answer is genuinely short.
  • Multiple content sources. Beyond crawling your site, it supports uploading files and pulling from help centers, so you can consolidate scattered knowledge.
  • Embeddable widget. The chat widget drops onto your site with a snippet of code, and you can adjust its look to fit your brand.
  • Lead handoff. It can collect visitor details and escalate to email or a human when a conversation goes beyond what it can answer.
  • Multilingual answers. Like most modern LLM-based bots, it can respond in many languages even when your source content is in one.

For a single business that wants one polished bot on one site, SiteGPT is a reasonable, capable choice.

Where website-trained chatbots fall short

No tool in this category is magic. Going in with clear eyes will save you frustration. These limitations apply to SiteGPT and to most of its alternatives — they're properties of the approach, not bugs in one product.

It can still get things wrong

Grounding the model in your content reduces hallucination dramatically, but "reduced" isn't "zero." If two pages on your site contradict each other, or if retrieval pulls a stale chunk, the bot can confidently state something wrong. You'll want a way to review real conversations and correct course.

Retrieval quality is the hidden variable

When a bot gives a weak answer, the culprit is usually retrieval, not the language model. If your content is poorly structured — giant undifferentiated pages, important facts split across many documents, inconsistent terminology — the system struggles to find the right chunk to feed the model. Better-organized content beats almost any tuning.

It won't replace your whole support team

A website-trained chatbot is excellent at the repetitive 60–80% of questions that have a documented answer. It is not a substitute for human judgment on refunds, edge cases, angry customers, or anything requiring account access and real decisions. The right mental model is deflection and triage, not full replacement.

Pricing can scale in surprising ways

Many tools price by messages, "credits," or training volume. A bot that's cheap during a quiet month can get expensive when traffic spikes or when you index a large site. Always check how a tool counts usage before you commit.

SiteGPT vs. the alternatives: how to compare

"What is SiteGPT" almost always leads to a second question: is it the right one for me? Rather than ranking tools, here's the honest way to compare anything in this space. Score each option on the dimensions that actually affect your day-to-day.

| Dimension | What to ask |
| --- | --- |
| Setup effort | How long from sign-up to a working bot on my site? |
| Content sources | Can it crawl my site and take docs, FAQs, help centers? |
| Answer quality | Does it cite sources? Can I see and fix bad answers? |
| Lead capture | Does it collect emails and route them where I need them? |
| Customization | Can I match my brand, tone, and the bot's personality? |
| White-label | Can I remove the vendor's branding — or resell the bot? |
| Pricing model | Per-message, per-credit, or flat? What happens at scale? |
| Multi-bot / multi-site | Can I run several bots from one account? |

Where a white-label tool changes the math

One distinction worth calling out: most chatbot tools are built for a single business putting a bot on its own site. But agencies, freelancers, and platforms often need to deploy bots for clients — under their own brand, not the vendor's.

That's the gap Alee is built around. Alee is a white-label AI chatbot platform: you train a bot on a business's own content using the same RAG approach described above, but you can run it fully under your own brand and stand up bots for multiple clients or sites from one place. If you're an agency or you're rebundling chatbots as part of a larger offering, white-label support stops being a nice-to-have and becomes the whole point. If you're a single business that just wants one bot, the white-label angle matters less — though it doesn't hurt to have the option as you grow.

The honest takeaway: SiteGPT is a strong pick for a standalone single-site bot. A white-label-first tool like Alee makes more sense when you're deploying on behalf of others or want to keep the experience entirely your own. Match the tool to the job rather than the hype.

How to set up an AI chatbot trained on your website

Whichever tool you choose, the setup arc is broadly the same. Here's a practical sequence that produces a bot people actually trust.

1. Clean up your source content first

This is the step everyone skips and everyone regrets. Before you train anything:

  • Fix or merge contradictory pages (two different return windows, two different prices).
  • Make sure your most important answers are written down somewhere — pricing, policies, "how do I get started," top objections.
  • Break up giant walls of text into clearly headed sections so retrieval has clean chunks to grab.

Thirty minutes of content cleanup beats hours of fiddling with bot settings.

2. Connect your sources

Point the tool at your site URL or sitemap, and add any extra material — help center articles, PDFs, a product spec. More relevant content is good; more junk content is not. Skip pages that don't help answer questions (legal boilerplate, archived posts) if the tool lets you.

3. Set the bot's instructions and tone

Most tools let you write a system prompt or persona. Be explicit:

  • Tell it to answer only from your content and to admit when it doesn't know.
  • Set the tone (friendly, concise, formal) to match your brand.
  • Define what to do when stuck — offer to capture an email, point to a contact page, or hand off to a human.

4. Test like a skeptical customer

Don't test with softball questions. Ask the things customers actually ask, including the awkward ones: "Is there a free trial?" "Can I get a refund after 40 days?" "Do you integrate with X?" Note every weak answer — they almost always point to a content gap you can fix.

5. Turn on lead capture and embed it

Configure the bot to collect contact details at the right moment (after it helps, or when it can't fully resolve a question), then drop the widget on your site. Start it on a few pages, watch real conversations, and expand once you trust it.

6. Review conversations weekly

The single highest-leverage habit: read what people actually asked and how the bot answered. Each bad answer is a free roadmap — add the missing content, fix the contradiction, adjust the instructions. A reviewed bot gets noticeably better over a few weeks; an unwatched one quietly drifts.

Is a website-trained chatbot right for you?

A quick gut check. You're a strong candidate if:

  • You field the same questions over and over and the answers already live on your site or in your docs.
  • You're losing visitors who can't find answers fast enough, especially outside business hours.
  • You want to capture leads from people who are clearly interested but not ready to fill out a contact form.
  • You (or your clients) have documentation, FAQs, or knowledge bases that could be doing more work.

You should temper expectations if your business runs almost entirely on bespoke, account-specific answers that aren't documented anywhere, or if you have very little written content to train on. In those cases, fix the content situation first — the chatbot will only ever be as good as what you feed it.

Frequently asked questions

Is SiteGPT the same as ChatGPT?

No. ChatGPT is a general-purpose assistant from OpenAI that answers from broad training data. SiteGPT (and tools like it) wrap a language model in a system that grounds answers in your specific content using retrieval-augmented generation. The model doing the writing may be similar, but the difference is the source of truth: ChatGPT draws on the open internet, while a website-trained bot answers from the pages and documents you give it.

Does the chatbot actually get "trained" on my data?

In most cases, not in the literal sense. These tools rarely retrain a language model on your content. Instead they index your content into a searchable vector database and feed the most relevant pieces to the model at the moment a question is asked. The practical upside is that updates are fast — re-crawl your site and the bot reflects the changes without a lengthy retraining process.

Will the chatbot make up answers?

It can, but grounding makes it far less likely. Because the bot is instructed to answer from your content and to admit uncertainty, it hallucinates much less than a raw chatbot would. The remaining risk usually comes from contradictory content, stale pages, or the system retrieving the wrong passage. Clean, well-organized content and regular conversation reviews are the best defenses.

How much does an AI website chatbot cost?

It varies widely. Many tools charge a monthly subscription with tiers based on the number of messages, "credits," or how much content you index. Costs can climb if your traffic spikes or your site is large, so check how each tool counts usage before committing. Free tiers and trials are common, which makes it cheap to test fit before you pay. Alee offers a free signup so you can try the approach without spending anything up front.

Can I put the chatbot on a client's site under my own brand?

That depends on the tool. Most single-business chatbot products show their own branding and aren't designed for reselling. If you're an agency or platform that needs to deploy bots for clients under your name, look specifically for white-label support — that's the core reason Alee exists, as opposed to single-site tools where branding control is limited.

How long does it take to set one up?

The bot itself can be live in minutes — point it at your site, let it index, embed the widget. Getting it good takes a bit longer: budget some time up front to clean your source content, then a few weeks of light weekly reviews to close gaps. The technical setup is the easy part; the content quality is what separates a bot people trust from one they ignore.

Try it free

If you've read this far, you already know the answers to most of your visitors' questions are sitting on your site waiting to be put to work. The fastest way to understand a website-trained chatbot is to build one and ask it real questions. Alee lets you train a bot on your own content, capture leads automatically, and run it under your own brand — and you can get started free in a few minutes, no credit card required. Point it at your site, ask it the questions your customers actually ask, and decide for yourself.

Build your own AI chatbot with Alee

Train it on your site, embed it anywhere, capture leads 24/7. Free to start.

Related reading