Train Your Chatbot on YouTube Videos with Alee
Step-by-step tutorial on how to train your Alee chatbot on YouTube videos, using transcripts, playlists, channels, and handling missing captions.
If you have hours of YouTube content, you have already answered most of your audience's questions on camera. Alee lets you turn that spoken knowledge into a chatbot that answers in text, instantly, with sources. This tutorial walks through adding YouTube videos as a knowledge source, how Alee uses the transcript, how to handle playlists and channels, and what to do when a video has no captions.
Why train on YouTube at all
When you add a YouTube video to Alee, it does not "watch" the video. It pulls the video's transcript (the spoken words as text), splits that text into small chunks, turns each chunk into a vector embedding, and stores them in your bot's knowledge brain (a pgvector index). After that, when a visitor asks a question, Alee embeds the question, finds the closest chunks from your videos, and the model writes an answer grounded only in what you actually said on camera, with a link back to the source. If the answer is not in your content, the bot says it does not know instead of making something up.
This is the same Advanced RAG pipeline Alee uses for websites and PDFs, so YouTube content sits alongside every other source in one brain. A coach can mix a few long YouTube explainers with their FAQ text; a gym can combine a class-schedule page with form-check videos. The bot retrieves from all of them.
A few things worth knowing up front:
- The transcript is the data, not the video. The richer and clearer the speech, the better the answers. Rambling, music-only, or heavily edited cuts produce weaker chunks.
- Captions matter. A video with accurate captions (auto-generated or uploaded) trains cleanly. A video with no captions at all cannot be transcribed automatically — see the missing-transcript section below.
- Visual-only content is invisible. If something important only appears as on-screen text or a silent demo, the transcript will not capture it. Narrate the key points, or paste them as text.
Step 1: Open your bot's Sources
- Sign in to Alee and open the dashboard.
- Select the bot you want to train (or create one first — start free gives you one bot and 200 messages a month).
- Open the bot's Sources tab — this is where every knowledge source lives.
- Click Add source and choose the YouTube option from the list of source types.
If you are brand new to Alee, skim the features overview first so you know where leads, analytics, and customization sit relative to Sources.
Step 2: Add a single YouTube video
The most common path is one video at a time.
- In the Add-source dialog, paste the full YouTube URL. Both formats work:
https://www.youtube.com/watch?v=VIDEO_IDhttps://youtu.be/VIDEO_ID
- Submit it. Alee fetches the transcript for that video.
- Alee chunks the transcript, generates embeddings, and adds the chunks to the brain. You will see the source appear in your Sources list, usually with a status while it processes.
- Once it shows as ready, the video is live. Open the chat preview and ask a question you know that video answers to confirm it is grounded.
That is the whole loop: paste, process, test.
A quick worked example
Say you run a home-bakery channel and you have a video titled "Why my sourdough went flat — 5 fixes." You paste its URL into your bot's Sources. Alee pulls the transcript, where you said things like "if your starter isn't doubling in four to six hours, it's underfed." A visitor later types "my bread came out dense, what did I do wrong?" Alee retrieves that chunk and replies with your underfed-starter advice, linking the video as the source. You never wrote an FAQ — the bot learned it from your spoken words.
Step 3: Add a playlist or a whole channel
You do not have to add videos one by one if you have many.
- Playlist: paste the playlist URL (it contains
list=). Alee can walk the playlist and pull each video's transcript so you train on the entire series at once. This is ideal for a structured course or a "start here" series. - Channel: if you want most or all of your library, point Alee at the channel and let it gather the videos. This is the fastest way to seed a bot from years of content.
Two practical notes for bulk adds:
- It takes longer. Many transcripts means more chunks and more embedding work. Add the batch, then come back; it processes in the background.
- Mind your bot count and message budget, not a video cap. Alee meters bots and monthly messages by plan, not the number of videos. Free gives one bot, Pro $9/mo gives two bots, Agency $49/mo gives five, and Scale $99/mo gives ten. Heavy libraries are fine — just pick a plan that fits how many separate bots and how much traffic you expect. See pricing for the full breakdown.
If you only want the three best videos from a 200-video channel, add those three individually. A focused brain often answers more cleanly than one stuffed with off-topic uploads.
Step 4: When a transcript is missing
Not every video can be transcribed automatically. You will hit this when:
- Captions are disabled on the video.
- The video is private, unlisted in a way Alee can't reach, age-restricted, or region-blocked.
- It is music-only or has no clear speech for auto-captions to work from.
- Auto-captions simply haven't been generated yet (common right after upload).
When Alee cannot get a usable transcript, the source will not produce good chunks — or will fail to process. Here is how to fix it:
- Turn on captions for your own videos. In YouTube Studio, enable auto-captions or upload an accurate caption file (
.srt/.vtt). Wait for them to publish, then re-add the video in Alee. - Make the video public or properly unlisted so the transcript is reachable.
- Paste the transcript as text instead. If you have a script or a transcript file, open Sources, choose the raw text / paste option, and drop the transcript in. This trains the brain on the exact same words without depending on YouTube captions at all. This is also your best move for someone else's video you can't edit.
- Use a PDF or document. If your transcript lives in a doc, add it as a PDF/document source — same result.
For non-English videos common with India-based creators (Hindi, Tamil, Hinglish, and so on): if YouTube has captions in that language, Alee can train on them, and the bot can answer from that content. If captions are missing or messy, pasting a cleaned-up transcript as text is the reliable path.
Step 5: Keep the brain fresh and tidy
- Re-add or refresh after edits. If you publish a new caption file or a corrected transcript, re-process the source so the brain picks up the better text.
- Remove videos you've unpublished. If you delete a video on YouTube, remove its source in Alee so the bot stops citing a dead link.
- Mix sources freely. YouTube transcripts work best alongside a website crawl, a few PDFs, and a pasted FAQ. The retriever pulls from everything, so cover the gaps your videos don't.
- Watch Top Questions. Alee's analytics show a Top Questions list and a triage inbox. If people keep asking something your videos almost answer, paste a tighter text source to nail it.
Putting it together
A solid YouTube-trained bot usually looks like this: your three to ten clearest explainer videos added as sources, a pasted FAQ for the things you never quite said on camera, and your main website crawled in for product and contact details. From there you customize the bot's name, brand color (the friendly red works, or your own), welcome message, and starter questions, then drop the one-line embed script on your site. For more setups, browse the tutorials and more guides. If you are weighing tools, the Alee vs SiteGPT comparison covers how the YouTube workflow stacks up.
Frequently asked questions
Does Alee watch the video or just read the transcript?
Alee uses the transcript — the spoken words as text. It does not analyze the picture, so anything shown only on screen and never said out loud won't make it into the brain. Narrate key points or paste them as text to capture them.
Can I train on someone else's YouTube video?
Yes, as long as it has a reachable transcript you have the right to use. You can't edit their captions, so if the transcript is missing or poor, copy the transcript text and add it through the paste-text source instead of the YouTube URL.
What happens if a visitor asks something the video doesn't cover?
The bot says it doesn't know rather than guessing — every answer is grounded only in your sources and self-checked before it's sent. That's your cue to add a source (another video, a pasted FAQ, or a page) that covers the gap.
Ready to turn your videos into a 24/7 answer machine? [Start free with Alee](/signup) and add your first YouTube source in minutes.
Try it in your own Alee bot
Train it on your site, embed it anywhere, capture leads 24/7. Free to start, no card.