Static sites are great—fast, secure, and cheap. But they often lack interactivity. I wanted to let visitors “chat” with my portfolio, asking questions like “What experience does Samson have with Python?” or “Tell me about the EV Trip Analyzer project.”
Instead of adding a third-party widget, I built a custom, native-feeling solution using the Cloudflare ecosystem. Here is exactly how it works.
The Architecture#
The system uses Retrieval-Augmented Generation (RAG). We don’t just ask the AI a question; we first find relevant content from my portfolio, feed it to the AI as context, and then ask it to answer.
- Frontend: Hugo (Blowfish Theme) + Vanilla JS
- Edge Compute: Cloudflare Pages Functions
- Inference: Workers AI (Meta Llama 3)
- Vector Database: Cloudflare Vectorize
- Embeddings: BAAI bge-base-en-v1.5
1. The Backend: Cloudflare Pages Functions#
Cloudflare Pages allows you to drop serverless functions into a /functions directory. I created functions/api/chat.js to handle the chat requests.
Configuration (wrangler.toml)#
First, we bind the necessary resources to our application.
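(If you are following along, the Vectorize index itself has to exist before it can be bound. It can be created with Wrangler first, e.g. npx wrangler vectorize create portfolio-index --dimensions=768 --metric=cosine, where 768 matches the output dimension of bge-base-en-v1.5.)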
# wrangler.toml
[ai]
binding = "AI" # Access to Workers AI models
[[vectorize]]
binding = "VECTORIZE_INDEX"
index_name = "portfolio-index" # Our vector databaseThe API Logic#
The function performs three main steps:
- Embed: Convert the user’s query into a vector.
- Search: Query the VECTORIZE_INDEX for similar content chunks.
- Generate: Send the context + query to Llama 3 and stream the response.
sequenceDiagram
participant User
participant Frontend
participant Function as CF Function
participant VectorDB as Vectorize
participant AI as Workers AI
User->>Frontend: Asks Question
Frontend->>Function: POST /api/chat
Function->>AI: Generate Embedding
AI-->>Function: Vector [0.1, 0.5...]
Function->>VectorDB: Query Index(vector)
VectorDB-->>Function: Top 3 Matches
Function->>AI: Generate(System Prompt + Context + Query)
AI-->>Function: Token Stream
Function-->>Frontend: SSE Stream
Frontend-->>User: Update UI
// functions/api/chat.js (Simplified)
export async function onRequest(context) {
  const { query } = await context.request.json();

  // 1. Retrieval: Convert question to vector & search index
  const { data } = await context.env.AI.run('@cf/baai/bge-base-en-v1.5', { text: [query] });
  const vector = data[0];
  const results = await context.env.VECTORIZE_INDEX.query(vector, { topK: 3, returnMetadata: true });

  // Combine matched text chunks
  const contextText = results.matches.map(m => m.metadata.text).join("\n---\n");

  // 2. Generation with System Prompt
  const systemPrompt = `You are a helpful assistant for Samson's portfolio.
Use the following Context to answer the user.
Context: ${contextText}`;

  const stream = await context.env.AI.run('@cf/meta/llama-3-8b-instruct', {
    messages: [
      { role: "system", content: systemPrompt },
      { role: "user", content: query }
    ],
    stream: true // Enable streaming response
  });

  return new Response(stream, {
    headers: { "Content-Type": "text/event-stream" }
  });
}

2. The Knowledge Base: Ingesting Content#
The AI needs to know about my posts. I wrote a script (scripts/generate_embeddings.js) that runs during the build process.
flowchart LR
MD[Markdown Files] -->|Parse| Script[Node.js Script]
Script -->|Split| Chunks[Text Chunks]
Chunks -->|API| AI[Workers AI]
AI -->|Embedding| Vectors[Vector Data]
Vectors -->|Upsert| DB[(Vectorize Index)]
- It scans all .md files in content/.
- It parses the frontmatter and content.
- It splits the text into chunks of ~500 tokens.
- It generates embeddings via the Cloudflare API and pushes them to Vectorize.
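The script (shown in full after the guardrails below) leans on three helpers: splitText, getEmbedding, and upsertVectors. Here is a minimal sketch of the first two, assuming CF_ACCOUNT_ID and CF_API_TOKEN environment variables and the standard Workers AI REST endpoint; upsertVectors wraps the Vectorize REST API in the same fashion:

// scripts/generate_embeddings.js (helper sketch)
const ACCOUNT_ID = process.env.CF_ACCOUNT_ID;
const API_TOKEN = process.env.CF_API_TOKEN;

// Naive chunker: treats a word as roughly one token and groups
// maxTokens words per chunk. Good enough for prose content.
function splitText(text, maxTokens) {
  const words = text.split(/\s+/).filter(Boolean);
  const chunks = [];
  for (let i = 0; i < words.length; i += maxTokens) {
    chunks.push(words.slice(i, i + maxTokens).join(" "));
  }
  return chunks;
}

// Embeds one chunk of text via the Workers AI REST API.
async function getEmbedding(text) {
  const res = await fetch(
    `https://api.cloudflare.com/client/v4/accounts/${ACCOUNT_ID}/ai/run/@cf/baai/bge-base-en-v1.5`,
    {
      method: "POST",
      headers: { Authorization: `Bearer ${API_TOKEN}` },
      body: JSON.stringify({ text: [text] })
    }
  );
  const json = await res.json();
  return json.result.data[0]; // a single 768-dimensional vector
}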
Safety Guardrails#
Because Large Language Models can hallucinate or be tricked into saying inappropriate things, I implemented strict system prompts. The AI is explicitly instructed to:
- Maintain a professional tone.
- Stick to the context. If the answer isn’t in my portfolio, it admits it rather than making things up.
- Never disparage. Explicit instructions forbid generating negative content about the portfolio, projects, or individuals.
// functions/api/chat.js
const systemPrompt = "You are a helpful assistant for Samson's portfolio. " +
"Answer concisely based on the context. If uncertain, admit it. " +
"Always maintain a positive and professional tone. " +
"Never generate negative, critical, or disparaging content about the portfolio, projects, or any individuals.";// scripts/generate_embeddings.js
// Deps (assumed): glob for file matching, gray-matter for frontmatter parsing
import fs from 'node:fs';
import path from 'node:path';
import glob from 'glob';
import matter from 'gray-matter';

const vectors = [];

// 1. Find all Markdown files
const files = glob.sync("content/**/*.md");

for (const file of files) {
  const { content, data } = matter(fs.readFileSync(file, 'utf8'));

  // 2. Split into chunks (~500 tokens)
  const chunks = splitText(content, 500);

  for (let i = 0; i < chunks.length; i++) {
    const chunk = chunks[i];

    // 3. Generate Embedding using Workers AI
    const embedding = await getEmbedding(chunk);

    // 4. Prepare vector record
    vectors.push({
      id: `${path.basename(file, '.md')}-${i}`,
      values: embedding,
      metadata: {
        text: chunk,
        url: "/" + path.relative("content", file).replace(".md", "")
      }
    });
  }
}

// 5. Batch upsert to Vectorize
await upsertVectors(vectors);

3. The Frontend: Modular & Native#
I didn’t want a generic chatbot iframe. It had to look like it belonged to the Blowfish theme. The widget started life as a simple script, but I recently refactored the frontend into a robust AIChatWidget class to support advanced features like history persistence and offline handling.
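As a rough sketch, the module’s shape looks something like this (method bodies trimmed). The initChat export and the window.openAiChat global are the two entry points the integration snippet in section 4 relies on; the chat-open body class is an assumption I use later to lock background scrolling on mobile:

// assets/js/ai-chat.js (skeleton sketch)
class AIChatWidget {
  constructor() {
    this.messages = this.loadHistory(); // restore a previous session
    this.window = this.renderWindow();  // build the chat DOM once
  }

  open() {
    document.body.classList.add('chat-open'); // pairs with the CSS scroll lock
    this.window.classList.remove('hidden');
  }

  close() {
    document.body.classList.remove('chat-open');
    this.window.classList.add('hidden');
  }

  renderWindow() { /* build and return the #ai-chat-window element */ }
  async sendMessage(text) { /* POST to /api/chat and stream tokens (section 5) */ }
  loadHistory() { /* read messages from localStorage (section 5) */ }
  saveHistory() { /* persist messages to localStorage */ }
}

// Entry points consumed by the lazy-loading snippet in section 4
export function initChat(openImmediately = false) {
  const widget = new AIChatWidget();
  window.openAiChat = () => widget.open();
  if (openImmediately) widget.open();
}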
Theming with CSS Variables#
I mapped the chat widget’s colors to the theme’s CSS variables. This ensures the chat window automatically respects Light/Dark mode and the user’s chosen color scheme.
/* assets/css/ai-chat.css */
:root {
  /* Map to Blowfish variables */
  --ai-chat-bg: rgba(var(--color-neutral-50), 1);
  --ai-chat-primary: rgba(var(--color-primary-500), 1);
}

.dark {
  --ai-chat-bg: rgba(var(--color-neutral-800), 1);
  --ai-chat-bot-bg: rgba(var(--color-neutral-700), 0.5);
}

Mobile Responsiveness#
On mobile, popups are annoying. I used a CSS media query to turn the floating window into a full-screen experience on small screens, locking the background scroll to prevent glitches.
@media (max-width: 640px) {
#ai-chat-window {
position: fixed;
inset: 0; /* Full screen */
width: 100%;
height: 100dvh;
border-radius: 0;
}
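  /* Lock background scroll while the chat is open. Sketch: assumes
     the widget toggles a chat-open class on <body> (see section 3). */
  body.chat-open {
    overflow: hidden;
  }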
}

4. Integration via Event Delegation#
To make the chat accessible from anywhere (header, footer, blog posts), I implemented a global event listener. Any element with the class .js-chat-trigger will now lazy-load the widget and open it.
<!-- layouts/partials/extend-footer.html -->
<div id="ai-chat-widget">
<button id="ai-chat-toggle" class="js-chat-trigger" aria-label="Ask AI">
<!-- Icon -->
<span>Ask AI</span>
</button>
</div>
<script>
// Lazy load chat widget with event delegation
document.addEventListener('click', async (e) => {
  const trigger = e.target.closest('.js-chat-trigger');
  if (!trigger) return;
  e.preventDefault();

  if (!window.aiChatInitialized) {
    // Dynamically import the module only when needed
    const { initChat } = await import('{{ (resources.Get "js/ai-chat.js" | minify | fingerprint).RelPermalink }}');
    initChat(true);
    window.aiChatInitialized = true;
  } else if (window.openAiChat) {
    window.openAiChat();
  }
});
</script>

5. Recent Feature Updates#
Since the initial launch, I’ve rolled out several enhancements to make the assistant more robust and user-friendly:
💾 Persistent History#
The chat now saves your conversation to localStorage. If you navigate away to check a project page and come back, your conversation context remains intact.
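Under the hood this is a thin localStorage wrapper. A minimal sketch filling in the loadHistory/saveHistory stubs from the skeleton above (these are methods on AIChatWidget; the ai-chat-history key name is illustrative):

// assets/js/ai-chat.js (sketch: history persistence)
const HISTORY_KEY = 'ai-chat-history';

loadHistory() {
  try {
    return JSON.parse(localStorage.getItem(HISTORY_KEY)) || [];
  } catch {
    return []; // missing or corrupted entry: start fresh
  }
}

saveHistory() {
  // Called after every user and assistant message
  localStorage.setItem(HISTORY_KEY, JSON.stringify(this.messages));
}

clearHistory() {
  // Wired to the "Clear History" button described below
  this.messages = [];
  localStorage.removeItem(HISTORY_KEY);
}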
💡 Contextual Suggestions#
To help users get started, the chat now opens with clickable “suggestion chips” (e.g., “Python Experience?”). These disappear once the conversation starts to keep the interface clean.
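The chips are plain buttons generated from a hard-coded list. A sketch (the container class and suggestion strings here are illustrative):

// assets/js/ai-chat.js (sketch: suggestion chips, a method on AIChatWidget)
renderSuggestions() {
  const bar = this.window.querySelector('.ai-chat-suggestions');
  for (const text of ['Python Experience?', 'Tell me about the EV Trip Analyzer']) {
    const chip = document.createElement('button');
    chip.textContent = text;
    chip.onclick = () => {
      bar.remove();           // chips disappear once the chat starts
      this.sendMessage(text);
    };
    bar.appendChild(chip);
  }
}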
🛡️ Robust Error Handling#
Network glitches happen. The updated AIChatWidget class now checks for offline status before sending requests and handles API failures gracefully without crashing the UI.
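Roughly, the hardened sendMessage does an offline check up front, wraps the request in try/catch, and parses the stream line by line (Workers AI emits SSE lines of the form data: {"response": "..."} terminated by data: [DONE]). A sketch: addMessage and updateLastBotMessage are hypothetical helper names, and the line handling is simplified to assume whole lines per chunk:

// assets/js/ai-chat.js (sketch: robust streaming request)
async sendMessage(text) {
  if (!navigator.onLine) {
    this.addMessage('bot', 'You appear to be offline. Please try again later.');
    return;
  }
  this.addMessage('user', text);
  this.addMessage('bot', ''); // placeholder bubble to stream into

  try {
    const res = await fetch('/api/chat', {
      method: 'POST',
      headers: { 'Content-Type': 'application/json' },
      body: JSON.stringify({ query: text })
    });
    if (!res.ok) throw new Error(`HTTP ${res.status}`);

    // Read the SSE stream token by token
    const reader = res.body.getReader();
    const decoder = new TextDecoder();
    let answer = '';
    while (true) {
      const { done, value } = await reader.read();
      if (done) break;
      for (const line of decoder.decode(value, { stream: true }).split('\n')) {
        if (!line.startsWith('data: ') || line.includes('[DONE]')) continue;
        answer += JSON.parse(line.slice(6)).response ?? '';
        this.updateLastBotMessage(answer); // re-render as tokens arrive
      }
    }
    this.messages.push({ role: 'assistant', content: answer });
    this.saveHistory();
  } catch (err) {
    // Fail gracefully instead of crashing the UI
    console.error('AI chat request failed:', err);
    this.updateLastBotMessage('Sorry, something went wrong. Please try again.');
  }
}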
🧹 Session Management#
I added a “Clear History” button to the header, allowing users to wipe their local conversation history and start fresh with a single click.
Future Roadmap#
With the core architecture solid, here is what I plan to add next:
- Markdown Parsing: Currently the bot outputs raw text. I want to render **bold**, lists, and code blocks properly on the fly as tokens stream in.
- Syntax Highlighting: Using a lightweight library to highlight code snippets inside the chat bubble, matching the site’s theme.
- Voice Input: Integrating the Web Speech API to allow users to speak their questions instead of typing.
- Draggable UI: Making the chat window floating and draggable on desktop, saving its position for the next visit.
Conclusion#
By leveraging Cloudflare’s edge platform, I was able to build a fast, private, and deeply integrated AI assistant without managing a single server. The result is a portfolio that doesn’t just display information. It interacts with you.

