Notes on shipping
AI in production.

Practical engineering posts from Ergini, a senior software & AI developer in Kosovo. RAG, human-in-the-loop patterns, AI scheduling, MVP cost, and the things vendor blogs skip.

Jun 15, 2026Founders12 min read

EU AI Act for Startups: What to Build Before August 2026

A builder's guide to the EU AI Act for startups: risk tiers, deployer duties, the August 2 2026 deadline, and the technical controls to ship now.

Read post→

Jun 15, 2026Kosovo & Eastern Europe10 min read

AI in Albanian (Shqip): What Works in 2026 and What Doesn't

How well do ChatGPT and Claude handle Albanian (shqip) in 2026? An honest look at quality, the gaps, and how to build reliable Albanian-language AI features.

Read post→

Jun 15, 2026Build Guides11 min read

AI Automation for Real Estate: 7 Workflows That Pay Off

Seven AI automation workflows for real estate agencies in 2026: lead qualification, listing copy, document handling, follow-up, and more, with build notes.

Read post→

Jun 15, 2026Build Guides11 min read

AI Receptionist for Small Business: Build vs Buy (2026)

What an AI receptionist actually does in 2026, the build-vs-buy math, and how to set one up that books appointments and answers calls without annoying callers.

Read post→

Jun 14, 2026Kosovo & Eastern Europe11 min read

Claude AI in Kosovo: How to Access and Build With It (2026)

Claude AI is not on claude.ai for Kosovo as of 2026. How to access Claude from Kosovo, the Albanian-language reality, and building on the Claude API.

Read post→

May 24, 2026AI Tools15 min read

AI Chatbot for Website: Build, Buy, or Both? (2026)

Honest 2026 guide to AI chatbots for websites: 8 platforms compared, build-your-own costs, knowledge-base sync, and hallucination control.

Read post→

May 21, 2026Founders14 min read

Build MVP for Startup: Real Cost in 2026 (Solo Dev)

Real MVP costs in 2026 from a developer who ships them solo. Line items, AI-MVP add-ons, where founders waste money, and a transparent rate card.

Read post→

May 18, 2026AI Engineering16 min read

RAG Architecture Tutorial: Production System in 2026

Step-by-step RAG architecture tutorial with TypeScript code, retrieval evaluation, and the five failure modes I hit shipping production RAG.

Read post→

May 15, 2026AI Tools13 min read

AI Scheduling Assistant: 9 Tools Tested in 2026

Honest 2026 review of 9 AI scheduling assistants from a developer who shipped one. Pricing, capabilities, build vs buy, and what the marketing pages omit.

Read post→

May 12, 2026AI Engineering11 min read

Human in the Loop AI: Patterns That Ship in 2026

Real human-in-the-loop AI patterns from production: approval queues, confidence-based routing, post-hoc audit, and active learning. With code and case studies.

Read post→

May 11, 2026Build Guides15 min read

Build an AI Voice Agent with Twilio and ElevenLabs (2026)

End-to-end AI voice agent on a real phone number. Twilio + ElevenLabs + GPT, with interruption handling, sub-800ms latency, and warm transfer.

Read post→

May 8, 2026AI Engineering13 min read

Vector Database Comparison 2026: Pinecone, Qdrant, pgvector

A senior engineer's take on Pinecone vs Qdrant vs Weaviate vs pgvector. Benchmarks, total cost, and the crossover point where self-hosting wins.

Read post→

May 6, 2026Build Guides12 min read

Build an AI Meeting Notes Tool (Your Own Otter)

Build an AI meeting transcription and summary tool with Whisper, speaker diarization, and structured output. Self-hosted, private, customizable.

Read post→

May 5, 2026Kosovo & Eastern Europe10 min read

Outsource Software Development to Kosovo: 2026 Guide

Kosovo offers CET-aligned, EU-adjacent, English-speaking senior dev talent at 60% off Western rates. The practical guide to making it work.

Read post→

May 4, 2026AI Tools11 min read

LangChain vs Vercel AI SDK: Which to Pick in 2026

A senior engineer's take on LangChain vs Vercel AI SDK for TypeScript AI apps. Architecture, bundle size, streaming, and when to use both.

Read post→

May 2, 2026Hiring12 min read

How to Hire an AI Developer in 2026 (Founder's Guide)

A senior AI dev's hiring playbook for founders. What to look for, how to interview, how to test for real LLM experience vs prompt-fluency theater.

Read post→

May 1, 2026AI Tools12 min read

n8n vs Zapier vs Make for AI Automation (2026)

Which automation platform wins for AI workflows in 2026? n8n's AI nodes vs Zapier's polish vs Make's visual canvas - broken down by use case.

Read post→

Apr 29, 2026Founders11 min read

The SaaS MVP Tech Stack I Use in 2026 (Ship in a Week)

Next.js, Supabase, Vercel, Stripe, Clerk, Resend. The exact MVP stack I have shipped 6+ products on, with the boilerplate decisions made for you.

Read post→

Apr 26, 2026Founders12 min read

Cost to Build an AI Chatbot in 2026: Honest Numbers

What it really costs to build an AI chatbot. Tiers from $5K demo to $200K enterprise, with what each tier gets you and where money is wasted.

Read post→

Apr 25, 2026Build Guides13 min read

AI Email Automation: Build Your Own Triage Agent (2026)

Build a private AI email agent for Gmail or Outlook that triages, labels, drafts replies, and respects your tone - without sending without approval.

Read post→

Apr 22, 2026AI Engineering12 min read

OpenAI API Cost in 2026: Real Numbers from Production

What OpenAI's API actually costs at scale. GPT-5, GPT-5-mini, caching, Batch API, and the cost-control patterns that cut my client bills 60%.

Read post→

Apr 19, 2026AI Tools11 min read

pgvector vs Pinecone: When to Switch (and When Not To)

pgvector handles 50M vectors on a normal Postgres box. Pinecone earns its price past 100M. Here is the honest crossover from production benchmarks.

Read post→

Apr 18, 2026Build Guides14 min read

Build an AI Customer Support Bot That Doesn't Hallucinate

End-to-end guide to building a production AI support bot with RAG, escalation logic, evals, and Intercom or Zendesk integration. Code included.

Read post→

Apr 16, 2026AI Tools11 min read

Claude Code vs Cursor: Daily-Driver Test (2026)

Claude Code is a terminal agent. Cursor is an AI-native IDE. I use both - here is the task-by-task breakdown of when each one earns its keep.

Read post→

Apr 15, 2026AI Tools11 min read

Claude vs ChatGPT for Developers (2026): Honest Take

A working dev's comparison of Claude Opus 4.7 vs GPT-5 on coding, context, pricing, and API reliability - based on daily shipped work.

Read post→

Apr 12, 2026AI Engineering13 min read

Fine-Tuning vs RAG: A 2026 Decision Framework

When to use RAG, when to fine-tune, when to do both. Real cost data, accuracy numbers, and a decision tree for picking the right approach.

Read post→

Apr 9, 2026Build Guides13 min read

Build an AI Lead Generation Tool (Not Another Scraper)

A signal-based AI lead-gen tool - intent detection, enrichment, scoring, and outbound copy - built from scratch with the Vercel AI SDK.

Read post→

Apr 8, 2026AI Engineering14 min read

Agentic RAG: Architecture Patterns That Ship in 2026

Agentic RAG turns retrieval into a reasoning loop. Here is how to design, evaluate, and ship one without melting your latency budget or your token bill.

Read post→

Apr 5, 2026AI Engineering11 min read

OpenAI Structured Outputs: Strict JSON Schema in 2026

Production guide to OpenAI structured outputs with Zod and Pydantic. Schema design, refusals, streaming, and migrating from JSON mode.

Read post→

Apr 2, 2026AI Engineering12 min read

Prompt Injection Defense: 8 Patterns That Work in 2026

Defense-in-depth strategies for prompt injection. Channel separation, output filtering, tool scoping, and the OWASP LLM Top 10 in plain English.

Read post→

Apr 1, 2026Build Guides13 min read

AI Document Extraction: From PDF Chaos to Clean JSON

Build an LLM-powered document extraction pipeline for invoices, contracts, and forms. Layout-aware OCR, schema design, validation, human review.

Read post→

Mar 30, 2026AI Engineering12 min read

LLM Eval Framework: DeepEval vs Braintrust vs RAGAS

Comparing the LLM eval frameworks engineers actually run in CI. Covers metrics, latency, cost, and which one to pick for RAG, agents, or chat.

Read post→

Mar 28, 2026AI Engineering12 min read

LLM Observability: Langfuse vs LangSmith vs Helicone

Side-by-side review of the top LLM observability platforms. Cost, integration time, framework support, and which to pick for agents vs RAG vs chat.

Read post→

Mar 25, 2026Tutorials14 min read

Build a Production MCP Server in TypeScript (2026)

MCP server tutorial in TypeScript with tools, resources, OAuth, and remote SSE transport. Wire it into Claude Code, Cursor, and Claude Desktop.

Read post→

Mar 22, 2026Build Guides12 min read

Build an AI Resume Screener That HR Will Trust

Build a bias-aware AI resume screener: structured extraction, rubric scoring, explanations, and the audit log that keeps recruiters confident.

Read post→

Mar 19, 2026Build Guides10 min read

AI Content Moderation: OpenAI Moderation API in Production

Build content moderation with the free OpenAI Moderation API, then layer custom classifiers and human review for edge cases.

Read post→

Mar 15, 2026AI Engineering11 min read

LLM Tool Calling Best Practices for Production Agents

How to design, name, scope, and document tools for reliable LLM tool calling. Parallel calls, error handling, prompt-injection-safe tool design.

Read post→

Mar 12, 2026AI Engineering9 min read

AI Workflow vs AI Agent: When to Pick Which

Workflows are predictable and cheap. Agents are flexible and expensive. Here is how to tell which one your problem actually needs, with examples.

Read post→

Mar 8, 2026AI Tools11 min read

Supabase vs Firebase for AI Apps in 2026

Supabase ships pgvector and Edge Functions; Firebase ships Genkit and Gemini integration. Here is the practical pick for AI-native apps.

Read post→

Mar 5, 2026AI Engineering11 min read

Embedding Models 2026: OpenAI vs Cohere vs Voyage vs BGE

Compares the leading embedding models for RAG. MTEB scores, context length, multilingual support, and price per million tokens.

Read post→

Mar 2, 2026Hiring11 min read

Cost to Hire an AI Developer in 2026 (US vs EU vs Balkans)

AI dev rates broken down by region, seniority, and engagement model. Includes the hidden costs most cost guides ignore.

Read post→

Feb 26, 2026Kosovo & Eastern Europe11 min read

Hiring Developers in the Balkans: The 2026 Honest Guide

Serbia, Albania, Kosovo, North Macedonia, Bosnia: rates, talent depth, English proficiency, and the practical playbook for hiring well.

Read post→

Feb 22, 2026AI Engineering12 min read

AI Agent Design Patterns: Reflection, Planning, Tool Use

The agent patterns that actually ship: reflection, planner-executor, ReAct, multi-agent handoff, and when to use a workflow instead.

Read post→

Feb 18, 2026AI Engineering12 min read

AI SaaS Architecture: Patterns from 5 Shipped Products

The architecture decisions every AI SaaS converges on: multi-tenant data, per-user limits, model routing, eval pipelines, and BYO-key support.

Read post→

Feb 14, 2026Founders10 min read

RAG Cost Per Query: The Full Breakdown (2026)

Embedding, vector DB, retrieval, generation. The full per-query economics of RAG at three scales - with a downloadable calculator.

Read post→

Feb 10, 2026Founders10 min read

AI MVP Checklist: 30 Things to Decide Before You Code

The 30-item checklist I run with every AI MVP client before writing a line of code. Eval strategy, cost guardrails, fallback model, data policy.

Read post→

Feb 5, 2026Kosovo & Eastern Europe10 min read

Kosovo's Tech Scene in 2026: A Founder's Field Guide

A ground-level tour of Kosovo's tech ecosystem - hubs, talent pools, salary bands, key companies, and how Western founders should engage.

Read post→

Feb 1, 2026Kosovo & Eastern Europe10 min read

Where to Find AI Engineering Talent in Eastern Europe

The Eastern European AI talent map: Poland, Ukraine, Romania, Kosovo, Serbia. Where senior LLM engineers live and what they cost in 2026.

Read post→

Jan 28, 2026Build Guides11 min read

AI Sales Automation Beyond Send More Emails (2026)

Build AI sales automation that scores intent, drafts context-aware outreach, and routes to humans - not another mass-email blaster.

Read post→

Jan 24, 2026Tutorials10 min read

Stream OpenAI Responses in Next.js 15 (2026 Tutorial)

A clean 2026 tutorial for streaming OpenAI responses in Next.js App Router using Server Actions and the Vercel AI SDK. With tool calls.

Read post→

Jan 20, 2026Tutorials10 min read

Vercel AI SDK Tool Calling: A Real-World Tutorial

How to design, ship, and debug tool calls with the Vercel AI SDK. Includes parallel tools, streaming UI, error handling, and Zod validation.

Read post→

Jan 16, 2026Build Guides12 min read

Build an Internal AI Knowledge Base Your Team Will Use

Build an internal AI knowledge assistant from Notion, Slack, Drive, and Linear. Permissions-aware, freshness-aware, citation-first.

Read post→

Jan 12, 2026Founders9 min read

No-Code vs Custom MVP: A Founder's Decision Guide

When Bubble or Webflow beats a custom build, and when it doesn't. Real cost, speed, and ceiling tradeoffs for early-stage founders.

Read post→

Jan 8, 2026Hiring10 min read

Freelance AI Developer vs Agency: Honest Comparison

When a freelance AI developer beats an agency, and when it doesn't. Cost, speed, quality, risk - broken down for founders.

Read post→

Jan 4, 2026Build Guides10 min read

Build an AI Code Review Bot with GitHub Actions

A pragmatic guide to building or installing an AI code reviewer for your PRs. CodeRabbit, PR-Agent, or your own GitHub Action - all compared.

Read post→

Notes on shippingAI in production.

EU AI Act for Startups: What to Build Before August 2026

AI in Albanian (Shqip): What Works in 2026 and What Doesn't

AI Automation for Real Estate: 7 Workflows That Pay Off

AI Receptionist for Small Business: Build vs Buy (2026)

Claude AI in Kosovo: How to Access and Build With It (2026)

AI Chatbot for Website: Build, Buy, or Both? (2026)

Build MVP for Startup: Real Cost in 2026 (Solo Dev)

RAG Architecture Tutorial: Production System in 2026

AI Scheduling Assistant: 9 Tools Tested in 2026

Human in the Loop AI: Patterns That Ship in 2026

Build an AI Voice Agent with Twilio and ElevenLabs (2026)

Vector Database Comparison 2026: Pinecone, Qdrant, pgvector

Build an AI Meeting Notes Tool (Your Own Otter)

Outsource Software Development to Kosovo: 2026 Guide

LangChain vs Vercel AI SDK: Which to Pick in 2026

How to Hire an AI Developer in 2026 (Founder's Guide)

n8n vs Zapier vs Make for AI Automation (2026)

The SaaS MVP Tech Stack I Use in 2026 (Ship in a Week)

Cost to Build an AI Chatbot in 2026: Honest Numbers

AI Email Automation: Build Your Own Triage Agent (2026)

OpenAI API Cost in 2026: Real Numbers from Production

pgvector vs Pinecone: When to Switch (and When Not To)

Build an AI Customer Support Bot That Doesn't Hallucinate

Claude Code vs Cursor: Daily-Driver Test (2026)

Claude vs ChatGPT for Developers (2026): Honest Take

Fine-Tuning vs RAG: A 2026 Decision Framework

Build an AI Lead Generation Tool (Not Another Scraper)

Agentic RAG: Architecture Patterns That Ship in 2026

OpenAI Structured Outputs: Strict JSON Schema in 2026

Prompt Injection Defense: 8 Patterns That Work in 2026

AI Document Extraction: From PDF Chaos to Clean JSON

LLM Eval Framework: DeepEval vs Braintrust vs RAGAS

LLM Observability: Langfuse vs LangSmith vs Helicone

Build a Production MCP Server in TypeScript (2026)

Build an AI Resume Screener That HR Will Trust

AI Content Moderation: OpenAI Moderation API in Production

LLM Tool Calling Best Practices for Production Agents

AI Workflow vs AI Agent: When to Pick Which

Supabase vs Firebase for AI Apps in 2026

Embedding Models 2026: OpenAI vs Cohere vs Voyage vs BGE

Cost to Hire an AI Developer in 2026 (US vs EU vs Balkans)

Hiring Developers in the Balkans: The 2026 Honest Guide

AI Agent Design Patterns: Reflection, Planning, Tool Use

AI SaaS Architecture: Patterns from 5 Shipped Products

RAG Cost Per Query: The Full Breakdown (2026)

AI MVP Checklist: 30 Things to Decide Before You Code

Kosovo's Tech Scene in 2026: A Founder's Field Guide

Where to Find AI Engineering Talent in Eastern Europe

AI Sales Automation Beyond Send More Emails (2026)

Stream OpenAI Responses in Next.js 15 (2026 Tutorial)

Vercel AI SDK Tool Calling: A Real-World Tutorial

Build an Internal AI Knowledge Base Your Team Will Use

No-Code vs Custom MVP: A Founder's Decision Guide

Freelance AI Developer vs Agency: Honest Comparison

Build an AI Code Review Bot with GitHub Actions

Notes on shipping
AI in production.