The network for creativity

Join 1.25M professional creatives like you

Connect with clients, get discovered, and run your business 100% commission-free

Creatives on Contra have earned over $150M and we are just getting started

Paul G

• Feb 2

I built a production-ready serverless LLM API on GCP designed for low cost, strong security, and fast inference. Requests flow through CDN, load balancing, WAF, and API management before hitting a Cloud Run FastAPI service that handles prompts, session history, caching, and model routing. The system switches between Gemini 2.5 Pro for deep reasoning and Gemini Flash for fast responses, with RAG support using Vector Search over 768-dim embeddings. Data is stored in Firestore, cached in Redis, and logged to BigQuery. Everything is secured with VPC Service Controls, Workload Identity, KMS, Secret Manager, and DLP. CI/CD is fully automated with Terraform and Cloud Build using canary rollouts and auto-rollback on SLO violations. At around 50K requests per day, the platform runs at about $1K/month and scales to zero when idle.
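A minimal sketch of the routing and caching idea, under stated assumptions: the model labels, the keyword heuristic, the 400-character threshold, and the in-memory dict (standing in for Redis) are all hypothetical, and the model call is injected so that no real Gemini API signature is assumed.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Tuple

# Illustrative labels only, not verified API model IDs.
DEEP_MODEL = "gemini-2.5-pro"
FAST_MODEL = "gemini-flash"

# Hypothetical hints that a prompt needs multi-step reasoning.
REASONING_HINTS = ("prove", "step by step", "analyze", "compare", "debug")


def pick_model(prompt: str, max_fast_chars: int = 400) -> str:
    """Send long or reasoning-heavy prompts to the deep model, the rest to the fast one."""
    text = prompt.lower()
    if len(prompt) > max_fast_chars or any(hint in text for hint in REASONING_HINTS):
        return DEEP_MODEL
    return FAST_MODEL


@dataclass
class CachedRouter:
    """Caches answers by (model, prompt) so repeated prompts skip the model call."""

    call_model: Callable[[str, str], str]  # injected: (model, prompt) -> answer
    cache: Dict[Tuple[str, str], str] = field(default_factory=dict)  # stand-in for Redis

    def answer(self, prompt: str) -> str:
        model = pick_model(prompt)
        key = (model, prompt)
        if key not in self.cache:
            self.cache[key] = self.call_model(model, prompt)
        return self.cache[key]
```

In a real service the routing signal would more likely come from request metadata or a classifier than from keywords, and cache entries would carry a TTL.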

Cloud Security Consulting Corel Vector Golang Google Gemini Cloud Infrastructure

Hamza Nafasat

• Jul 3

Echo is a multi-tenant SaaS for AI customer support with realtime chat, an AI voice agent, and AI automation. I was the primary developer on the full build: frontend, backend, and the AI layer.

The AI is the core. It runs OpenAI, Claude, Gemini, and Grok through one multi-model setup, so a client can switch providers without a rewrite. A RAG pipeline connected to a vector database grounds every answer in the client's own content, so the chatbot answers from that content instead of returning generic output. VAPI powers the voice agent, so customers can speak to support on a live call. Each tenant gets its own AI agent built on its own documents.

The stack is Next.js 15 and React 19 inside a Turborepo monorepo, with separate apps for the dashboard, the embeddable chat widget, and backend services. Realtime chat runs on Convex. Clerk handles auth. API keys are encrypted per tenant through AWS Secrets Manager, so no two clients share credentials or data.

Launch day held 60 live conversations at once with zero dropped sessions.

What the client gets: an AI chatbot and voice agent that answer from their own content, work across multiple LLM providers, and stay isolated and secure per tenant.
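One way the "switch providers without a rewrite" idea could look, as a rough sketch: every provider sits behind the same call shape, and each tenant's choice is just configuration. The `ModelGateway` class and its method names are hypothetical, and the providers here are plain functions rather than real SDK clients.

```python
from typing import Callable, Dict

# A provider is any function that takes a prompt and returns text.
Provider = Callable[[str], str]


class ModelGateway:
    """Maps each tenant to one registered provider behind a single interface."""

    def __init__(self) -> None:
        self._providers: Dict[str, Provider] = {}
        self._tenant_choice: Dict[str, str] = {}

    def register(self, name: str, provider: Provider) -> None:
        self._providers[name] = provider

    def set_provider(self, tenant_id: str, name: str) -> None:
        # Reject unknown names early so a typo fails at config time.
        if name not in self._providers:
            raise KeyError(f"unknown provider: {name}")
        self._tenant_choice[tenant_id] = name

    def complete(self, tenant_id: str, prompt: str) -> str:
        # Callers never name a provider; switching is a config change.
        name = self._tenant_choice[tenant_id]
        return self._providers[name](prompt)
```

Because calling code only ever uses `complete`, moving a tenant from one provider to another is a single `set_provider` call.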

Backend Development AI Development Full Stack Development Next.js OpenAI Vapi

Ethan Okawa

pro

• Jul 6

Perfect

Hamza Nafasat

• Jul 3

Form Flow is a B2B SaaS for enterprise and financial teams where AI handles the full form logic. I built the AI layer that turns plain English into working software behavior.

The rule engine sends an admin's plain-English input to the OpenAI API, which generates executable JavaScript handlers from the description. Those handlers run inside a vm2 sandbox on Node.js, so AI-generated code executes without touching the rest of the app. It works like a locked box: code runs inside, nothing escapes. Rules chain in order and trigger status changes, alerts, and display actions. Getting output consistent enough to run in production without malformed handlers took real iteration. That is the careful side of AI integration.
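The chaining behaviour can be sketched as follows. This is an illustration in Python, not the JavaScript handlers described above, and it leaves out code generation and sandboxing entirely. The `FormState` fields and both example rules are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class FormState:
    values: Dict[str, object]
    status: str = "draft"
    alerts: List[str] = field(default_factory=list)
    hidden_fields: List[str] = field(default_factory=list)


# A rule inspects the state and mutates it in place.
Rule = Callable[[FormState], None]


def flag_large_amount(state: FormState) -> None:
    # Status change plus alert.
    if state.values.get("amount", 0) > 10_000:
        state.status = "needs_review"
        state.alerts.append("Amount exceeds 10,000")


def hide_ssn_for_businesses(state: FormState) -> None:
    # Display action.
    if state.values.get("applicant_type") == "business":
        state.hidden_fields.append("ssn")


def run_rules(state: FormState, rules: List[Rule]) -> FormState:
    """Apply rules in order; later rules see changes made by earlier ones."""
    for rule in rules:
        rule(state)
    return state
```

Running the rules in a fixed order keeps the outcome deterministic when two rules touch the same field or status.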

A RAG layer reads the surrounding form structure and the exact field a user is on, then sends a prompt scoped to that field, so the AI assistant returns an answer that fits the question instead of something generic. On financial intake forms, this cut abandonment with no human agent involved.
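A small sketch of what scoping a prompt to the active field might involve. The form layout (a `title` plus a list of `fields` with `id` and `label`), the one-field-either-side window, and the prompt wording are all assumptions made for illustration.

```python
from typing import Dict, List


def build_field_prompt(form: Dict, field_id: str, question: str) -> str:
    """Build a prompt limited to the active field and its immediate neighbours."""
    fields: List[Dict] = form["fields"]
    # Raises StopIteration if field_id is not in the form.
    index = next(i for i, f in enumerate(fields) if f["id"] == field_id)
    current = fields[index]
    # Include one field on either side as surrounding context.
    nearby = fields[max(0, index - 1): index + 2]
    context = "\n".join(f"- {f['label']}" for f in nearby)
    return (
        f"Form: {form['title']}\n"
        f"Nearby fields:\n{context}\n"
        f"The user is on the field '{current['label']}'.\n"
        f"Answer only about this field: {question}"
    )
```

Keeping the context window narrow is what stops the assistant from answering about the form in general.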

Branding automation pulls colors, logos, and typography from any URL for white-label deployment. Identity verification runs through IDmission with live face matching for compliance.

What the client gets: non-technical admins define complex form logic in plain English, an AI assistant guides users field by field, and AI-generated code runs safely inside a sandbox.

Backend Development AI Engineer Full Stack Development Google Gemini Next.js Node.js

Ethan Okawa

pro

• Jul 6

Perfect

Hamza Nafasat

• Jul 3

Meet AI is a video conferencing SaaS where AI agents join live meetings as active participants and respond in real time. I built the full application stack.

The hard part is the real-time layer. The backend provisions an AI agent the moment a meeting starts and keeps it running for the full call. Speech runs through a transcript pipeline, and the backend generates responses through the OpenAI API in near real time, so the agent replies while the conversation is still moving, not minutes later. After the meeting, the backend writes a structured summary through the same API, handled async by Inngest so nothing blocks the live app.
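The split between live replies and a non-blocking summary can be sketched with `asyncio`. Here a background task stands in for the Inngest job, the model call is injected rather than a real OpenAI client, and the `MeetingAgent` class is hypothetical.

```python
import asyncio
from typing import Any, Callable, Coroutine, List

# Injected async model call: prompt -> reply.
LLM = Callable[[str], Coroutine[Any, Any, str]]


class MeetingAgent:
    def __init__(self, llm: LLM) -> None:
        self.llm = llm
        self.transcript: List[str] = []

    async def on_utterance(self, speaker: str, text: str) -> str:
        """Live path: append to the transcript and reply right away."""
        self.transcript.append(f"{speaker}: {text}")
        reply = await self.llm("\n".join(self.transcript))
        self.transcript.append(f"agent: {reply}")
        return reply

    def end_meeting(self) -> "asyncio.Task[str]":
        """Post-meeting path: schedule the summary without awaiting it here."""
        prompt = "Summarize this meeting:\n" + "\n".join(self.transcript)
        return asyncio.create_task(self.llm(prompt))
```

`end_meeting` must be called from inside a running event loop. The caller gets a task back and can return to the user immediately, awaiting or storing the result later.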

The stack is Next.js 15 and React 19 with the Stream Video and Stream Chat SDKs, tRPC and Drizzle ORM on Neon PostgreSQL, and Better Auth across the stack.

The platform runs 30 simultaneous live sessions, each with its own active AI agent, with no backend slowdown.

What the client gets: live AI agents that take part in real meetings, plus automatic summaries, on a stack built to hold many sessions at once.

AI Agent Engineer AI Engineer Full Stack Development Next.js Node.js OpenAI

Ethan Okawa

pro

• Jul 6

Perfect
