- Golden datasets + regression tests (offline).
- Online metrics: acceptance rate, correction rate, cost/req, p95 latency; traces in Langfuse/OpenTelemetry.
Deployment & SDKs
- Infra: Vercel/Cloud Run/Kubernetes; IaC (Terraform),...
- SDKs: Flutter/React Native, Swift/Kotlin, JS/TS (Next.js), Python; REST/gRPC endpoints or any SDK you prefer to work on.
- Runbooks + dashboards; feature flags for gradual rollout.
Security & Compliance (On negotiation!)
- SSO (OAuth/SAML), per-tenant encryption, Row-Level Security where applicable.
- Redaction on ingest + response filters; audit trails for tool calls.
- Cost ceilings and rate limits per user/tenant.
Measurable Outcomes (On negotiation!)
• ≥X% deflection of support tickets via AI replies.
• ≤p95 latency budget for search/answer.
• ≥Y% accuracy on golden evals; continuous regression gating in CI.
Packaging (client-friendly) + End to End product
- Deliverables: Architecture doc, infra code, MCP servers, AI gateway, pipelines, SDKs, dashboards, eval suites, runbooks.
- Handover: Admin panel (+ feature flags), cost dashboards, playbooks for new use-cases.
- Golden datasets + regression tests (offline).
- Online metrics: acceptance rate, correction rate, cost/req, p95 latency; traces in Langfuse/OpenTelemetry.
Deployment & SDKs
- Infra: Vercel/Cloud Run/Kubernetes; IaC (Terraform),...
- SDKs: Flutter/React Native, Swift/Kotlin, JS/TS (Next.js), Python; REST/gRPC endpoints or any SDK you prefer to work on.
- Runbooks + dashboards; feature flags for gradual rollout.
Security & Compliance (On negotiation!)
- SSO (OAuth/SAML), per-tenant encryption, Row-Level Security where applicable.
- Redaction on ingest + response filters; audit trails for tool calls.
- Cost ceilings and rate limits per user/tenant.
Measurable Outcomes (On negotiation!)
• ≥X% deflection of support tickets via AI replies.
• ≤p95 latency budget for search/answer.
• ≥Y% accuracy on golden evals; continuous regression gating in CI.
Packaging (client-friendly) + End to End product
- Deliverables: Architecture doc, infra code, MCP servers, AI gateway, pipelines, SDKs, dashboards, eval suites, runbooks.
- Handover: Admin panel (+ feature flags), cost dashboards, playbooks for new use-cases.