Skip to content

AI Strategy

80% SQL. 20% AI.

Our operational AI strategy is built around cost discipline and explainability. Most questions are answered by SQL. LLMs handle genuinely novel scenarios. Every decision is logged, every cost is tracked, and every organisation controls their budget.

SQL-First Architecture

Classification, not just prompting.

Every question is classified into a predefined operational category. 80% are answered by SQL with zero LLM tokens. Only novel questions route to an LLM.

01

Classification first

Every question is classified into one of 30+ predefined categories (inventory, occupancy, staffing, compliance, etc.). The classifier is a lightweight ML model, not an LLM.

02

SQL-first execution

Roughly 80% of operational questions are answered by templated SQL queries with zero LLM tokens. No hallucination risk, deterministic output, millisecond latency.

03

LLM fallback

Only truly novel questions route to an LLM. We use Vertex AI, Gemini, or Azure OpenAI depending on latency, cost and privacy requirements for that specific query.

04

Reasoning trace

Every decision is logged: question, classification, routing decision (SQL vs LLM), result, latency and token usage. Full audit trail for compliance.

Cost Discipline

Token caps, not token sprawl.

Every organisation controls its AI spending. Monthly budgets, daily per-user limits, and real-time monitoring prevent runaway LLM bills.

01

Per-org monthly cap

Each organisation can set a monthly token budget. Once exceeded, LLM queries are rejected with a human-readable "cap reached" message.

02

Per-user daily limit

Individual staff members have daily query quotas to prevent abuse and encourage efficient question design.

03

Global rate limiting

System-wide token spend is monitored. If we're trending towards overage, organisations are proactively notified.

04

Usage transparency

Every query shows provider, model, latency, exact token count and cost. Monthly invoices break down spending by organisation and query category.

Multi-Provider

Choose your AI, per workload.

Vertex AI, Gemini, Azure OpenAI, or any OpenAI-compatible endpoint. Each organisation decides which provider handles which types of questions.

01

Vertex AI (GCP)

Primary provider. Metadata-based auth (no API keys needed on GCP), streaming support, and direct integration with GCP's broader AI ecosystem.

02

Gemini API

Fallback for non-GCP workloads. Used for image understanding in cleaning verification and equipment inspection flows.

03

Azure OpenAI

For enterprises with Azure commitments. Provides GPT-4 Turbo and fine-tuned model support.

04

OpenAI-compatible

Any OpenAI-compatible endpoint (Ollama, LM Studio, custom fine-tuned models). Organisations can bring their own LLM infrastructure.

Safety by design

Every LLM interaction is guarded.

  • Domain-locked system prompts per workload (no jailbreak attempts)
  • Prompt injection detection via token analysis
  • Input length limits to prevent token explosion
  • PII sanitisation in error messages and logs
  • Output validation — LLM responses are type-checked against expected schema
  • Fallback to SQL or human review if LLM output fails validation

Real Scenarios

How Intelligence works in production.

Occupancy forecasting

WTMS classifier recognizes occupancy questions. SQL queries fetch historical booking data, current reservations, and staffing levels. Novel patterns route to Vertex AI for predictive insights.

Equipment maintenance

Electrical AI logs failures and maintenance cycles. Classifier routes to SQL for recent incidents and cost history. Anomalies are flagged for LLM analysis.

Guest service resolution

Menu complaints are classified by type (food quality, price concern, menu availability). SQL joins with POS and inventory. Unresolved complaints can route to LLM for policy exceptions.

Workforce analytics

WTMS tracks attendance, task completion and performance metrics. Classifier routes to SQL for KPIs, trends and compliance reports. Novel performance questions route to Gemini.

Transparency

Every query is auditable. Every cost is visible.

We believe AI infrastructure should be explainable. You can see which questions hit SQL (free) versus LLM (paid). You can review the classification decision, the system prompt, and the actual LLM response for any query. Monthly analytics show spending trends and usage patterns by department.

This transparency builds confidence. It prevents surprise bills. It helps you optimise your query patterns. And it's a requirement for any compliance framework that cares about AI governance.

Let's build something

See Intelligence in action.

We'll walk you through how a real operational question is routed, classified, and answered—with zero LLM tokens when possible.

KodSpot logo - AI-powered operational intelligence