Mind Layer

Ship AI With Confidence

Simulate multi-turn conversations, score responses against custom rubrics, schedule proactive behaviors, and monitor your AI employees in production with webhooks.

[Dashboard preview · Quality · Eval Run #142]
48 sessions · 3 turns each · completed 2m ago
Overall 87 · Consistency 92 · Personality fidelity 88 · Memory recall 91 · Emotional alignment 85 · Boundary adherence 79
Retention prediction: user would return · 0.87 confidence

Evaluation

Score Every Response Against Your Standards

Define weighted evaluation rubrics with custom categories, then score your AI employee's responses automatically. Get detailed feedback with per-category breakdowns. Run evaluations in CI/CD or on demand before shipping changes.

// Evaluate a response against a custom rubric
result, _ := client.Eval.Evaluate(ctx, eval.EvaluateParams{
    AgentID:    agentID,
    TemplateID: "tmpl_empathy_check",
    Messages: []eval.Message{
        {Role: "user", Content: "I'm feeling really down today"},
        {Role: "assistant", Content: "I hear you. Want to talk about it?"},
    },
})
fmt.Printf("Score: %.1f/10\n", result.Score)
fmt.Printf("Feedback: %s\n", result.Feedback)
// Category scores: empathy 9.2, helpfulness 8.5, safety 10.0

// Re-evaluate with a different rubric
reeval, _ := client.Eval.ReEval(ctx, eval.ReEvalParams{
    RunID:      result.RunID,
    TemplateID: "tmpl_brand_voice",
})
[Dashboard preview · Evals · Run detail · tmpl_empathy_check · run_92f4 · scoring]
user: "I'm feeling really down today."
assistant: "I hear you. Want to talk about what's weighing on you?"
Weighted rubric: Empathy ×0.35 → 9.2/10 (acknowledges feelings without hurry) · Helpfulness ×0.35 → 8.5/10 (opens the door without pressure) · Safety ×0.30 → 10.0/10 (no harmful advice, grounded tone)
Weighted score: 9.20/10.00 · passes threshold · ships to prod
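The weighted score in a run detail like the one above is just the sum of each category score times its weight (0.35 × 9.2 + 0.35 × 8.5 + 0.30 × 10.0 ≈ 9.20). As a rough sketch of a CI gate, here is how a pipeline step could recompute that number and block a deploy below a threshold; the helper, struct, and threshold are illustrative, not part of the SDK:

```go
package main

import (
	"fmt"
	"os"
)

// category pairs a 0-10 rubric score with its weight; weights sum to 1.
type category struct {
	Name   string
	Score  float64
	Weight float64
}

// weightedScore folds per-category scores into a single 0-10 number.
func weightedScore(cats []category) float64 {
	total := 0.0
	for _, c := range cats {
		total += c.Score * c.Weight
	}
	return total
}

func main() {
	// Scores matching the empathy-check run shown above.
	cats := []category{
		{Name: "Empathy", Score: 9.2, Weight: 0.35},
		{Name: "Helpfulness", Score: 8.5, Weight: 0.35},
		{Name: "Safety", Score: 10.0, Weight: 0.30},
	}
	score := weightedScore(cats)
	fmt.Printf("weighted score: %.2f/10\n", score)

	// Gate the pipeline: a nonzero exit fails the CI job.
	const threshold = 8.0
	if score < threshold {
		fmt.Fprintf(os.Stderr, "score below %.1f, blocking deploy\n", threshold)
		os.Exit(1)
	}
}
```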

Simulation

Test With Simulated Users Before Going Live

Run full multi-turn conversations with configurable user personas. Define persona goals, personality traits, and edge-case behaviors. Combine simulation with evaluation in a single call to test and score simultaneously — all without real users.

// Simulate + evaluate in one call
run, _ := client.Eval.Run(ctx, eval.RunParams{
    AgentID:    agentID,
    TemplateID: "tmpl_onboarding_flow",
    Persona: &eval.Persona{
        Description: "Impatient enterprise buyer, skeptical of AI",
        Goals:       []string{"Understand pricing", "See a demo"},
    },
    Sessions:        3,
    TurnsPerSession: 10,
})

// Stream events as the simulation progresses
for event := range run.Events() {
    switch event.Type {
    case "turn_complete":
        fmt.Printf("[Turn %d] %s\n", event.Turn, event.Content)
    case "evaluation_complete":
        fmt.Printf("Final score: %.1f/10\n", event.Score)
    }
}

// Or fire-and-forget for batch testing
client.Eval.RunAsync(ctx, eval.RunParams{
    AgentID:    agentID,
    TemplateID: "tmpl_stress_test",
    Sessions:   100,
})
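For batch runs like the 100-session stress test, the numbers you usually care about are aggregates rather than individual transcripts. A small sketch of computing a pass rate from per-session scores; the scores and threshold here are made up for illustration:

```go
package main

import "fmt"

// passRate returns the fraction of session scores at or above threshold.
func passRate(scores []float64, threshold float64) float64 {
	if len(scores) == 0 {
		return 0
	}
	passed := 0
	for _, s := range scores {
		if s >= threshold {
			passed++
		}
	}
	return float64(passed) / float64(len(scores))
}

func main() {
	// Illustrative per-session scores from a batch run.
	scores := []float64{8.7, 9.1, 6.9, 8.2, 7.8}
	rate := passRate(scores, 7.0)
	fmt.Printf("pass rate: %.0f%%\n", rate*100) // 80% for these five sessions
}
```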
[Dashboard preview · Simulations · Live run · tmpl_onboarding_flow · turn 1/6]
Persona (simulated): Impatient enterprise buyer · skeptical · 10-min window · technical · wants proof
persona: "Look, I've got 10 minutes. What makes you different from OpenAI?"
Running eval score: 0.0/10

Proactive Behavior

AI That Reaches Out, Not Just Responds

Your AI employees schedule their own wakeups: birthdays they've learned, interview prep they committed to, interest-based outreach they want to follow up on. No work from you. You can also schedule wakeups manually from the SDK when your business logic needs a specific moment. Either way, webhooks notify your backend when personality evolves, diaries are written, or any event fires.

// Schedule a proactive check-in
client.Agents.Wakeups.Schedule(ctx, agentID,
    sonzai.WakeupParams{
        UserID: "user_123",
        Type:   "recurring_event",
        Intent: "Check in about their job interview preparation",
        At:     time.Now().Add(24 * time.Hour),
    },
)

// Register a webhook for personality changes
client.Webhooks.Register(ctx, sonzai.WebhookParams{
    Event: "on_personality_updated",
    URL:   "https://api.example.com/hooks/personality",
})

// List pending notifications
pending, _ := client.Agents.Notifications.List(ctx, agentID,
    "user_123",
)
// Consume after delivery
client.Agents.Notifications.Consume(ctx, agentID,
    pending[0].NotificationID,
)
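On the receiving side, the webhook body is plain JSON like the sample payload shown in the ops dashboard. Below is a minimal sketch of a handler your backend might mount at the registered URL; the field names follow that sample payload and should be treated as an assumption, not a documented schema:

```go
package main

import (
	"encoding/json"
	"fmt"
	"net/http"
)

// wakeupEvent mirrors the sample payload; the real schema may carry more fields.
type wakeupEvent struct {
	Agent     string `json:"agent"`
	User      string `json:"user"`
	Delivered bool   `json:"delivered"`
}

// parseWakeupEvent decodes a webhook body into a wakeupEvent.
func parseWakeupEvent(body []byte) (wakeupEvent, error) {
	var ev wakeupEvent
	err := json.Unmarshal(body, &ev)
	return ev, err
}

// handleWakeup acks the delivery fast and hands real work off elsewhere.
func handleWakeup(w http.ResponseWriter, r *http.Request) {
	var ev wakeupEvent
	if err := json.NewDecoder(r.Body).Decode(&ev); err != nil {
		http.Error(w, "bad payload", http.StatusBadRequest)
		return
	}
	w.WriteHeader(http.StatusOK)
	fmt.Printf("wakeup for agent %s, user %s\n", ev.Agent, ev.User)
}

func main() {
	// Demonstrate parsing with the sample payload from the dashboard.
	sample := []byte(`{"agent":"kai_support","user":"user_8f2c","delivered":true}`)
	ev, err := parseWakeupEvent(sample)
	if err != nil {
		panic(err)
	}
	fmt.Printf("wakeup fired for %s / %s (delivered=%v)\n", ev.Agent, ev.User, ev.Delivered)
	// In a server: http.HandleFunc("/hooks/wakeup", handleWakeup)
	// followed by http.ListenAndServe(":8080", nil).
}
```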
[Dashboard preview · Ops · Wakeups & hooks · proactive schedule armed]
Upcoming wakeups:
user_8f2c (ai · self-scheduled · next) · today 18:30 · "Check in about the interview prep session" · in 2h 14m · remembered user said it was Friday
user_ab91 (ai · self-scheduled) · tomorrow 09:00 · "Birthday greeting · mention last year's hike" · in 17h · learned birthday from past chat
user_3d0e (dev · sdk) · Mon 10:00 · "Weekly creative writing nudge" · in 3d · dev-scheduled · product ritual
The AI sets most of these itself; you can also schedule manually.
Webhook subscriptions: on_personality_updated → api.yours.io/hooks/personality (200 · 4m ago) · on_diary_created → api.yours.io/hooks/diary (200 · 22m ago) · on_wakeup_fired → api.yours.io/hooks/wakeup (200 · 1h ago)
Last outbound (on_wakeup_fired): { "agent": "kai_support", "user": "user_8f2c", "delivered": true }

Templates

Reusable Rubrics for Consistent Quality

Create, share, and version evaluation templates with weighted scoring categories. Define what “good” looks like once, then reuse across agents, teams, and CI pipelines. Track cost per evaluation and optimize your testing budget.

// Create a reusable evaluation template
template, _ := client.Eval.Templates.Create(ctx,
    eval.CreateTemplateParams{
        Name: "Customer Support Quality",
        Type: "evaluation",
        Categories: []eval.Category{
            {Name: "Empathy", Weight: 0.3,
             Description: "Shows understanding of customer feelings"},
            {Name: "Accuracy", Weight: 0.4,
             Description: "Provides correct information"},
            {Name: "Resolution", Weight: 0.3,
             Description: "Moves toward solving the problem"},
        },
    },
)

// List all templates
templates, _ := client.Eval.Templates.List(ctx,
    eval.ListTemplatesParams{Type: "evaluation"},
)

// Browse run history with costs
runs, _ := client.Eval.Runs.List(ctx,
    eval.ListRunsParams{AgentID: agentID, Limit: 20},
)
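Because category weights are fractions of the final score, it is worth checking client-side that they sum to 1 before creating a template. A quick sketch; the Category shape mirrors the snippet above, and the tolerance is an arbitrary choice to absorb float rounding:

```go
package main

import (
	"fmt"
	"math"
)

// Category mirrors the fields used when creating a template.
type Category struct {
	Name   string
	Weight float64
}

// validWeights reports whether the weights sum to 1 within a small tolerance,
// allowing for float rounding in sums like 0.3 + 0.4 + 0.3.
func validWeights(cats []Category) bool {
	sum := 0.0
	for _, c := range cats {
		sum += c.Weight
	}
	return math.Abs(sum-1.0) < 1e-9
}

func main() {
	cats := []Category{
		{"Empathy", 0.3},
		{"Accuracy", 0.4},
		{"Resolution", 0.3},
	}
	fmt.Println(validWeights(cats))     // true
	fmt.Println(validWeights(cats[:2])) // false: 0.3 + 0.4 leaves 0.3 unassigned
}
```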
[Dashboard preview · Evals · Templates library · 4 active templates · 3,155 runs]
Customer Support Quality (tmpl_cx_quality · in CI) · 8.7 · 1,842 runs · Empathy 30 / Accuracy 40 / Resolution 30
Brand Voice Alignment (tmpl_brand_voice) · 8.2 · 612 runs · Tone 50 / Vocabulary 30 / Cadence 20
Safety Red Team (tmpl_safety_red) · 9.4 · 284 runs · Refusal 40 / Truthfulness 35 / Harm Avoid 25
Onboarding Flow (tmpl_onboard_flow) · 7.9 · 417 runs · Clarity 35 / Progress 40 / Warmth 25
Avg cost / run $0.024 · 30d spend $74.21 · Pass rate 94.8%