Confidence scores indicate how certain your agent is about each response. They're calculated from how well the user's question matches your knowledge base content, helping you spot where your knowledge base may have gaps.
How Confidence Works
When a user asks a question:
- Vector search — The question is compared against your knowledge base
- Match scoring — Each document gets a similarity score (0-100%)
- Confidence calculation — The highest match score becomes the confidence
- Response generation — The AI uses matched documents to answer
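The scoring steps above can be sketched in a few lines. This is an illustrative sketch only, not the platform's actual algorithm: it assumes cosine similarity over embedding vectors, and the function names are hypothetical.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def confidence_score(question_vec, doc_vecs):
    """The highest document match score, scaled to 0-100."""
    best = max(cosine_similarity(question_vec, d) for d in doc_vecs)
    return round(best * 100)
```

For example, a question vector that exactly matches one document scores 100, while a partially aligned one scores proportionally lower.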
Score Ranges
| Score | Level | Meaning |
|---|---|---|
| 80-100% | High | Strong match to knowledge base |
| 60-79% | Medium | Partial match, may need more context |
| 40-59% | Low | Weak match, answer may be general |
| 0-39% | Very Low | No good match, relying on general knowledge |
| null | No Match | No knowledge base search performed |
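The score-to-level mapping in the table translates directly into code. A minimal sketch (the function name is hypothetical; the thresholds are the table's):

```python
def confidence_level(score):
    """Map a 0-100 confidence score (or None) to its level name."""
    if score is None:
        return "No Match"   # no knowledge base search performed
    if score >= 80:
        return "High"
    if score >= 60:
        return "Medium"
    if score >= 40:
        return "Low"
    return "Very Low"
```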
Viewing Confidence Scores
In Conversations
- Go to Conversations
- Open a conversation
- Each assistant message shows its confidence score
Messages with low confidence are highlighted so you can easily spot potential issues.
In Analytics
- Go to Analytics
- View the Confidence section:
- Average confidence over time
- Distribution (how many responses at each level)
- Trend (improving or declining)
On Agent Cards
The main agents list shows average confidence for each agent, giving you a quick health check.
Interpreting Scores
High Confidence (80-100%)
✅ What it means: Your knowledge base contains relevant information for this question.
Example:
User: “What are your business hours?”
Agent: “We’re open Monday to Friday, 9 AM to 6 PM.” (95% confidence)
The agent found a direct match in your documentation.
Medium Confidence (60-79%)
⚠️ What it means: The agent found related content but may be extrapolating.
Example:
User: “Can I return an item after 60 days?”
Agent: “Our return policy is 30 days from purchase.” (68% confidence)
The agent found return policy info but the specific 60-day scenario wasn’t documented.
Low Confidence (40-59%)
🟡 What it means: Limited relevant content found. The answer may be based on general knowledge.
Example:
User: “Do you offer enterprise pricing?”
Agent: “Please contact our sales team for enterprise options.” (45% confidence)
Consider adding enterprise pricing documentation.
Very Low Confidence (0-39%)
🔴 What it means: The agent is essentially guessing or using fallback responses.
Example:
User: “What’s your CEO’s favorite color?”
Agent: “I don’t have that information.” (12% confidence)
This is expected for off-topic questions.
Using Confidence for Automation
Email Channel
For the email channel, confidence scores drive automation:
| Confidence | Action |
|---|---|
| Above threshold | Auto-reply to customer |
| Below threshold | Forward to human for review |
Configure the threshold in Channels → Email → Settings.
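The routing rule amounts to a single comparison. A sketch of the decision, assuming a strict "above threshold" boundary as the table states (the function and return values are hypothetical, not part of the platform's API):

```python
def route_email(confidence, threshold=70):
    """Auto-reply strictly above the threshold; otherwise escalate to a human."""
    return "auto_reply" if confidence > threshold else "human_review"
```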
Recommended Thresholds
| Use Case | Threshold | Why |
|---|---|---|
| Customer support | 70% | Balance automation with accuracy |
| Sales inquiries | 80% | Higher stakes, be more careful |
| Internal tools | 60% | Lower risk, more automation |
| Documentation | 65% | Users expect accuracy |
Improving Confidence
Add Missing Content
- Filter conversations by low confidence
- Identify common low-confidence questions
- Add relevant documentation to knowledge base
Review Low-Confidence Responses
Weekly routine:
- Go to Conversations
- Sort by confidence (low to high)
- Review responses
- Add missing info to knowledge base
Track Trends
In Analytics, monitor:
- Is average confidence increasing over time?
- Which topics have lowest confidence?
- Are there sudden drops after knowledge base changes?
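Sudden drops are easy to flag programmatically once you export daily averages. A small sketch, assuming you have a list of daily average scores (the tolerance of 10 points is an arbitrary example, not a product default):

```python
def sudden_drop(daily_averages, tolerance=10):
    """Flag any day-over-day drop in average confidence larger than tolerance points."""
    return any(
        prev - cur > tolerance
        for prev, cur in zip(daily_averages, daily_averages[1:])
    )
```

A drop like 78 → 65 right after a knowledge base change is exactly the kind of regression worth investigating.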
Confidence vs Accuracy
High confidence doesn’t guarantee accuracy. It means the agent found relevant content, not that the response is correct.
When Confidence Can Mislead
- Outdated content — High match to old, incorrect info
- Similar but wrong — Question matches related but different topic
- Partial information — Good match but incomplete answer
Best Practice
Combine confidence monitoring with:
- User feedback (thumbs up/down)
- Regular content audits
- Human review of edge cases
API Access
Access confidence data programmatically:
Per-Message Confidence
```bash
curl "https://api.ansa.so/conversations/{id}" \
  -H "Authorization: Bearer $ANSA_API_KEY"
```
Response includes confidenceScore for each assistant message.
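Once you have the conversation payload, you can filter for the messages that need attention. A sketch, assuming the response contains a `messages` array whose entries carry `role` and `confidenceScore` fields (only `confidenceScore` is confirmed above; the rest of the shape is an assumption):

```python
def low_confidence_messages(conversation, threshold=60):
    """Return assistant messages whose confidenceScore falls below threshold.

    Messages with a null confidenceScore (no knowledge base search) are skipped.
    """
    return [
        m for m in conversation.get("messages", [])
        if m.get("role") == "assistant"
        and m.get("confidenceScore") is not None
        and m["confidenceScore"] < threshold
    ]
```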
Aggregate Analytics
```bash
# Get confidence summary
curl "https://api.ansa.so/analytics/confidence/summary?agentId=xxx" \
  -H "Authorization: Bearer $ANSA_API_KEY"

# Get confidence distribution
curl "https://api.ansa.so/analytics/confidence/distribution?agentId=xxx" \
  -H "Authorization: Bearer $ANSA_API_KEY"

# Get confidence over time
curl "https://api.ansa.so/analytics/confidence/over-time?agentId=xxx" \
  -H "Authorization: Bearer $ANSA_API_KEY"
```
Filtering Conversations
Find conversations that need attention:
- Go to Conversations
- Use filters:
- Min Confidence — Show only above threshold
- Max Confidence — Show only below threshold
- Review and improve knowledge base
Best Practices
Set a weekly calendar reminder to review low-confidence conversations. This is the fastest way to improve your agent.
- Start with 70% threshold — Adjust based on your needs
- Review weekly — Check low-confidence conversations
- Track trends — Monitor average confidence over time
- Combine with feedback — Use both metrics together
- Update regularly — Keep knowledge base current
Next Steps