Universal Containers is configuring its Agentforce Testing Center to evaluate an agent handling customer complaints. The company wants to assess if the agent successfully demonstrates empathy and follows a proprietary “De-escalation Framework” before offering a resolution.
Where should the Agentforce Specialist utilize a large language model (LLM)-as-judge to achieve an appropriate evaluation?
Coral Cloud Resorts (CCR) wants to configure its agent so that booking actions are only available when a customer ' s membership tier is “Premium” or “Elite”. This business rule must be enforced deterministically.
What should CCR implement?
Choose 1 option.
Universal Containers needs to restrict access to refund processing actions so only customers with Active account status can initiate refunds.
How should an Agentforce Specialist apply the restriction deterministically?
An Agentforce Specialist is testing an agent with Testing Center. The test results show that an agent correctly identifies the right subagent to handle an utterance, but it fails to select all necessary actions within that subagent.
Which evaluation metric specifically identifies this failure?