Appier Research Unveils Agentic AI Breakthrough: A Risk-Aware Decision Framework

Quantifying LLM Reliability Across Risk Scenarios for Trustworthy Enterprise AI

Appier announced new research advancing the reliability of Agentic AI systems. To expand the impact of its research and development efforts, Appier’s AI research team continues to focus on frontier topics in Agentic AI and Large Language Models (LLMs), exploring forward-looking technical challenges that push the boundaries of marketing technology innovation.

In its latest paper, “Answer, Refuse, or Guess? Investigating Risk-Aware Decision Making in Language Models,” the team introduces a systematic evaluation framework to measure how language models make decisions under different risk conditions. The approach significantly improves model reliability in high-risk scenarios through a novel methodological design.

The research addresses a key challenge in deploying Agentic AI in enterprise environments: ensuring that autonomous AI decisions are trustworthy. The findings further strengthen Appier’s technological leadership in AI while contributing practical insights for the broader Agentic AI ecosystem.

As enterprises move from AI copilots toward autonomous AI agents, reliability has become a critical barrier to adoption. According to a 2025 McKinsey survey, 62% of organizations have already begun experimenting with AI agents, yet inaccuracy remains the most commonly cited risk in enterprise AI adoption.

As an AI-native Agentic AI-as-a-Service (AaaS) company, Appier continues to translate cutting-edge research into enterprise-ready methodologies and product capabilities. This study specifically addresses two major enterprise concerns: AI hallucinations and decision reliability. To tackle these concerns, the research introduces a Risk-Aware Decision-Making framework that converts LLM decisions across varying risk conditions into quantifiable metrics, providing a stronger governance foundation for enterprise AI deployment.

Turning Risk-Aware Strategies into Quantifiable Metrics
Traditional LLM evaluations focus primarily on whether an answer is correct. In enterprise environments, however, the cost of being wrong and the value of refusing to answer differ significantly. The study introduces structured risk parameters—including rewards for correct answers, penalties for incorrect responses, and costs for refusal—to simulate different risk scenarios. Under this framework, models must evaluate their capability, confidence level, and risk conditions before deciding whether to answer, refuse, or guess. Decision quality is then measured by whether the model maximizes expected reward, providing a more realistic assessment of strategic decision-making.
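
The expected-reward comparison described above can be illustrated with a minimal sketch. The names `RiskSetting`, `expected_value_of_answering`, and `best_action`, along with the numeric payoffs, are hypothetical and chosen only for illustration; the announcement does not give the paper's exact parameterization.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RiskSetting:
    """Hypothetical payoff structure for one risk scenario (illustrative only)."""
    reward_correct: float     # points gained for a correct answer
    penalty_incorrect: float  # points lost for a wrong answer (positive number)
    refusal_cost: float       # points lost for refusing (positive number, may be 0)


def expected_value_of_answering(p_correct: float, risk: RiskSetting) -> float:
    """Expected payoff of answering, given the probability the answer is correct."""
    return p_correct * risk.reward_correct - (1.0 - p_correct) * risk.penalty_incorrect


def best_action(p_correct: float, risk: RiskSetting) -> str:
    """Return the action ("answer" or "refuse") with the higher expected payoff."""
    ev_answer = expected_value_of_answering(p_correct, risk)
    ev_refuse = -risk.refusal_cost
    return "answer" if ev_answer >= ev_refuse else "refuse"


# Assumed example scenarios: wrong answers are costly in the first,
# refusing is costly in the second.
high_risk = RiskSetting(reward_correct=1.0, penalty_incorrect=8.0, refusal_cost=0.0)
low_risk = RiskSetting(reward_correct=1.0, penalty_incorrect=0.0, refusal_cost=1.0)

if __name__ == "__main__":
    for p in (0.25, 0.5, 0.95):
        print(p, best_action(p, high_risk), best_action(p, low_risk))
```

Under these assumed payoffs, a model that is only 50% confident should refuse in the high-risk scenario and answer in the low-risk one, which is the kind of risk-sensitive behavior the framework is designed to measure.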

Key Finding: Strategic Imbalance in Existing Models
Using the Risk-Aware Decision-Making framework, the research finds that many leading LLMs display strategic imbalance across risk scenarios. In high-risk settings, models often over-guess despite potential negative consequences. In low-risk scenarios, they may become overly conservative and refuse to answer too frequently. This inconsistency limits both the autonomy and safety of AI systems in enterprise environments. The study suggests the issue is not purely knowledge-related but stems from the model’s difficulty in integrating multiple capabilities into a stable decision strategy.

Skill Decomposition Enables More Optimal Decisions
To address this challenge, the research proposes a Skill Decomposition approach, breaking decision-making into three steps:

  1. Task Execution — solving the task to generate an initial answer
  2. Confidence Estimation — evaluating confidence in that answer
  3. Expected-Value Reasoning — reasoning about outcomes under risk conditions

This structured reasoning process enables models to determine whether answering or refusing yields the best outcome. The approach allows models to better integrate multiple capabilities and produce more rational and stable decisions in high-risk environments, offering a practical path toward more reliable enterprise AI systems.
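
The three steps can be sketched as a small pipeline. This is an illustrative outline under stated assumptions, not the paper's implementation: `solve` and `estimate_confidence` are hypothetical stand-ins for model calls, and the payoff arguments are assumed names for the reward, penalty, and refusal-cost parameters.

```python
from typing import Callable, NamedTuple


class Decision(NamedTuple):
    action: str            # "answer" or "refuse"
    answer: str            # candidate answer produced in step 1
    confidence: float      # estimated probability of correctness from step 2
    expected_value: float  # expected payoff of the chosen action


def decide(
    question: str,
    solve: Callable[[str], str],                       # hypothetical model call
    estimate_confidence: Callable[[str, str], float],  # hypothetical model call
    reward_correct: float,
    penalty_incorrect: float,
    refusal_cost: float,
) -> Decision:
    # Step 1: Task Execution - produce an initial answer.
    answer = solve(question)

    # Step 2: Confidence Estimation - clamp the estimate into [0, 1].
    confidence = min(1.0, max(0.0, estimate_confidence(question, answer)))

    # Step 3: Expected-Value Reasoning - compare answering against refusing.
    ev_answer = confidence * reward_correct - (1.0 - confidence) * penalty_incorrect
    ev_refuse = -refusal_cost
    if ev_answer >= ev_refuse:
        return Decision("answer", answer, confidence, ev_answer)
    return Decision("refuse", answer, confidence, ev_refuse)


if __name__ == "__main__":
    # Stub functions stand in for a real model in this example.
    stub_solve = lambda q: "42"
    stub_confidence = lambda q, a: 0.5
    print(decide("What is 6 x 7?", stub_solve, stub_confidence, 1.0, 8.0, 0.0))
    print(decide("What is 6 x 7?", stub_solve, stub_confidence, 1.0, 0.0, 1.0))
```

Keeping each step as a separate call makes the intermediate answer and confidence inspectable, so a poor decision can be traced to the step that produced it.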

“For Agentic AI to operate in critical enterprise workflows, the key is not only making AI smarter, but making its autonomous decisions more reliable,” said Chih-Han Yu, CEO and Co-founder of Appier. “Appier has built its products around AI and continuously invested in world-class research. By turning LLM risk awareness into a quantifiable methodology, this research strengthens the foundation for trustworthy enterprise AI and helps accelerate the real-world adoption of Agentic AI and translate it into scalable business value and ROI.”

The research findings have been further integrated into Appier’s Agentic AI-powered platforms, including Ad Cloud, Personalization Cloud, and Data Cloud, helping enterprises advance autonomous workflows in a more reliable and trustworthy way.

Looking ahead, Appier will continue leveraging its strong AI research capabilities, proprietary data assets, and deep industry expertise to advance Agentic AI innovation and support enterprises in building more efficient and trustworthy AI-driven operations.

PRNewswire