New AI Technique Cracks the Code on Overconfidence in Autonomous Systems



Researchers at Salesforce AI Research have developed a new method to improve the reliability of artificial intelligence systems. Jiaxin Zhang, Caiming Xiong, and Chien-Sheng Wu introduced Holistic Trajectory Calibration (HTC), an AI technique designed to tackle overconfidence in autonomous AI agents. Their work focuses on making AI decisions more dependable, especially in complex, multi-step tasks where current methods often fall short.

Existing calibration techniques struggle when AI systems handle tasks with multiple stages. These methods typically assess confidence at single points rather than across the entire process. HTC changes this by analysing an agent's full 'trajectory'—the sequence of actions and decisions from start to finish.
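The article does not spell out HTC's exact feature set, but the contrast between single-point and trajectory-level assessment can be sketched. The snippet below is a hypothetical illustration: it summarises a sequence of per-step confidence scores into trajectory-level features (overall level, drift, stability) rather than relying on the final step alone. The function name and feature choices are assumptions, not the published method.

```python
import statistics

def trajectory_features(step_confidences):
    """Summarise per-step confidences across a whole trajectory.

    Illustrative only: HTC's actual features are not specified in this
    summary. A single-point method would look only at the last value;
    a trajectory-level view also captures broad dynamics (drift) and
    fine-grained stability (spread across steps).
    """
    final = step_confidences[-1]                        # single-point view
    mean = statistics.fmean(step_confidences)           # overall level
    drift = step_confidences[-1] - step_confidences[0]  # broad dynamics
    stability = statistics.pstdev(step_confidences)     # step-to-step spread
    return {"final": final, "mean": mean,
            "drift": drift, "stability": stability}

# Example: an agent that grows less certain mid-task, then recovers
feats = trajectory_features([0.9, 0.6, 0.8])
```

A final-step-only assessment would report 0.8 here and miss the mid-task dip that the `stability` feature exposes.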

The approach extracts detailed features at every stage, from broad dynamics to fine-grained stability. This allows HTC to provide deeper insights into why an AI succeeds or fails. Unlike other solutions, it works with different types of models, making it adaptable for various AI applications.

Testing on the HLE dataset showed strong results. HTC-Reduced, a streamlined version of the method, achieved an Expected Calibration Error (ECE) of 0.031 and a Brier Score of 0.09. These metrics indicate better accuracy and reliability compared to existing baselines.

Beyond calibration, HTC enhances other key areas. It improves discrimination between correct and incorrect decisions, offers clearer interpretability, and supports better transferability and generalisation across tasks. The researchers argue that these strengths set it apart from previous AI approaches.
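Both metrics quoted above are standard and easy to compute. ECE bins predictions by confidence and averages the gap between confidence and accuracy within each bin, weighted by bin size; the Brier Score is the mean squared error between the predicted probability and the binary outcome. A minimal sketch (using the standard definitions, not any HTC-specific code):

```python
import numpy as np

def expected_calibration_error(confidences, correct, n_bins=10):
    """ECE: bin predictions by confidence, then take the bin-size-weighted
    average of |mean confidence - accuracy| within each bin."""
    confidences = np.asarray(confidences, dtype=float)
    correct = np.asarray(correct, dtype=float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for lo, hi in zip(edges[:-1], edges[1:]):
        mask = (confidences > lo) & (confidences <= hi)
        if not mask.any():
            continue
        gap = abs(confidences[mask].mean() - correct[mask].mean())
        ece += mask.mean() * gap  # weight by fraction of samples in the bin
    return ece

def brier_score(confidences, correct):
    """Brier Score: mean squared error between predicted probability
    and the 0/1 outcome. Lower is better for both metrics."""
    confidences = np.asarray(confidences, dtype=float)
    correct = np.asarray(correct, dtype=float)
    return float(np.mean((confidences - correct) ** 2))
```

An agent that reports 90% confidence and is right exactly 90% of the time scores an ECE near zero; the 0.031 ECE and 0.09 Brier Score reported for HTC-Reduced mean its confidence estimates track actual success rates closely.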

The introduction of HTC marks a step forward in addressing AI overconfidence. By evaluating entire decision-making processes, the method helps ensure more consistent and trustworthy AI performance. Its flexibility and strong test results suggest potential for wider adoption in autonomous systems.
