Leader’s Opinion: LLMs Ride the Overconfidence Wave with Mukundan Rengaswamy

By 理想

In the world of machine learning, developers often grapple with the enigmatic quirks of Large Language Models (LLMs). Jonathan Whitaker and Jeremy Howard of fast.ai embarked on an intriguing experiment, unearthing a subtle yet pervasive issue with these models: overconfidence, a phenomenon distinct from the notorious LLM hallucination.
Mukundan Rengaswamy, Head of Data Engineering, Innovation & Architecture at Webster Bank, weighed in on the matter, stating, “LLMs (Large Language Models) and Gen AI have been in the news ever since ChatGPT was introduced to the public. Generative AI is powered by very large machine learning models that are pre-trained on vast amounts of data. A lot of research is being done on these models to better understand their behavior and refine them for broader…”
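To make “overconfidence” concrete: a model is overconfident when the probability it assigns to its answers runs ahead of how often those answers are actually right. One minimal sketch of that diagnostic, using invented toy numbers rather than anything from the fast.ai experiment, is a bin-by-bin comparison of stated confidence against empirical accuracy, a basic expected calibration error:

```python
import numpy as np

def expected_calibration_error(confidences, correct, n_bins=10):
    """Compare stated confidence with empirical accuracy, bin by bin.

    A positive gap (confidence above accuracy) in a bin signals overconfidence.
    """
    confidences = np.asarray(confidences, dtype=float)
    correct = np.asarray(correct, dtype=float)
    bins = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for lo, hi in zip(bins[:-1], bins[1:]):
        mask = (confidences > lo) & (confidences <= hi)
        if mask.any():
            gap = confidences[mask].mean() - correct[mask].mean()
            ece += mask.mean() * abs(gap)  # weight each bin by its share of samples
    return ece

# Toy example: the model reports ~90% confidence but is right only ~60% of the time.
rng = np.random.default_rng(0)
conf = rng.uniform(0.85, 0.95, size=1000)
hits = rng.random(1000) < 0.6
print(f"ECE: {expected_calibration_error(conf, hits):.3f}")  # large gap, i.e. overconfident
```

A well-calibrated model would show a near-zero gap in every bin; a persistently positive gap is the overconfidence pattern described above.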
Ideally, one would want to select a model at the sweet spot between underfitting and overfitting. That is the goal, but it is very difficult to achieve in practice.
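A common way to hunt for that sweet spot is to watch held-out error as model capacity grows: training error keeps falling, while validation error falls and then climbs once the model starts to overfit. Here is a minimal sketch, assuming scikit-learn is available and using polynomial degree as a stand-in for capacity (the data and degrees are illustrative):

```python
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import cross_val_score

# Toy 1-D regression data: a noisy sine wave.
rng = np.random.default_rng(0)
X = rng.uniform(0, 1, size=(60, 1))
y = np.sin(2 * np.pi * X[:, 0]) + rng.normal(scale=0.2, size=60)

# Sweep model capacity (polynomial degree); score each by cross-validated MSE.
for degree in (1, 3, 9, 15):
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression())
    mse = -cross_val_score(model, X, y, cv=5,
                           scoring="neg_mean_squared_error").mean()
    print(f"degree={degree:2d}  CV MSE={mse:.3f}")
# Low degrees underfit, high degrees overfit; the minimum sits in between.
```

The degree whose cross-validated error bottoms out is the sweet spot for this toy problem; the same capacity-sweep idea applies, with far more effort, to large models.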
