GPT-4 faces a challenger: Can Writer’s finance-focused LLM take the lead in banking? – Tearsheet

July 4, 2025

Connect With Us

10 Best AI Agents...

# Tags

Advanced Plugin Tools

AI Artistry

AI Chatbots

AI Content Creation

AI Development Tools

AI Driven Marketing Strategies

AI Image Generation

AI in eCommerce

AI in Email Marketing

AI Marketing Solutions

AI Plugins

AI Programming Languages

AI SEO Tools

AI Software Development

AI Website Design

AI-Generated Images

AI-Powered Email Campaigns

AI-Powered Web Design

Artificial Intelligence for Online Retail

Artificial Intelligence in Digital Marketing

Artificial Intelligence in SEO

Artificial Intelligence Integration

Automated Customer Interactions

Automated Email Writing

Automated Website Development

Automated Writing

Content Automation Tools

Conversational AI

Creative AI Algorithms

Deep Learning Art

Deep Learning Libraries

eCommerce Automation

eCommerce Optimization with AI

Generative Adversarial Networks (GANs)

Intelligent Content Generation

Intelligent Conversational Agents

Intelligent Email Automation

Machine Learning Add-ons

Machine Learning For Marketing

Machine Learning Frameworks

Machine Learning SEO

Machine Learning Websites

Marketing Automation AI

Natural Language Processing

Natural Language Processing in Emails

Predictive Analytics for Retail

SEO AI Optimization

SEO Automation Software

Smart Plugin Solutions

Smart Website Creation

Tech

Trending

#Chatbots

GPT-4 faces a challenger: Can Writer’s finance-focused LLM take the lead in banking? – Tearsheet

Team ZYT Web3 / 8 hours
July 3, 2025
0
4 min read

Welcome to the forefront of conversational AI as we explore the fascinating world of AI chatbots in our dedicated blog series. Discover the latest advancements, applications, and strategies that propel the evolution of chatbot technology. From enhancing customer interactions to streamlining business processes, these articles delve into the innovative ways artificial intelligence is shaping the landscape of automated conversational agents. Whether you’re a business owner, developer, or simply intrigued by the future of interactive technology, join us on this journey to unravel the transformative power and endless possibilities of AI chatbots.
Blockchain/ Crypto
Banking
Data
Embedded Finance
Lending
Marketing
Payments
Green Finance
Δ

Banks are heavily investing in Large Language Models (LLMs) to enhance both internal operations and customer interactions — yet building a model that excels at both is a significant challenge.

A recent study by Writer, a San Francisco-based generative AI company that provides a full-stack AI platform for enterprise use, found that ‘thinking’ LLMs produce false information in up to 41% of tested cases.

The study evaluated advanced reasoning models in real-world financial scenarios, highlighting the risks such inaccuracies pose to regulated industries like financial services. The research also showed that traditional chat LLMs outperform thinking models in accuracy.

LLMs are used in three main ways within financial services:

We often focus on chatbots built by banks and financial firms, but today, we explore the underlying technology behind them — the engines driving chatbot interactions and platform automation.

We take a closer look at the LLMs driving these AI systems, their challenges, and how financial firms can train enterprise-grade models to capitalize on their potential while controlling their risks.

Thinking LLMs, also referred to as CoT (Chain-of-Thought) models, are designed to simulate multi-step reasoning and decision-making processes to provide more nuanced responses beyond only retrieving or summarizing information, says Waseem Alshikh, CTO and co-founder of Writer.

Morgan Stanley’s AI Assistant, for example, uses OpenAI’s GPT-4 to scan 100,000+ research reports and provide quick insights to financial advisors. It enhances portfolio strategy recommendations by summarizing complex data beyond retrieving reports.

“These models are not truly ‘thinking’ but are instead trained to generate outputs that resemble reasoning patterns or decompose complex problems into intermediate reasoning steps,” Waseem notes.

Morgan Stanley’s AI tool encountered accuracy issues stemming from hallucinated responses. Shortly after its launch in 2023, sources within the company described the tool as ‘spotty on accuracy,’ with users frequently receiving responses like “I’m unable to answer your question.”

While Morgan Stanley has been proactive in fine-tuning OpenAI’s GPT-4 model to assist its financial advisors, the company acknowledges the challenges posed by AI hallucinations. To reduce inaccuracies, the bank curated training data and limited prompts to business-related topics.

Traditional chat LLMs, however, tend to be more accurate, according to Waseem. These models mainly use pattern matching and next-token prediction, responding in a conversational manner based on pre-trained knowledge and contextual cues. While these models may struggle with complex queries at times, they produce fewer hallucinations, making them more reliable for regulatory compliance, according to Writer’s research.

Bank of America’s virtual assistant, Erica, uses a traditional chat model to assist customers with banking tasks like balance inquiries, bill payments, and credit report updates. By leveraging structured data and predefined algorithms, it provides accurate and reliable responses while reducing the likelihood of misinformation.

But how can financial firms navigate the trade-off between AI sophistication and accuracy?

Given the advanced capabilities of thinking LLMs, financial firms can’t simply rule them out, but they can deploy them effectively with the right strategic approaches.

Waseem outlines the key steps:

source

Haryana CM launches AI...

I created a chatbot...

Ever Wondered What Your...

AI ChatGPT Responds to...

Grok 3 Is in...

10 Best AI Agents...

GPT-4 faces a challenger: Can Writer’s finance-focused LLM take the lead in banking? – Tearsheet

Haryana CM launches AI chatbot 'Sarathi'.

I created a chatbot of myself.

Ever Wondered What Your Signature Candle.

AI ChatGPT Responds to UNs Proposed.