Hey, it’s Matt. This week on AI Street:

🤖 Wharton: AI Paying Off on Wall Street

🧰 Anthropic Adds Excel for Finance

🥇 Ranking Macro Analysts Like Stock Pickers

Forwarded this? Subscribe here. Join readers from McKinsey, JPMorgan, BlackRock & more.

ADOPTION

Finance Firms See Clear ROI From AI: Wharton

Wharton came out with a survey this week on AI adoption that pushes back on a couple of narratives that have emerged recently :1) businesses are struggling to get ROI out of AI and 2) the technology is shrinking opportunities for recent college graduates. The business school and GBK Collective surveyed about 800 “senior decision makers” across industries including banking and finance.

Finance is one of the leading industries for AI ROI

  • 83% of Banking and Finance firms report positive ROI from GenAI initiatives, on par with Tech and Professional Services.

AI’s returns are highest where work is digital and data-heavy

  • Fewer than 7% of firms in any sector reported negative ROI, implying most AI programs are at least self-funding.

Finance leaders are using AI Agents for automation and accuracy

  • Contract lifecycle management automation with 92% accuracy.

  • Invoice matching, fraud flagging, and reporting in finance operations.

Budgets for AI are growing

  • 88% of executives expect AI budgets to rise in the next year; two-thirds of enterprises already budget $5 million or more for GenAI initiatives.

Banking and finance execs expect “revolutionary” impact

  • These executives are also the most confident in continued ROI acceleration over the next two to five years.

Entry-level roles will see the biggest shifts

  • 17% of finance leaders expect fewer intern hires, while nearly half (49%) expect to hire more. Wharton notes this suggests automation will restructure entry-level pipelines rather than eliminate them, as firms move toward AI-augmented junior roles.

This study challenges an MIT survey that said about 95% of corporate GenAI pilots fail to deliver measurable business results. JPMorgan’s Dimon said earlier this month AI is already paying for itself. Other studies this week point to measurable benefits: AI adoption among financial services compliance professionals jumped 26 points to 71%, according to a new industry survey.

Takeaway

It’s only been three years (!) since ChatGPT went mainstream and AI has quickly permeated the corporate world and adoption is accelerating. Many business tasks don’t demand critical thinking, a skill AI (still) lacks.

Further Reading

  • Gen AI Fast-Tracks Into the Enterprise | Wharton

  • How Are Companies Using AI? The short answer: a lot. | WSJ

  • JPM’s Derek Waldron on building an AI-first bank culture | McKinsey

USE CASE

AI to Help Write Employee Reviews

JPMorgan and Citigroup are letting employees use AI to help write performance reviews—a task we’re all glad to give to the bots.

Made with ChatGPT .. Duh

USE CASE

Anthropic Adds Excel for Finance

Anthropic, the AI startup backed by Amazon and Google, announced updates to its Financial Services offering, including a beta preview of an Excel plugin that lets users analyze and interact with their spreadsheets through its chatbot Claude.

LLMs, by themselves, perform poorly at math, and therefore are not well suited for Excel. That’s because language models are built to predict the next token, not perform calculations. (And also why most AI-for-finance startups have focused on chatbot-driven research.)

Anthropic has been working to connect these two worlds: LLM probability and deterministic code.

It released Model Context Protocol last year to help these different systems work together, a standard that’s been quickly adopted by competitors including OpenAI and Google.

Anthropic also rolled out what it calls “Agent Skills,” pre-built modules that handle finance tasks like building discounted cash flow models, running comps, analyzing earnings, and writing coverage reports across its apps.

Anthropic also announced new data integrations.

  • Aiera: Earnings call transcripts, investor event summaries and expert interviews via Third Bridge

  • LSEG: Market data, including equities, FX, and macro indicators

  • Moody’s: Ratings, company financials, and research

  • Chronograph: Private capital portfolios and fund-level data

  • Egnyte: Secure search of internal data rooms and models

  • MT Newswires: Global, multi-asset financial news

Takeaway

A year ago, lots of commentary focused on how LLMs were unreliable. Folks were using them as a Swiss army knife, rather than just another tool. Now, things have moved toward integrating a probabilistic system into deterministic workflows.

Further Reading

  • Advancing Claude for Financial Services | Anthropic

  • Anthropic beefs up Claude for Financial Services | Finextra

More AI Adoption + Wall Street News

  • Goldman, JPM See Junior Bankers Managing AI Bots | BI

  • VC Firms Lean on AI for More Than Just Returns | BBG

  • Sequoia Eyes Investment in Rogo, an AI for I-Bankers | BBG

  • Stockholm-Based AI Startup Grasp Raises $7 Million | PR

  • Boosted.ai Adds AI You Can Talk To for Investment Research | PR

  • BofA Reports Record Corporate Adoption of AI CashPro | Digital Banker

USE CASE

Tracking Macro Analysts Like Stock Pickers

Buy, Sell, and Hold stock recommendations have been part of Wall Street research for decades. They caught on in part because they’re simple and easy to measure: you have a price target and a time horizon, and later you can check whether that call actually made money.

Macro forecasts are harder to track. It’s trickier to judge whether an analyst’s outlook on inflation was actually right—and if it was, whether it was right for the reasons the analyst gave.

In theory, you could track all this yourself, but it’s a manual, time-consuming task—one that large language models are well suited for because they excel at organizing messy data.

Menos AI, a San Jose startup, ranks analysts on both their calls and reasoning. Its Voice Scoring system uses large language models trained on financial data and agentic workflows to evaluate whether an analyst’s explanations proved correct, producing a “voice score.”

The company is piloting Voice Scoring with several global-macro hedge funds, per the company. Menos AI has raised $5.2 million in seed capital and was founded last year by William Wu, Chris Yang, and Xiang Pan.

Takeaway

AI can now extract and connect data at a cost that makes new indicators like Menos AI’s voice scores economically viable.

Further Reading

RESEARCH

AI Tracks the Economy’s Mood Through WSJ

Researchers tracked day-to-day changes in macroeconomic sentiment in Wall Street Journal stories using AI to identify and categorize growth and inflation narratives to show that news-based sentiment mirrors key economic indicators, according to research from the Bank for International Settlements.

The paper uses OpenAI’s GPT-4.1 and GPT-5 mini models to analyze 25 years of Wall Street Journal coverage of the U.S. economy. By analyzing about 200,000 articles, the models generated daily “growth” and “inflation” sentiment scores and broke them down into demand- and supply-side drivers such as fiscal policy, commodity prices, and supply disruptions.

Researchers found that the LLM-derived sentiment indices closely track traditional indicators such as payrolls, inflation, and industrial production.

“The LLM-based approach bypasses statistical inference and directly draws on human-curated narratives,” the paper said, adding that the method “provides a real-time, high-frequency quantitative assessment of the state of the economy.”

The BIS, sometimes called the central bank of central banks, has released the resulting indices and plans to update them over time.

As for current economic conditions, the AI models suggest that recent articles point to slower growth, with more coverage of weak spending and tighter financial conditions, along with renewed inflation pressures from government actions and energy costs.

A big issue with AI forecasting is that we don't know what data went into training the model, which opens the door to look-ahead bias—testing it on data it already “knows.” So as the BIS releases more data, we’ll be able to see how the models perform on information it’s definitely not seen.

Takeaway

LLMs can quantify economic sentiment by analyzing news narratives. The BIS study shows their results closely track traditional indicators like payrolls, inflation, and industrial production.

Further Reading

  • Parsing the pulse: decomposing macroeconomic sentiment with LLMs | BIS

SPONSORSHIPS

Reach Wall Street’s AI Decision-Makers

Advertise on AI Street to reach a highly engaged audience of decision-makers at firms including JPMorgan, Citadel, BlackRock, Skadden, McKinsey, and more. Sponsorships are reserved for companies in AI, markets, and finance. Email me ([email protected]) for more details.

ROUNDUP

What Else I’m Reading

  • 16 Charts that Explain the AI Boom | Understanding AI

  • AI Stock Valuations to Fall as Hype Becomes Reality | BBG Opinion

  • Google, Anthropic Announce Cloud Deal Worth Tens of Billions | PR

  • AI Has Limited Impact on US Employment: BofA | PYMNTS

  • Citadel Adds Ex Millennium Compute Head for AI Systems | Hedgeweek

  • Nordic Capital Inks $250 Million Deal for Market Data Firm BMLL | BBG

CALENDAR

Upcoming AI + Finance Conferences

  • ACM ICAIF 2025 – November 15–18, 2025 • Singapore

    Top-tier academic/industry conference on AI in finance and trading.

  • AI for Finance – November 24–26, 2025 • Paris

    Artefact’s AI for Finance summit, focused on generative AI, future of finance, digital sovereignty, and regulation 

  • NeurIPS Workshop: Generative AI in Finance – Dec. 6/7 • San Diego One-day academic workshop at NeurIPS focused on generative AI applications in finance, organized by ML researchers.

Is there a conference I missed? Reach out: [email protected]

How did you like today's newsletter?

Login or Subscribe to participate

Reply

or to participate

More From AI Street

No posts found