Close Menu
Alpha Leaders
  • Home
  • News
  • Leadership
  • Entrepreneurs
  • Business
  • Living
  • Innovation
  • More
    • Money & Finance
    • Web Stories
    • Global
    • Press Release
What's On
Manycore bets on ‘spatial intelligence’ after HK IPO

Manycore bets on ‘spatial intelligence’ after HK IPO

18 April 2026
Iran and White House say the Strait of Hormuz is ‘completely open.’ But it remains closed for now

Iran and White House say the Strait of Hormuz is ‘completely open.’ But it remains closed for now

18 April 2026
Tether extends 7M to crypto platform Drift as critics blast Circle for not freezing stolen funds

Tether extends $127M to crypto platform Drift as critics blast Circle for not freezing stolen funds

18 April 2026
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
Alpha Leaders
newsletter
  • Home
  • News
  • Leadership
  • Entrepreneurs
  • Business
  • Living
  • Innovation
  • More
    • Money & Finance
    • Web Stories
    • Global
    • Press Release
Alpha Leaders
Home » Multimodal AI In 2025: From Healthcare To eCommerce And Beyond
Innovation

Multimodal AI In 2025: From Healthcare To eCommerce And Beyond

Press RoomBy Press Room6 January 20255 Mins Read
Facebook Twitter Copy Link Pinterest LinkedIn Tumblr Email WhatsApp
Multimodal AI In 2025: From Healthcare To eCommerce And Beyond

Multimodality is set to redefine how enterprises leverage AI in 2025. Imagine an AI that understands not just text but also images, audio, and other sensor data. Humans are naturally multimodal. However, humans are limited in how much input we can process. Take healthcare as an example, during my time at Google Health, I heard many stories where patients overwhelmed doctors with data:

Imagine a patient with atrial fibrillation (AFIB) showing up with five years of detailed sleep data collected from their smartwatch. Or take the cancer patient arriving with a 20-pound stack of medical records documenting every treatment they’ve had. Both of these situations are very real. For doctors, the challenge is the same: separating the signal from the noise.

What’s needed is an AI that can summarize and highlight the key points. Large language models, like ChatGPT, already do this with text, pulling out the most relevant information. But what if we could teach AI to do the same with other types of data — like images, time series, or lab results?

How Does Multimodality AI Work?

To understand how multimodality works, let’s start with the fact that AI needs data both to be trained and to make predictions. Multimodal AI is designed to handle diverse data sources — text, images, audio, video, and even time-series data — at the same time. By combining these inputs, multimodal AI offers a richer, more comprehensive understanding of the problems it tackles.

Multimodal AI is more of a discovery tool. The different data modalities are stored by the AI. Once a new data point is input, the AI finds topics that are close. For example, by inputting the sleep data from someone’s smartwatch alongside information about their atrial fibrillation (AFIB) episodes, the doctor might find indications of sleep apnea.

Note that this is based on “closeness,” not correlation. It is the scaled-up version of what Amazon once popularized: “people who shopped for this item also bought this item.” In this case, it’s more like: “People with this type of sleep pattern have also been diagnosed with AFIB.”

Multimodal Explained: Encoders, Fusion and Decoders

A multimodal AI system consists of three main components: Encoders, Fusion and Decoders.

Encoding Any Modality

Encoders convert raw data (e.g., text, images, sound, log files, etc.) into a representation the AI can work with. These are called vectors, which are stored in a latent space. To simplify, think of this process as storing an item in a warehouse (latent space), where each item has a specific location (vector). Encoders can process virtually anything: images, text, sound, videos, log files, IoT (sensor) information, time series — you name it.

Fusion Mechanism: Combining Modalities

When working with one type of data, like images, encoding is enough. But with multiple types — images, sounds, text, or time-series data — we need to fuse the information to find what’s most relevant.

Decoders: Generating Outputs We Understand

Decoders “decodes” the information from the latent space — aka the warehouse — and deliver it to us. It moves from raw, abstract information to something we understand. For example, finding an image of a “house.”

If you want to learn more about encoding, decoding, and reranking, join my eCornell Online Certificate course on “Designing and Building AI Solutions.” It’s a no-coding program that explores all aspects of AI solutions.

Transforming eCommerce with Multimodality

Let’s look at another example: eCommerce. Amazon’s interface hasn’t changed much in 25 years — you type a keyword, scroll through results, and hope to find what you need. Multimodality can transform this experience by letting you describe a product, upload a photo, or provide context to find your perfect match.

Fixing Search with Multimodal AI

At r2decide, a company a few Cornellians and I started, we’re using multimodality to merge Search, Browse, and Chat into one seamless flow. Our customers are eCommerce companies tired of losing revenue because their users couldn’t find what they needed. At the core of our solution is multimodal AI.

For example, in an online jewelry store, a user searching for “green” would — in the past — only see green jewelry if the word “green” appeared in the product text. Since r2decide’s AI also encodes images into a shared latent space (e.g., warehouse), it finds “green” across all modalities. The items are then re-ranked based on the user’s past searches and clicks to ensure they receive the most relevant “green” options.

Users can also search for broader contexts, like “wedding,” “red dress,” or “gothic.” The AI encodes these inputs into the latent space, matches them with suitable products, and displays the most relevant results. This capability even extends to brand names like “Swarovski,” surfacing relevant items — even if the shop doesn’t officially carry Swarovski products.

AI-Generated Nudges to Give Chat-Like Advice

Alongside search results, R2Decide also generates AI-driven nudges — contextual recommendations or prompts designed to enhance the user experience. These nudges are powered by AI agents, as I described in my post on agentic AI yesterday. Their purpose is to guide users effortlessly toward the most relevant options, making the search process intuitive, engaging, and effective.

Multimodality in 2025: Infinite Possibilities for Enterprises

Multimodality is transforming industries, from healthcare to eCommerce. And it doesn’t stop there. Startups like TC Labs use multimodal AI to streamline engineering workflows, boosting efficiency and quality, while Toyota uses it for interactive, personalized customer assistance.

2025 will be the year multimodal AI transforms how enterprises work. Follow me here on Forbes, or on LinkedIn for more of my 2025 AI predictions.

AI AI Explainer ecommerce Encoder Decoder Pair enterprise healthcare r2decide
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link

Related Articles

This Sam Altman-Backed $1.8 Billion Startup Bets AI Can Get Drugs Through Clinical Trials Faster

17 April 2026
How Arizona-Based Lectric eBikes Is Dominating The D2C Market

How Arizona-Based Lectric eBikes Is Dominating The D2C Market

16 April 2026
This AI Unicorn Is Powering The World’s Most Realistic Avatars—And Disrupting A 0 Billion Market

This AI Unicorn Is Powering The World’s Most Realistic Avatars—And Disrupting A $200 Billion Market

16 April 2026

Energy Storage Boom Propels Former Huawei Executive Into Billionaire Ranks

16 April 2026

Mutiny Killed Its SaaS Business And Grew MRR 12 Times Faster

15 April 2026

Meet The Asian Billionaires Powering The Global AI Boom

15 April 2026
Don't Miss
Unwrap Christmas Sustainably: How To Handle Gifts You Don’t Want

Unwrap Christmas Sustainably: How To Handle Gifts You Don’t Want

By Press Room27 December 2024

Every year, millions of people unwrap Christmas gifts that they do not love, need, or…

Walmart dominated, while Target spiraled: the winners and losers of retail in 2024

Walmart dominated, while Target spiraled: the winners and losers of retail in 2024

30 December 2024
Moltbook is the talk of Silicon Valley. But the furor is eerily reminiscent of a 2017 Facebook research experiment

Moltbook is the talk of Silicon Valley. But the furor is eerily reminiscent of a 2017 Facebook research experiment

6 February 2026
Stay In Touch
  • Facebook
  • Twitter
  • Pinterest
  • Instagram
  • YouTube
  • Vimeo
Latest Articles
Oil is back to early war days, S&P 500 jumps to all-time high

Oil is back to early war days, S&P 500 jumps to all-time high

17 April 20261 Views
White House chief of staff to meet with Anthropic CEO about dangerous new Mythos model, official says

White House chief of staff to meet with Anthropic CEO about dangerous new Mythos model, official says

17 April 20262 Views
Half of Iran’s workforce faces unemployment risk as US-Israel war’s ‘hidden target’ was labor market

Half of Iran’s workforce faces unemployment risk as US-Israel war’s ‘hidden target’ was labor market

17 April 20265 Views
Exclusive: Adam Silver wins Edison Award: ‘Some of the most important forms of innovation are human’

Exclusive: Adam Silver wins Edison Award: ‘Some of the most important forms of innovation are human’

17 April 20264 Views

Recent Posts

  • Manycore bets on ‘spatial intelligence’ after HK IPO
  • Iran and White House say the Strait of Hormuz is ‘completely open.’ But it remains closed for now
  • Tether extends $127M to crypto platform Drift as critics blast Circle for not freezing stolen funds
  • Something is different about Trump’s $1 trillion war on Iran and its stress on the national debt, Harvard Kennedy scholar says
  • Oil is back to early war days, S&P 500 jumps to all-time high

Recent Comments

No comments to show.
About Us
About Us

Alpha Leaders is your one-stop website for the latest Entrepreneurs and Leaders news and updates, follow us now to get the news that matters to you.

Facebook X (Twitter) Pinterest YouTube WhatsApp
Our Picks
Manycore bets on ‘spatial intelligence’ after HK IPO

Manycore bets on ‘spatial intelligence’ after HK IPO

18 April 2026
Iran and White House say the Strait of Hormuz is ‘completely open.’ But it remains closed for now

Iran and White House say the Strait of Hormuz is ‘completely open.’ But it remains closed for now

18 April 2026
Tether extends 7M to crypto platform Drift as critics blast Circle for not freezing stolen funds

Tether extends $127M to crypto platform Drift as critics blast Circle for not freezing stolen funds

18 April 2026
Most Popular
Something is different about Trump’s  trillion war on Iran and its stress on the national debt, Harvard Kennedy scholar says

Something is different about Trump’s $1 trillion war on Iran and its stress on the national debt, Harvard Kennedy scholar says

18 April 20261 Views
Oil is back to early war days, S&P 500 jumps to all-time high

Oil is back to early war days, S&P 500 jumps to all-time high

17 April 20261 Views
White House chief of staff to meet with Anthropic CEO about dangerous new Mythos model, official says

White House chief of staff to meet with Anthropic CEO about dangerous new Mythos model, official says

17 April 20262 Views

Archives

  • April 2026
  • March 2026
  • February 2026
  • January 2026
  • December 2025
  • November 2025
  • October 2025
  • September 2025
  • August 2025
  • July 2025
  • June 2025
  • May 2025
  • April 2025
  • March 2025
  • February 2025
  • January 2025
  • December 2024
  • November 2024
  • October 2024
  • September 2024
  • August 2024
  • July 2024
  • June 2024
  • May 2024
  • April 2024
  • March 2024
  • February 2024
  • January 2024
  • December 2023
  • March 2022
  • January 2021
  • March 2020
  • January 2020

Categories

  • Blog
  • Business
  • Entrepreneurs
  • Global
  • Innovation
  • Leadership
  • Living
  • Money & Finance
  • News
  • Press Release
© 2026 Alpha Leaders. All Rights Reserved.
  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Type above and press Enter to search. Press Esc to cancel.