Close Menu
Alpha Leaders
  • Home
  • News
  • Leadership
  • Entrepreneurs
  • Business
  • Living
  • Innovation
  • More
    • Money & Finance
    • Web Stories
    • Global
    • Press Release
What's On
Why AI Literacy Has Become A Boardroom And Investor Priority

Why AI Literacy Has Become A Boardroom And Investor Priority

20 May 2026
Google’s AI Smartglasses Could Challenge The App Economy

Google’s AI Smartglasses Could Challenge The App Economy

20 May 2026
When Is the Next UFC? Date, Times and Full Schedule

When Is the Next UFC? Date, Times and Full Schedule

20 May 2026
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
Alpha Leaders
newsletter
  • Home
  • News
  • Leadership
  • Entrepreneurs
  • Business
  • Living
  • Innovation
  • More
    • Money & Finance
    • Web Stories
    • Global
    • Press Release
Alpha Leaders
Home » Kioxia AiSAQ Improves AI Inference With Lower DRAM Costs
Innovation

Kioxia AiSAQ Improves AI Inference With Lower DRAM Costs

Press RoomBy Press Room8 July 20253 Mins Read
Facebook Twitter Copy Link Pinterest LinkedIn Tumblr Email WhatsApp
Kioxia AiSAQ Improves AI Inference With Lower DRAM Costs

In April this year, Kioxia’s Rory Bolt gave me a briefing on Kioxia’s AiSAQ, an open-source project intended to promote the expanded use of SSDs in RAG AI solutions. The focus on AI is moving from generating foundational models with massive and expensive training to cost effective and scalable ways to create inference solutions that can solve real world problems.

Retrieval-Augmented Generation is an approach to AI that combined traditional information retrieval systems with large language models. RAG enhances the performance of LLMs by allowing them to access and incorporate information from external knowledge sources, such as databases, websites, and internal documents, before generating a response. This approach helps LLMs produce more accurate, contextually relevant, and up-to-date information, especially when dealing with specific domains or real-time data.

Kioxia has used AI to improve the output of its NAND fabs since 2017, mostly using machine vision to monitor trends and defect rates. In 2020 Kioxia used AI to generate the world’s first AI-designed Manga, Phaedo, drawing on manga drawings and stories based on Osuma Tezuka’s work.

I was told that although larger data centers feed data to their AI models using hard drives, many in-house solutions train using data on SSDs. These solutions often work with foundational LLM models created with very large data sets and use RAG using in-house and perhaps more up to date data to tune the foundational model for a particular application and to avoid hallucinations. The image below illustrates how a database can be used for tuning of the original LLM.

Here the customer query is answered using the LLM as well as domain specific and up to date information in a vector data base. Such RAG solutions can be done with the data base index and vectors all in DRAM, but such an approach can use a lot of memory, making them very expensive, particularly for large data bases.

Microsoft developed Disk ANN which moved the bulk of the vector DB content to SSDs. This reduced the required DRAM footprint for the DB enabling greater scaling of vector DBS. This is used in products such as Azure Vector DB and Cosmos DB.

Kioxia’s All-in-Storage ANNS with Product Quantization, or AiSAQ completes the move of database vectors into storage, further reducing the DRAM requirements. These three approaches are represented in the drawing below.

Kioxia says that this approach enabled greater scalability for RAG workflows and thus better accuracy in the models. The image below shows the significant reduction of DRAM required for large databases compared to the DRAM-based, and DiskANN approach and the improved query accuracy.

In early July Kioxia announced further improvements to its AiSAQ. This new open source release allows flexible controls that allow system architects to define the balance point between search performance and the number of vectors, which are opposing factors with the fixed capacity of SSD storage in the system. The resulting benefit enables architects of RAG systems to fine-tune the optimal balance between specific workloads and their requirements, without any hardware modifications.

Kioxia’s AiSAQ allows more scalable RAG AI inference systems by moving database vectors entirely into storage, thus avoiding DRAM growth with increasing database sizes.

AI AiSAQ Artificial Intelligence DRAM Inference Kioxia NAND RAG Scalability SSD
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link

Related Articles

Why AI Literacy Has Become A Boardroom And Investor Priority

Why AI Literacy Has Become A Boardroom And Investor Priority

20 May 2026
Google’s AI Smartglasses Could Challenge The App Economy

Google’s AI Smartglasses Could Challenge The App Economy

20 May 2026
When Is the Next UFC? Date, Times and Full Schedule

When Is the Next UFC? Date, Times and Full Schedule

20 May 2026
As Doctor Shortage Rages On, Physician Assistant Pay Hits 0,000

As Doctor Shortage Rages On, Physician Assistant Pay Hits $140,000

20 May 2026
Here’s When The Series Finale Drops On Prime Video

Here’s When The Series Finale Drops On Prime Video

20 May 2026
NYT ‘Pips’ Hints, Answers And Walkthrough For Wednesday, May 20

NYT ‘Pips’ Hints, Answers And Walkthrough For Wednesday, May 20

20 May 2026
Don't Miss
Unwrap Christmas Sustainably: How To Handle Gifts You Don’t Want

Unwrap Christmas Sustainably: How To Handle Gifts You Don’t Want

By Press Room27 December 2024

Every year, millions of people unwrap Christmas gifts that they do not love, need, or…

Exclusive: DeFi platform Azura launches after raising .9 million from Initialized

Exclusive: DeFi platform Azura launches after raising $6.9 million from Initialized

22 October 2024
Walmart dominated, while Target spiraled: the winners and losers of retail in 2024

Walmart dominated, while Target spiraled: the winners and losers of retail in 2024

30 December 2024
Stay In Touch
  • Facebook
  • Twitter
  • Pinterest
  • Instagram
  • YouTube
  • Vimeo
Latest Articles
As Doctor Shortage Rages On, Physician Assistant Pay Hits 0,000

As Doctor Shortage Rages On, Physician Assistant Pay Hits $140,000

20 May 20262 Views
We found the real reason 70% of transformations fail

We found the real reason 70% of transformations fail

20 May 20262 Views
Here’s When The Series Finale Drops On Prime Video

Here’s When The Series Finale Drops On Prime Video

20 May 20260 Views
The 30-year yield hasn’t been this high since the Great Recession. Do the bond vigilantes ride again?

The 30-year yield hasn’t been this high since the Great Recession. Do the bond vigilantes ride again?

20 May 20261 Views

Recent Posts

  • Why AI Literacy Has Become A Boardroom And Investor Priority
  • Google’s AI Smartglasses Could Challenge The App Economy
  • When Is the Next UFC? Date, Times and Full Schedule
  • Mamdani’s New York is coming to tax your private jet. Here’s how to prepare
  • As Doctor Shortage Rages On, Physician Assistant Pay Hits $140,000

Recent Comments

No comments to show.
About Us
About Us

Alpha Leaders is your one-stop website for the latest Entrepreneurs and Leaders news and updates, follow us now to get the news that matters to you.

Facebook X (Twitter) Pinterest YouTube WhatsApp
Our Picks
Why AI Literacy Has Become A Boardroom And Investor Priority

Why AI Literacy Has Become A Boardroom And Investor Priority

20 May 2026
Google’s AI Smartglasses Could Challenge The App Economy

Google’s AI Smartglasses Could Challenge The App Economy

20 May 2026
When Is the Next UFC? Date, Times and Full Schedule

When Is the Next UFC? Date, Times and Full Schedule

20 May 2026
Most Popular
Mamdani’s New York is coming to tax your private jet. Here’s how to prepare

Mamdani’s New York is coming to tax your private jet. Here’s how to prepare

20 May 20262 Views
As Doctor Shortage Rages On, Physician Assistant Pay Hits 0,000

As Doctor Shortage Rages On, Physician Assistant Pay Hits $140,000

20 May 20262 Views
We found the real reason 70% of transformations fail

We found the real reason 70% of transformations fail

20 May 20262 Views

Archives

  • May 2026
  • April 2026
  • March 2026
  • February 2026
  • January 2026
  • December 2025
  • November 2025
  • October 2025
  • September 2025
  • August 2025
  • July 2025
  • June 2025
  • May 2025
  • April 2025
  • March 2025
  • February 2025
  • January 2025
  • December 2024
  • November 2024
  • October 2024
  • September 2024
  • August 2024
  • July 2024
  • June 2024
  • May 2024
  • April 2024
  • March 2024
  • February 2024
  • January 2024
  • December 2023
  • March 2022
  • January 2021
  • March 2020
  • January 2020

Categories

  • Blog
  • Business
  • Entrepreneurs
  • Global
  • Innovation
  • Leadership
  • Living
  • Money & Finance
  • News
  • Press Release
© 2026 Alpha Leaders. All Rights Reserved.
  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Type above and press Enter to search. Press Esc to cancel.