
Here’s why 100TB+ SSDs will play a huge role in ultra large language models in the near future


  • Kioxia reveals a new project called AiSAQ, which aims to replace RAM with SSDs for AI data processing
  • Bigger (read: 100TB+) SSDs could improve RAG at a lower cost than using memory only
  • No timeline has been given, but expect Kioxia’s rivals to offer similar tech

Large language models often generate plausible but factually incorrect outputs – in other words, they make stuff up. These “hallucinations” can damage reliability in information-critical tasks such as medical diagnosis, legal analysis, financial reporting, and scientific research.

Retrieval-Augmented Generation (RAG) mitigates this issue by integrating external data sources, so LLMs can access real-time information during generation. Grounding outputs in current data reduces errors and improves contextual accuracy. Implementing RAG effectively requires substantial memory and storage resources, particularly for large-scale vector data and indices. Traditionally, this data has been stored in DRAM, which is fast but both expensive and limited in capacity.
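To give a sense of why vector data strains DRAM, here is a rough back-of-the-envelope sketch. The corpus size, embedding dimension, and function names are hypothetical, chosen only for illustration; real deployments also carry index overhead that this ignores.

```python
def raw_vector_bytes(num_vectors: int, dim: int, bytes_per_value: int = 4) -> int:
    """Bytes needed to hold uncompressed embeddings (float32 by default)."""
    return num_vectors * dim * bytes_per_value


def to_terabytes(num_bytes: int) -> float:
    """Convert bytes to decimal terabytes (1 TB = 10**12 bytes)."""
    return num_bytes / 10**12


# Hypothetical corpus: one billion 768-dimensional float32 embeddings.
size = raw_vector_bytes(num_vectors=1_000_000_000, dim=768)
print(f"{to_terabytes(size):.3f} TB of raw vectors, before any index overhead")
```

Under these assumed numbers the raw vectors alone come to roughly 3 TB, which is far more DRAM than a typical server carries but well within the range of a single high-capacity SSD.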

ServeTheHome reports that at this year’s CES, Japanese memory giant Kioxia introduced AiSAQ (All-in-Storage Approximate Nearest Neighbor Search with Product Quantization) to address these challenges. AiSAQ uses high-capacity SSDs to store vector data and indices. Kioxia claims it significantly reduces DRAM usage compared to DiskANN, offering a more cost-effective and scalable approach for supporting large AI models.
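For readers unfamiliar with product quantization, the sketch below shows the general idea in plain Python: each vector is split into sub-vectors, each sub-vector is replaced by the index of its nearest centroid, and query distances are approximated from a small lookup table. This is a generic illustration, not Kioxia's AiSAQ code. The tiny codebook is hand-picked (a real system would typically train it, for example with k-means), and all function names are hypothetical.

```python
from typing import List, Sequence

Vector = Sequence[float]
Codebook = List[List[Vector]]  # codebook[m][c] is centroid c of subspace m


def split(vec: Vector, num_subspaces: int) -> List[Vector]:
    """Cut a vector into equally sized sub-vectors."""
    size = len(vec) // num_subspaces
    return [vec[i * size:(i + 1) * size] for i in range(num_subspaces)]


def sq_dist(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def encode(vec: Vector, codebook: Codebook) -> List[int]:
    """Replace each sub-vector with the index of its nearest centroid."""
    parts = split(vec, len(codebook))
    return [
        min(range(len(cents)), key=lambda c: sq_dist(part, cents[c]))
        for part, cents in zip(parts, codebook)
    ]


def distance_table(query: Vector, codebook: Codebook) -> List[List[float]]:
    """Distances from each query sub-vector to every centroid in its subspace."""
    parts = split(query, len(codebook))
    return [[sq_dist(part, cent) for cent in cents]
            for part, cents in zip(parts, codebook)]


def approx_sq_dist(table: List[List[float]], code: List[int]) -> float:
    """Approximate query-to-vector distance using only the stored code."""
    return sum(table[m][c] for m, c in enumerate(code))


def nearest(query: Vector, codes: List[List[int]], codebook: Codebook, k: int = 1) -> List[int]:
    """Indices of the k codes closest to the query (approximate)."""
    table = distance_table(query, codebook)
    ranked = sorted(range(len(codes)), key=lambda i: approx_sq_dist(table, codes[i]))
    return ranked[:k]


# Hand-picked toy codebook: 2 subspaces, 2 centroids each, for 4-dim vectors.
codebook = [[[0.0, 0.0], [1.0, 1.0]], [[0.0, 0.0], [1.0, 1.0]]]
vectors = [[0.1, 0.0, 0.9, 1.0], [1.0, 0.9, 0.1, 0.0], [0.0, 0.1, 0.0, 0.2]]
codes = [encode(v, codebook) for v in vectors]
print(codes)
print(nearest([0.9, 1.0, 0.0, 0.0], codes, codebook, k=2))
```

Each four-float vector shrinks to two small integers here, which is the source of the memory savings. The search runs on those codes alone, so results are approximate rather than exact.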

More accessible and cost-effective


Shifting to SSD-based storage allows for the handling of larger datasets without the high costs associated with extensive DRAM use.

Accessing data from SSDs may introduce some additional latency compared to DRAM, but the trade-off is lower system cost and improved scalability. That, in turn, can support better model performance and accuracy, as larger datasets provide a richer foundation for learning and inference.
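The storage-side access pattern can be sketched as follows: vectors sit in a flat binary file and a lookup reads only the record it needs, rather than loading the whole dataset into memory. This is a minimal illustration, not AiSAQ's actual on-disk layout. The record format, the fixed dimension, and the helper names are all assumptions, and a production system would likely batch reads and tune I/O far more carefully.

```python
import os
import struct
import tempfile

DIM = 4                               # hypothetical embedding dimension
RECORD = struct.Struct(f"<{DIM}f")    # one vector = DIM little-endian float32s


def write_vectors(path: str, vectors) -> None:
    """Store vectors back to back in a flat binary file."""
    with open(path, "wb") as f:
        for vec in vectors:
            f.write(RECORD.pack(*vec))


def read_vector(path: str, index: int) -> list:
    """Seek to one record and read just that vector from storage."""
    with open(path, "rb") as f:
        f.seek(index * RECORD.size)
        return list(RECORD.unpack(f.read(RECORD.size)))


with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "vectors.bin")
    write_vectors(path, [[0.0, 0.5, 1.0, 1.5], [2.0, 2.5, 3.0, 3.5]])
    print(read_vector(path, 1))
```

Each lookup costs a storage read instead of a memory access, which is where the extra latency comes from, but resident memory stays small no matter how large the file grows.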

By using high-capacity SSDs, AiSAQ addresses the storage demands of RAG while contributing to the broader goal of making advanced AI technologies more accessible and cost-effective. Kioxia hasn’t revealed when it plans to bring AiSAQ to market, but it’s safe to bet rivals like Micron and SK Hynix will have something similar in the works.

ServeTheHome concludes, “Everything is AI these days, and Kioxia is pushing this as well. Realistically, RAG is going to be an important part of many applications, and if there is an application that needs to access lots of data, but it is not used as frequently, this would be a great opportunity for something like Kioxia AiSAQ.”

More from TechRadar Pro

https://www.techradar.com/pro/heres-why-100tb-ssds-will-play-a-huge-role-in-ultra-large-language-models-in-the-near-future




