Researchers upend AI status quo by eliminating matrix multiplication in LLMs

Illustration of a brain inside of a light bulb. (credit: Getty Images)

Researchers claim to have developed a new way to run AI language models more efficiently by eliminating matrix multiplication from the process. The approach fundamentally redesigns neural network operations that are currently accelerated by GPU chips. The findings, detailed in a recent preprint paper from researchers at the University of California, Santa Cruz, UC Davis, LuxiTech, and Soochow University, could have deep implications for the environmental impact and operational costs of AI systems.

Matrix multiplication (often abbreviated to “MatMul”) is at the center of most neural network computational tasks today, and GPUs are particularly good at executing the math quickly because they can perform large numbers of multiplication operations in parallel. That ability momentarily made Nvidia the most valuable company in the world last week; the company currently holds an estimated 98 percent market share for data center GPUs, which are commonly used to power AI systems like ChatGPT and Google Gemini.
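To make the scale of that work concrete, here is a minimal Python sketch of a naive matrix multiplication that also counts its scalar multiplications. The function name `matmul_count` and the tiny example matrices are hypothetical, chosen only for illustration; real frameworks use heavily optimized GPU kernels rather than loops like these.

```python
def matmul_count(a, b):
    """Naive product of a (m x k) and b (k x n); also counts scalar multiplies."""
    m, k, n = len(a), len(b), len(b[0])
    assert all(len(row) == k for row in a), "inner dimensions must match"
    out = [[0.0] * n for _ in range(m)]
    multiplies = 0
    for i in range(m):
        for j in range(n):
            # Each output cell depends only on one row of a and one column
            # of b, so cells can be computed independently (in parallel).
            acc = 0.0
            for p in range(k):
                acc += a[i][p] * b[p][j]
                multiplies += 1
            out[i][j] = acc
    return out, multiplies


x = [[1.0, 2.0, 3.0]]                      # one input vector, 3 features
w = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]   # hypothetical 3 x 2 weight matrix
y, n_mul = matmul_count(x, w)
print(y, n_mul)  # [[4.0, 5.0]] 6
```

The multiplication count is m × k × n, so it grows quickly with layer size. Because every output cell is independent, the work maps well onto hardware that runs many multiplications at once.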

In the new paper, titled “Scalable MatMul-free Language Modeling,” the researchers describe creating a custom 2.7-billion-parameter model without using MatMul that performs similarly to conventional large language models (LLMs). They also demonstrate running a 1.3-billion-parameter model at 23.8 tokens per second on a GPU that was accelerated by a custom-programmed FPGA chip that uses about 13 watts of power (not counting the GPU’s power draw). The implication is that a more efficient FPGA “paves the way for the development of more efficient and hardware-friendly architectures,” they write.
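The text above does not say how the multiplications are removed. One way to get the effect, assumed here purely for illustration, is to restrict each weight to -1, 0, or +1, so that every "multiply" becomes an addition, a subtraction, or a skip. The sketch below is not the authors' implementation, and `ternary_matvec` is a hypothetical name.

```python
def ternary_matvec(weights, x):
    """Matrix-vector product using only add/subtract.

    Assumes every weight is -1, 0, or +1 (an illustrative assumption).
    """
    out = []
    for row in weights:
        acc = 0.0
        for w, xi in zip(row, x):
            if w == 1:
                acc += xi      # +1 weight: add the input
            elif w == -1:
                acc -= xi      # -1 weight: subtract the input
            elif w != 0:
                raise ValueError("weights must be -1, 0, or +1")
            # 0 weight: contributes nothing, so it is skipped
        out.append(acc)
    return out


w = [[1, 0, -1], [-1, 1, 1]]   # hypothetical ternary weight matrix
x = [0.5, 2.0, 1.5]
result = ternary_matvec(w, x)
print(result)  # [-1.0, 3.0]
```

Additions and subtractions are much cheaper to build in hardware than multipliers. That is one plausible reason a design like this could suit a low-power FPGA.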

https://arstechnica.com/?p=2033314




