Chip Architectures Under Advanced Computing Sanctions

August Ning, David Wentzlaff

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The rise of large scale machine learning models has generated unprecedented requirements and demand on computing hardware to enable these trillion parameter models. However, the importance of these bleeding-edge chips to the global economy, technological advancement, and strategic national interests have made them targets of sanctions. Recent advanced computing sanctions set limits on a device's Total Processing Performance, device bandwidth, and Performance Density and placed export controls on flagship data center and consumer products. In this work, we present the first study on the architectural and economic externality implications of these advanced computing sanctions and their effects on large language model (LLM) inference. We identify which architectural parameters are limited under existing regulations, and perform thorough design space exploration of compliant designs. Optimized designs are able to improve LLM inference prefill performance by 4% and decoding performance by 27% compared to a restricted device baseline. We then demonstrate how an architecture-first approach for computing policies allows chip designers and policymakers to craft efficient guidelines that achieve desired goals while minimizing negative externalities. We show how architectural features can unify marketing-based data center vs. non-data center regulations and how policies can be specified to create gaming-focused device architectures which are inherently limited in AI performance. Augmenting existing performance metrics with insightful architectural constraints better predict workload performance. Combined metrics achieved up to 42.4x narrower distributions compared to using theoretical compute performance alone, enable targeted and efficient policies.

Original languageEnglish (US)
Title of host publicationISCA 2025 - Proceedings of the 52nd Annual International Symposium on Computer Architecture
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1225-1239
Number of pages15
ISBN (Electronic)9798400712616
DOIs
StatePublished - Jun 21 2025
Event52nd Annual International Symposium on Computer Architecture, ISCA 2025 - Tokyo, Japan
Duration: Jun 21 2025Jun 25 2025

Publication series

NameProceedings - International Symposium on Computer Architecture
ISSN (Print)1063-6897
ISSN (Electronic)2575-713X

Conference

Conference52nd Annual International Symposium on Computer Architecture, ISCA 2025
Country/TerritoryJapan
CityTokyo
Period6/21/256/25/25

All Science Journal Classification (ASJC) codes

  • Hardware and Architecture

Keywords

  • Advanced Computing Sanctions
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Chip Architectures Under Advanced Computing Sanctions'. Together they form a unique fingerprint.

Cite this