Abstract
This paper describes how media processing programs may be accelerated by using the multimedia instruction extensions that have been added to general-purpose microprocessors. As a concrete example, it describes MAX2, a minimalist, second-gene ration set of multimedia instructions included in the PA-RISC 2.0 processor architecture. MAX2 implements subword parallel instructions, which utilize the microprocessor's 64-bit wide datapaths to process multiple pieces of lower-precision data in parallel It also includes innovative, new instructions like Mix, which are very useful for matrix transpose and other common data rearrangements. The paper examines some typical multimedia kernels, like Block Match, Matrix Transpose, Box Filter and the IDCT, coded with and without the MAX2 instructions, to illustrate programming techniques for exploiting subword parallelism and superscalar instruction parallelism. The kernels using MAX2 show significant speedups in execution time, and more efficient utilization of the processor's resources.
Original language | English (US) |
---|---|
Pages (from-to) | 122-133 |
Number of pages | 12 |
Journal | Proceedings of SPIE - The International Society for Optical Engineering |
Volume | 3021 |
DOIs | |
State | Published - Jan 17 1997 |
Event | Multimedia Hardware Architectures 1997 - San Jose, United States Duration: Feb 8 1997 → Feb 14 1997 |
All Science Journal Classification (ASJC) codes
- Electronic, Optical and Magnetic Materials
- Condensed Matter Physics
- Computer Science Applications
- Applied Mathematics
- Electrical and Electronic Engineering
Keywords
- Code optimizations
- MAX2
- Media processing
- Multimedia extensions
- PA-RISC
- Packed arithmetic
- SIMD
- Subword parallelism