Abstract
MicroSIMD architectures incorporating subword parallelism are very efficient for application-specific media processors as well as for fast multimedia information processing in general-purpose processors. This paper addresses the unsolved problem of the need to permute the subwords packed in registers for maximum parallelism performance, especially for two-dimensional (2-D) multimedia algorithms. We propose a new systematic approach for identifying the fundamental data rearrangement needs in current and future 2-D pixel processing programs based on the hierarchical decomposition of frames and objects into atomic 2-D structures. We define new subword permutation instructions, Check, Excheck, Exchange, and Permset, that achieve these data rearrangements across multiple registers. We also define an alphabet of subword permutation primitives, including these new instructions and the Mix instruction defined for PA-RISC MAX-2 and IA-64, which supports the data rearrangement needs of 2-D frames and objects. We show the sufficiency and efficiency of this alphabet for achieving all possible permutations of hierarchical 2-D blocks.
Original language | English (US) |
---|---|
Pages | 3-14 |
Number of pages | 12 |
State | Published - 2000 |
Event | 2000 IEEE International Conference on Application-Specific Systems, Architectures, and Processors - Boston, MA, USA Duration: Jul 10 2000 → Jul 12 2000 |
Other
Other | 2000 IEEE International Conference on Application-Specific Systems, Architectures, and Processors |
---|---|
City | Boston, MA, USA |
Period | 7/10/00 → 7/12/00 |
All Science Journal Classification (ASJC) codes
- Hardware and Architecture
- Computer Networks and Communications