VECTOR FRIENDLY INSTRUCTION FORMAT AND EXECUTION THEREOF

A vector friendly instruction format and execution thereof. According to one embodiment of the invention, a processor is to execute an instruction set. The instruction set includes a first instruction format, wherein the first instruction format includes a first plurality of templates, wherein the f...

Full description

Saved in:
Bibliographic Details
Main Authors OULD-AHMED-VALL, ELMOUSTAPHA, SAMUDRALA, SRIDHAR, VALENTINE, ROBERT C, GIRKAR, MILIND BABURAO, TOLL, BRET L, SANS, ROGER ESPASA, SAN ADRIAN, JESUS CORBAL, ABRAHAM, SETH, FORSYTH, ANDREW THOMAS, HALL, JONATHAN CANNON, DURAN, SANTIAGO GALAN, WIEDEMEIER, JEFFREY G, ABEL, JAMES C, BRADFORD, DENNIS R, CAVIN, ROBERT D, YOUNT, CHARLES, WU, LISA, GROCHOWSKI, EDWARD THOMAS, SAIR, SULEYMAN, CHARNEY, MARK
Format Patent
LanguageEnglish
Polish
Published 26.06.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A vector friendly instruction format and execution thereof. According to one embodiment of the invention, a processor is to execute an instruction set. The instruction set includes a first instruction format, wherein the first instruction format includes a first plurality of templates, wherein the first instruction format has a plurality of fields including a base operation field, a data element width field, and a write mask field, wherein the first instruction format supports, through different values in the base operation field, specification of different vector operations, wherein each of the vector operations is to generate a destination vector operand including a plurality of data elements at different data element positions, wherein the first instruction format supports, through different values in the data element width field, specification of different data element widths, wherein the base operation field, the data element width field, and the write mask field may each store only one value on each occurrence of an instruction in the first instruction format in instruction streams. The processor includes a decode unit to decode the occurrences of the instructions in the first plurality of templates, including to: distinguish, for each of the occurrences, which one of the data element widths to use based on a value in the data element width field; and distinguish, for each of the occurrences, the data elements resulting from the occurrence's vector operation to be reflected in the destination vector operand's corresponding data element positions based on the write mask field's content and the data element width for the occurrence. Different values that may be stored in the write mask field distinguish different write mask registers, of a set of write mask registers, that are to store configurable write masks. The data element width for the occurrence distinguishes which of the data element positions of the destination vector operand correspond with which bits of the configurable write masks.
Bibliography:Application Number: PL20180177235T