VECTOR FRIENDLY INSTRUCTION FORMAT AND EXECUTION THEREOF

Bibliographic Details
Main Authors CHARNEY MARK, YOUNT CHARLES, OULD AHMED VALL ELMOUSTAPHA, WIEDEMEIER JEFFREY G, SANS ROGER ESPASA, VALENTINE ROBERT C, ABRAHAM SETH, DURAN SANTIAGO GALAN, WU LISA, ABEL JAMES C, SAIR SULEYMAN, GROCHOWSKI EDWARD THOMAS, SAMUDRALA SRIDHAR, FORSYTH ANDREW THOMAS, HALL JONATHAN CANNON, CAVIN ROBERT D, GIRKAR MILIND BABURAO, BRADFORD DENNIS R, SAN ADRIAN JESUS CORBAL, TOLL BRET L
Format Patent
Language English, Korean
Published 17.12.2013
Subjects
More Information
Summary: A vector friendly instruction format and execution thereof. According to one embodiment of the invention, a processor is to execute an instruction set. The instruction set includes a first instruction format, wherein the first instruction format includes a first plurality of templates, wherein the first instruction format has a plurality of fields including a base operation field, a data element width field, and a write mask field, wherein the first instruction format supports, through different values in the base operation field, specification of different vector operations, wherein each of the vector operations is to generate a destination vector operand including a plurality of data elements at different data element positions, wherein the first instruction format supports, through different values in the data element width field, specification of different data element widths, wherein the base operation field, the data element width field, and the write mask field may each store only one value on each occurrence of an instruction in the first instruction format in instruction streams.

The processor includes a decode unit to decode the occurrences of the instructions in the first plurality of templates, including to: distinguish, for each of the occurrences, which one of the data element widths to use based on a value in the data element width field; and distinguish, for each of the occurrences, the data elements resulting from the occurrence's vector operation to be reflected in the destination vector operand's corresponding data element positions based on the write mask field's content and the data element width for the occurrence.

Different values that may be stored in the write mask field distinguish different write mask registers, of a set of write mask registers, that are to store configurable write masks. The data element width for the occurrence distinguishes which of the data element positions of the destination vector operand correspond with which bits of the configurable write masks.
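
The relationship between the data element width and the write mask bits can be illustrated with a small software model. The following is only a sketch under stated assumptions, not the patented hardware design: the 512-bit default vector length, the mapping of mask bit i to data element position i, the choice to leave unselected destination elements unchanged, and all function names (`element_count`, `apply_write_mask`, `masked_add`) are hypothetical and are not taken from the record above.

```python
# Illustrative model only. All names and the vector length are assumptions.
VECTOR_BITS = 512  # assumed vector length; the summary does not fix one


def element_count(elem_width_bits, vector_bits=VECTOR_BITS):
    """Number of data element positions for a given data element width."""
    if elem_width_bits <= 0 or vector_bits % elem_width_bits != 0:
        raise ValueError("element width must evenly divide the vector length")
    return vector_bits // elem_width_bits


def apply_write_mask(dest, result, mask, elem_width_bits, vector_bits=VECTOR_BITS):
    """Reflect result elements into dest only where the mask bit is set.

    Assumption: mask bit i controls data element position i, so a wider
    element width means fewer positions and fewer mask bits consulted.
    Unselected positions keep their previous destination value.
    """
    n = element_count(elem_width_bits, vector_bits)
    if len(dest) != n or len(result) != n:
        raise ValueError("operand length does not match the element count")
    return [result[i] if (mask >> i) & 1 else dest[i] for i in range(n)]


def masked_add(dest, a, b, mask, elem_width_bits, vector_bits=VECTOR_BITS):
    """Example vector operation: element-wise add, wrapping at the element width."""
    n = element_count(elem_width_bits, vector_bits)
    if len(a) != n or len(b) != n:
        raise ValueError("operand length does not match the element count")
    modulus = 1 << elem_width_bits
    result = [(x + y) % modulus for x, y in zip(a, b)]
    return apply_write_mask(dest, result, mask, elem_width_bits, vector_bits)


if __name__ == "__main__":
    # 128-bit vector of 32-bit elements: four positions, mask bits 0..3 consulted.
    print(masked_add([0, 0, 0, 0], [1, 2, 3, 4], [10, 20, 30, 40], 0b0101, 32, 128))
    # Same mask value with 64-bit elements: two positions, only bits 0..1 consulted.
    print(masked_add([0, 0], [1, 2], [10, 20], 0b0101, 64, 128))
```

In this model the same mask value selects different data element positions depending on the element width, which is the correspondence the summary describes: with 32-bit elements four mask bits are consulted, while with 64-bit elements only the two lowest bits are.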
Bibliography: Application Number: KR20137029045