Self-Attentive Sequential Recommendation

Sequential dynamics are a key feature of many modern recommender systems, which seek to capture the 'context' of users' activities on the basis of actions they have performed recently. To capture such patterns, two approaches have proliferated: Markov Chains (MCs) and Recurrent Neural...

Full description

Saved in:
Bibliographic Details
Published inProceedings (IEEE International Conference on Data Mining) pp. 197 - 206
Main Authors Kang, Wang-Cheng, McAuley, Julian
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.11.2018
Subjects
Online AccessGet full text
ISSN2374-8486
DOI10.1109/ICDM.2018.00035

Cover

Loading…
Abstract Sequential dynamics are a key feature of many modern recommender systems, which seek to capture the 'context' of users' activities on the basis of actions they have performed recently. To capture such patterns, two approaches have proliferated: Markov Chains (MCs) and Recurrent Neural Networks (RNNs). Markov Chains assume that a user's next action can be predicted on the basis of just their last (or last few) actions, while RNNs in principle allow for longer-term semantics to be uncovered. Generally speaking, MC-based methods perform best in extremely sparse datasets, where model parsimony is critical, while RNNs perform better in denser datasets where higher model complexity is affordable. The goal of our work is to balance these two goals, by proposing a self-attention based sequential model (SASRec) that allows us to capture long-term semantics (like an RNN), but, using an attention mechanism, makes its predictions based on relatively few actions (like an MC). At each time step, SASRec seeks to identify which items are 'relevant' from a user's action history, and use them to predict the next item. Extensive empirical studies show that our method outperforms various state-of-the-art sequential models (including MC/CNN/RNN-based approaches) on both sparse and dense datasets. Moreover, the model is an order of magnitude more efficient than comparable CNN/RNN-based models. Visualizations on attention weights also show how our model adaptively handles datasets with various density, and uncovers meaningful patterns in activity sequences.
AbstractList Sequential dynamics are a key feature of many modern recommender systems, which seek to capture the 'context' of users' activities on the basis of actions they have performed recently. To capture such patterns, two approaches have proliferated: Markov Chains (MCs) and Recurrent Neural Networks (RNNs). Markov Chains assume that a user's next action can be predicted on the basis of just their last (or last few) actions, while RNNs in principle allow for longer-term semantics to be uncovered. Generally speaking, MC-based methods perform best in extremely sparse datasets, where model parsimony is critical, while RNNs perform better in denser datasets where higher model complexity is affordable. The goal of our work is to balance these two goals, by proposing a self-attention based sequential model (SASRec) that allows us to capture long-term semantics (like an RNN), but, using an attention mechanism, makes its predictions based on relatively few actions (like an MC). At each time step, SASRec seeks to identify which items are 'relevant' from a user's action history, and use them to predict the next item. Extensive empirical studies show that our method outperforms various state-of-the-art sequential models (including MC/CNN/RNN-based approaches) on both sparse and dense datasets. Moreover, the model is an order of magnitude more efficient than comparable CNN/RNN-based models. Visualizations on attention weights also show how our model adaptively handles datasets with various density, and uncovers meaningful patterns in activity sequences.
Author McAuley, Julian
Kang, Wang-Cheng
Author_xml – sequence: 1
  givenname: Wang-Cheng
  surname: Kang
  fullname: Kang, Wang-Cheng
  email: wckang@ucsd.edu
  organization: UC San Diego, San Diego, CA, USA
– sequence: 2
  givenname: Julian
  surname: McAuley
  fullname: McAuley, Julian
  email: jrncauley@ucsd.edu
  organization: UC San Diego, San Diego, CA, USA
BookMark eNotjLtOw0AQAA8EEnGgpqBJSWNnz7f32DIyr0iJkAjU0fq8loz8gNgg8feAoJopRpOok37oRalLDZnWQMt1cbPNctAhAwBjj1SirQmOtCU4VrPceEwDBnemknF8_UmcMzBT1ztp63Q1TdJPzacsdvL-8avcLp4kDl0nfcVTM_Tn6rTmdpSLf87Vy93tc_GQbh7v18Vqkzba2yl1Ab2xVcyJyMc6p9p658tgywpJe6oAfcCSOXLJgNE4qtlBxMiSR0YzV1d_30ZE9m-HpuPD1z5YwoBovgHbtEHF
CODEN IEEPAD
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/ICDM.2018.00035
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISBN 1538691590
9781538691595
EISSN 2374-8486
EndPage 206
ExternalDocumentID 8594844
Genre orig-research
GroupedDBID 29O
6IE
6IF
6IH
6IK
6IL
6IN
AAJGR
AAWTH
ABLEC
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
IPLJI
M43
OCL
RIE
RIL
RNS
ID FETCH-LOGICAL-i175t-684735dc29997cf29f5767b85bd49179d04784baacaba04c369fa60c4cae2ca43
IEDL.DBID RIE
IngestDate Wed Aug 27 02:49:58 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i175t-684735dc29997cf29f5767b85bd49179d04784baacaba04c369fa60c4cae2ca43
PageCount 10
ParticipantIDs ieee_primary_8594844
PublicationCentury 2000
PublicationDate 2018-Nov
PublicationDateYYYYMMDD 2018-11-01
PublicationDate_xml – month: 11
  year: 2018
  text: 2018-Nov
PublicationDecade 2010
PublicationTitle Proceedings (IEEE International Conference on Data Mining)
PublicationTitleAbbrev ICDM
PublicationYear 2018
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0036630
Score 2.3780503
Snippet Sequential dynamics are a key feature of many modern recommender systems, which seek to capture the 'context' of users' activities on the basis of actions they...
SourceID ieee
SourceType Publisher
StartPage 197
SubjectTerms Adaptation models
Collaborative Filtering
Context modeling
Markov processes
Predictive models
Recommender systems
Recurrent neural networks
Sequential Recommendation
Task analysis
Title Self-Attentive Sequential Recommendation
URI https://ieeexplore.ieee.org/document/8594844
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV09T8MwED21nZgKtIhvZWBgwKnb2IkzokJVkIqQSqVulX2xJURpEUoY-PX4krQgxMBmefCn7Kez370HcOFiabjKJNOICRNORlSSTAhtnTMxd45-dCcP8Xgm7udy3oCrbS6MtbYkn9mQiuVffrbGgp7Keoq0RYRoQtMHblWu1ubWjTxy8lq6p8_T3t3wZkLELWJKcvJy--GdUkLHqA2TTacVY-QlLHIT4ucvPcb_jmoXut9JesHjFn72oGFX-9DeuDQE9aHtwOXULh27znPiBX3YYFqSp_3BXgYUe7761itfpS7MRrdPwzGr_RHYswf9nMWKfIMz9IiSJugGqfPBQ2KUNJnwUViakfKOMFqjNpoLjOLU6ZijQG0HqEV0AK3VemUPIUAtEyucch7RSa5HJX5BB31hUoVonDyCDk188VZJYCzqOR__XX0CO7T0VcreKbTy98KeeezOzXm5aV-Zg5qV
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PT8IwGP2CeNATKhh_u4MHDw4Ka7f2aFACyogJkHAjbdcmRhzGDA_-9fbbBhrjwVvTQ3-mffna970HcGVDpghPmC-1jnxqWYAl5lMqjbUqJNbij248CvtT-jBjswrcbHJhjDE5-cw0sZj_5SdLvcKnshZHbRFKt2Db4T4VRbbW-t4NHHaSUrynTURr0L2LkbqFXEmCbm4_3FNy8OjVIF53W3BGXpqrTDX15y9Fxv-Oaw8a32l63tMGgPahYtIDqK19Grzy2NbhemwW1r_NMmQGfRhvnNOn3dFeeBh9vrrWC2elBkx795Nu3y8dEvxnB_uZH3J0Dk60wxQRadsR1oUPkeJMJdTFYSJB7R2qpNRSSUJ1EAorQ6KplqajJQ0OoZouU3MEnpYsMtRy6zAdBXt45Ba006ZKcK2VZcdQx4nP3woRjHk555O_qy9hpz-Jh_PhYPR4Cru4DUUC3xlUs_eVOXdInqmLfAO_AOrxneU
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=proceeding&rft.title=Proceedings+%28IEEE+International+Conference+on+Data+Mining%29&rft.atitle=Self-Attentive+Sequential+Recommendation&rft.au=Kang%2C+Wang-Cheng&rft.au=McAuley%2C+Julian&rft.date=2018-11-01&rft.pub=IEEE&rft.eissn=2374-8486&rft.spage=197&rft.epage=206&rft_id=info:doi/10.1109%2FICDM.2018.00035&rft.externalDocID=8594844