Method for training strategy model and method and device for determining advertisement putting strategy
The embodiment of the invention provides a method for training a strategy model and a method and device for determining an advertisement putting strategy. The method for training the strategy model comprises the steps that sample information and network parameters related to advertisement putting ar...
Saved in:
Main Author | |
---|---|
Format | Patent |
Language | Chinese English |
Published |
04.09.2020
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | The embodiment of the invention provides a method for training a strategy model and a method and device for determining an advertisement putting strategy. The method for training the strategy model comprises the steps that sample information and network parameters related to advertisement putting are acquired; calculating a loss value of the evaluation network according to the state of the first moment, the state of the second moment, the advertisement putting strategy of the first moment, the reward value of the first moment and the first network parameter; updating the first network parameter by using the loss value of the evaluation network to obtain a third network parameter; calculating the gradient of the second network parameter according to the state of the first moment, the advertisement putting strategy of the first moment, the second network parameter and the third network parameter; and updating the second network parameter according to the gradient of the second network parameter. According to th |
---|---|
AbstractList | The embodiment of the invention provides a method for training a strategy model and a method and device for determining an advertisement putting strategy. The method for training the strategy model comprises the steps that sample information and network parameters related to advertisement putting are acquired; calculating a loss value of the evaluation network according to the state of the first moment, the state of the second moment, the advertisement putting strategy of the first moment, the reward value of the first moment and the first network parameter; updating the first network parameter by using the loss value of the evaluation network to obtain a third network parameter; calculating the gradient of the second network parameter according to the state of the first moment, the advertisement putting strategy of the first moment, the second network parameter and the third network parameter; and updating the second network parameter according to the gradient of the second network parameter. According to th |
Author | ZHOU PENGCHENG |
Author_xml | – fullname: ZHOU PENGCHENG |
BookMark | eNqNjDsOwjAQBV1Awe8OywEoApJTowhEAxV9ZMUvwVK8tuwlErfnEwpKqjfFzJurCQfGTHVnyC1YakMiScax447yiwTdg3yw6MmwJT9qb7QYXINPYSFIfoyMHZDEZXiwULyL_F4t1bQ1fcbquwu1Ph6u1WmDGGrkaBowpK4uRVHorS5Lvd_94zwBUXZCaw |
ContentType | Patent |
DBID | EVB |
DatabaseName | esp@cenet |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: EVB name: esp@cenet url: http://worldwide.espacenet.com/singleLineSearch?locale=en_EP sourceTypes: Open Access Repository |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Medicine Chemistry Sciences Physics |
DocumentTitleAlternate | 训练策略模型的方法、确定广告投放策略的方法和装置 |
ExternalDocumentID | CN111626776A |
GroupedDBID | EVB |
ID | FETCH-epo_espacenet_CN111626776A3 |
IEDL.DBID | EVB |
IngestDate | Fri Jul 19 13:07:09 EDT 2024 |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | Chinese English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-epo_espacenet_CN111626776A3 |
Notes | Application Number: CN202010446815 |
OpenAccessLink | https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20200904&DB=EPODOC&CC=CN&NR=111626776A |
ParticipantIDs | epo_espacenet_CN111626776A |
PublicationCentury | 2000 |
PublicationDate | 20200904 |
PublicationDateYYYYMMDD | 2020-09-04 |
PublicationDate_xml | – month: 09 year: 2020 text: 20200904 day: 04 |
PublicationDecade | 2020 |
PublicationYear | 2020 |
RelatedCompanies | CHUANGXIN QIZHI (XI'AN) TECHNOLOGY CO., LTD |
RelatedCompanies_xml | – name: CHUANGXIN QIZHI (XI'AN) TECHNOLOGY CO., LTD |
Score | 3.4102945 |
Snippet | The embodiment of the invention provides a method for training a strategy model and a method and device for determining an advertisement putting strategy. The... |
SourceID | epo |
SourceType | Open Access Repository |
SubjectTerms | CALCULATING COMPUTING COUNTING DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FORADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORYOR FORECASTING PURPOSES ELECTRIC DIGITAL DATA PROCESSING HANDLING RECORD CARRIERS PHYSICS PRESENTATION OF DATA RECOGNITION OF DATA RECORD CARRIERS SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE,COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTINGPURPOSES, NOT OTHERWISE PROVIDED FOR |
Title | Method for training strategy model and method and device for determining advertisement putting strategy |
URI | https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20200904&DB=EPODOC&locale=&CC=CN&NR=111626776A |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3fS8MwED7m_PmmU9H5gwiyt6Lr2s4-FHFpxxDWDZmyt5E0ydSHbtgO0b_eS9ZuvuhbScnRXvlyl-uX-wCumRKKK84tJhWznKQpLO6rpqV8HTBsgSmCrkP2Y6_37DyO3XEF3suzMKZP6KdpjoiIShDvuVmv5-siVmi4ldkNf8Oh2X13FISNYnesS_340cNOEA0H4YA2KA1o3IifAoQ0pu7ttvewAZuYRrc1GqKXjj6VMv8dUrr7sDVEa2l-AJXv1xrs0lJ5rQY7_eKHdw22DUMzyXCwQGF2CNO-kX0mmG-SUuKBZMs2s1_EaNsQlgqyVIc2l0LqFcHMEAUBRk9ipRyzrhGS-cKQoFemjuCqG41oz8Jnn6wcNaHx-jVbx1BNZ6k8ASKVzx0u7pTkzPE8l9kC98SumzBhy1veOoX633bq_908gz3tdMO5cs6hmn8s5AUG6ZxfGu_-AB3xmk8 |
link.rule.ids | 230,309,783,888,25576,76876 |
linkProvider | European Patent Office |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1bT8IwFD5BvOCbokbxVhPD26LANuRhMdJBUNkgBg1vpF1bLw-DuBGjv97TsqEv-rZ06cnW5us5Pf16PoBzpoTiinOLScUsO6oJi7dUzVIt7TDqAkMEnYcMQrf3aN-NnXEB3vK7MKZO6IcpjoiIihDvqVmvZz9JLN9wK5ML_opN0-vuyPOr2e5Yp_px0v221xkO_AGtUurRsBo-eAhpDN2bTfdmBVYxxG5qNHSe2vpWyuy3S-luwdoQrcXpNhS-XspQornyWhk2guzAuwzrhqEZJdiYoTDZgefAyD4TjDdJLvFAkkWZ2U9itG0IiwVZqEObRyH1imB6iIwAozuxXI5Z5wjJbG5I0EtTu3DW7Yxoz8JvnywHakLDn99s7EExnsZyH4hULW5zcaUkZ7brOqwucE_sOBETdXnJGwdQ-dtO5b-Xp1DqjYL-pH8b3h_Cpp4Aw7-yj6CYvs_lMTrslJ-Ykf4Gh96dQg |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Apatent&rft.title=Method+for+training+strategy+model+and+method+and+device+for+determining+advertisement+putting+strategy&rft.inventor=ZHOU+PENGCHENG&rft.date=2020-09-04&rft.externalDBID=A&rft.externalDocID=CN111626776A |