Visual attention guided bit allocation in video compression

A visual attention-based bit allocation strategy for video compression is proposed. Saliency-based attention prediction is used to detect interesting regions in video. From the top salient locations from the computed saliency map, a guidance map is generated to guide the bit allocation strategy thro...

Full description

Saved in:
Bibliographic Details
Published inImage and vision computing Vol. 29; no. 1; pp. 1 - 14
Main Authors Li, Zhicheng, Qin, Shiyin, Itti, Laurent
Format Journal Article
LanguageEnglish
Published Elsevier B.V 2011
Subjects
Online AccessGet full text

Cover

Loading…
Abstract A visual attention-based bit allocation strategy for video compression is proposed. Saliency-based attention prediction is used to detect interesting regions in video. From the top salient locations from the computed saliency map, a guidance map is generated to guide the bit allocation strategy through a new constrained global optimization approach, which can be solved in a closed form and independently of video frame content. Fifty video sequences (300 frames each) and eye-tracking data from 14 subjects were collected to evaluate both the accuracy of the attention prediction model and the subjective quality of the encoded video. Results show that the area under the curve of the guidance map is 0.773 ± 0.002, significantly above chance (0.500). Using a new eye-tracking-weighted PSNR (EWPSNR) measure of subjective quality, more than 90% of the encoded video clips with the proposed method achieve better subjective quality compared to standard encoding with matched bit rate. The improvement in EWPSNR is up to over 2 dB and on average 0.79 dB.
AbstractList A visual attention-based bit allocation strategy for video compression is proposed. Saliency-based attention prediction is used to detect interesting regions in video. From the top salient locations from the computed saliency map, a guidance map is generated to guide the bit allocation strategy through a new constrained global optimization approach, which can be solved in a closed form and independently of video frame content. Fifty video sequences (300 frames each) and eye-tracking data from 14 subjects were collected to evaluate both the accuracy of the attention prediction model and the subjective quality of the encoded video. Results show that the area under the curve of the guidance map is 0.773 ± 0.002, significantly above chance (0.500). Using a new eye-tracking-weighted PSNR (EWPSNR) measure of subjective quality, more than 90% of the encoded video clips with the proposed method achieve better subjective quality compared to standard encoding with matched bit rate. The improvement in EWPSNR is up to over 2 dB and on average 0.79 dB.
Author Qin, Shiyin
Li, Zhicheng
Itti, Laurent
Author_xml – sequence: 1
  givenname: Zhicheng
  surname: Li
  fullname: Li, Zhicheng
  organization: School of Automation Science and Electrical Engineering, Beihang University, Beijing, China
– sequence: 2
  givenname: Shiyin
  surname: Qin
  fullname: Qin, Shiyin
  organization: School of Automation Science and Electrical Engineering, Beihang University, Beijing, China
– sequence: 3
  givenname: Laurent
  surname: Itti
  fullname: Itti, Laurent
  email: itti@pollux.usc.edu
  organization: Computer Science Department, University of Southern California, Los Angeles, CA, USA
BookMark eNqFkN1KAzEQhYMo2FbfwIt9gV0n-5esgiDFPyh4o96G2WQqKdtNSdKCb2_aeuWFXs3wcc5h5kzZ6ehGYuyKQ8GBt9erwq5xZ0NRQkIgCgB-wiZcijKXvJKnbAJlm3bZtOdsGsIKAASIbsJuP2zY4pBhjDRG68bsc2sNmay3McNhcBoP1I7ZLnGXabfeeAohwQt2tsQh0OXPnLH3x4e3-XO-eH16md8vcl1BG_NSCkSjO4285rLp-5ZAGm60xLoT1DW6KWUjsOw7wj6plkIgJNCappN1Vc3YzTFXexeCp6XSNh7Oih7toDiofQ1qpY41qH0NCoRKNSRz_cu88Unmv_6z3R1tlB7bWfIqaEujJmM96aiMs38HfAOdLHx5
CitedBy_id crossref_primary_10_1016_j_patcog_2018_02_004
crossref_primary_10_1109_TBC_2018_2795459
crossref_primary_10_1016_j_image_2015_04_011
crossref_primary_10_1109_ACCESS_2019_2960807
crossref_primary_10_1108_AEAT_10_2012_0164
crossref_primary_10_1007_s11042_020_08686_z
crossref_primary_10_1371_journal_pone_0150673
crossref_primary_10_1080_02564602_2016_1231023
crossref_primary_10_4304_jsw_7_11_2591_2598
crossref_primary_10_1007_s10514_018_9752_3
crossref_primary_10_1109_ACCESS_2023_3286577
crossref_primary_10_1007_s11042_015_2465_0
crossref_primary_10_3389_fncom_2016_00124
crossref_primary_10_1016_j_neucom_2020_06_003
crossref_primary_10_1016_j_image_2017_09_007
crossref_primary_10_1109_TIP_2017_2722238
crossref_primary_10_1109_JSEN_2019_2899102
crossref_primary_10_1007_s12243_012_0352_5
crossref_primary_10_1016_j_eswa_2020_113654
crossref_primary_10_1109_MPRV_2021_3051309
crossref_primary_10_1117_1_JEI_25_6_061626
crossref_primary_10_1016_j_jvcir_2017_08_003
crossref_primary_10_1109_TCSVT_2016_2595324
crossref_primary_10_7498_aps_66_109501
crossref_primary_10_1109_JSTSP_2011_2165199
crossref_primary_10_1007_s11263_020_01371_6
crossref_primary_10_1109_TCSVT_2023_3342903
crossref_primary_10_5594_JMI_2022_3160541
crossref_primary_10_1109_TCSVT_2015_2450175
crossref_primary_10_1016_j_jvcir_2018_01_014
crossref_primary_10_1007_s12559_016_9406_8
crossref_primary_10_1109_TIP_2013_2247409
crossref_primary_10_1109_TCE_2011_6131159
crossref_primary_10_1109_TPAMI_2019_2924417
crossref_primary_10_1016_j_image_2013_07_003
crossref_primary_10_1016_j_dib_2019_103991
crossref_primary_10_1109_ACCESS_2021_3110292
crossref_primary_10_1109_ACCESS_2018_2826562
crossref_primary_10_3390_electronics12030680
crossref_primary_10_1109_TCSVT_2015_2389491
crossref_primary_10_1016_j_imavis_2020_104001
crossref_primary_10_1109_ACCESS_2018_2843384
crossref_primary_10_1109_JSTSP_2016_2634458
crossref_primary_10_1016_j_engappai_2024_109806
crossref_primary_10_1002_cav_2287
crossref_primary_10_1007_s11045_018_0610_4
crossref_primary_10_1016_j_patcog_2017_09_023
crossref_primary_10_1016_j_image_2015_04_007
crossref_primary_10_1109_TIP_2019_2960869
crossref_primary_10_1007_s11277_016_3704_z
crossref_primary_10_1371_journal_pcbi_1011512
crossref_primary_10_3390_e22101174
crossref_primary_10_1145_3369110
crossref_primary_10_1017_ATSIP_2013_5
crossref_primary_10_1007_s11042_017_4914_4
crossref_primary_10_1109_TMM_2019_2928494
crossref_primary_10_1007_s11042_017_4725_7
crossref_primary_10_1016_j_imavis_2016_07_007
crossref_primary_10_1016_j_imavis_2020_103964
crossref_primary_10_1109_ACCESS_2024_3394222
crossref_primary_10_1016_j_neucom_2012_08_029
crossref_primary_10_1109_TCSVT_2015_2474075
crossref_primary_10_1007_s11042_017_5334_1
crossref_primary_10_1109_TIP_2011_2165292
crossref_primary_10_1016_j_ijleo_2016_05_027
crossref_primary_10_1016_j_image_2013_01_001
crossref_primary_10_1109_TMM_2013_2264655
crossref_primary_10_4304_jsw_8_10_2541_2548
crossref_primary_10_1109_TIP_2013_2279941
crossref_primary_10_1007_s12559_014_9246_3
crossref_primary_10_1007_s11263_023_01950_3
crossref_primary_10_1016_j_ins_2017_01_019
crossref_primary_10_1109_TIP_2012_2233485
crossref_primary_10_1175_JTECH_D_16_0092_1
crossref_primary_10_1109_ACCESS_2021_3050489
crossref_primary_10_1007_s11042_015_3054_y
crossref_primary_10_1007_s11042_016_4124_5
crossref_primary_10_1109_TCSVT_2014_2308642
crossref_primary_10_1109_JSTSP_2012_2215006
crossref_primary_10_1109_TNNLS_2016_2522440
crossref_primary_10_3390_rs10040652
crossref_primary_10_1109_TGRS_2024_3479190
crossref_primary_10_1109_TMM_2017_2743987
crossref_primary_10_1109_JSAC_2022_3223408
crossref_primary_10_1080_13506285_2012_667456
crossref_primary_10_1109_LGRS_2013_2253443
crossref_primary_10_1145_3129289
crossref_primary_10_1109_TBC_2017_2781118
crossref_primary_10_1109_TIP_2013_2282897
crossref_primary_10_1109_JSTSP_2014_2313717
crossref_primary_10_1016_j_patrec_2013_06_004
crossref_primary_10_1007_s10462_012_9385_4
crossref_primary_10_1109_TIP_2018_2837106
crossref_primary_10_1016_j_knosys_2015_02_028
crossref_primary_10_1007_s13319_017_0121_3
crossref_primary_10_1016_j_image_2015_07_002
crossref_primary_10_1007_s00521_022_06895_1
crossref_primary_10_1109_ACCESS_2018_2876427
crossref_primary_10_1109_TCSVT_2019_2911396
crossref_primary_10_1109_TCSVT_2022_3172971
crossref_primary_10_1109_TIP_2018_2861217
crossref_primary_10_1109_TMM_2017_2721544
crossref_primary_10_1155_2014_343860
crossref_primary_10_1109_ACCESS_2018_2883967
crossref_primary_10_1109_TCSVT_2024_3419910
crossref_primary_10_1016_j_ijleo_2016_11_191
crossref_primary_10_1109_TCSVT_2021_3056134
crossref_primary_10_1007_s11045_015_0347_2
crossref_primary_10_1109_JSTSP_2014_2314864
crossref_primary_10_1109_TCSVT_2018_2886277
crossref_primary_10_1109_TIP_2021_3091909
crossref_primary_10_1016_j_neucom_2017_08_054
crossref_primary_10_1016_j_jvcir_2013_02_007
crossref_primary_10_1109_TIP_2014_2307434
crossref_primary_10_1017_ATSIP_2015_4
crossref_primary_10_1109_TMM_2023_3327886
crossref_primary_10_1167_jov_24_12_11
ContentType Journal Article
Copyright 2010 Elsevier B.V.
Copyright_xml – notice: 2010 Elsevier B.V.
DBID AAYXX
CITATION
DOI 10.1016/j.imavis.2010.07.001
DatabaseName CrossRef
DatabaseTitle CrossRef
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Applied Sciences
Engineering
EISSN 1872-8138
EndPage 14
ExternalDocumentID 10_1016_j_imavis_2010_07_001
S0262885610001083
GroupedDBID --K
--M
.~1
0R~
1B1
1~.
1~5
29I
4.4
457
4G.
5GY
5VS
7-5
71M
8P~
9JN
AABNK
AACTN
AAEDT
AAEDW
AAIAV
AAIKJ
AAKOC
AALRI
AAOAW
AAQFI
AAQXK
AAXUO
AAYFN
ABBOA
ABFNM
ABFRF
ABJNI
ABMAC
ABOCM
ABTAH
ABXDB
ABYKQ
ACDAQ
ACGFO
ACGFS
ACNNM
ACRLP
ACZNC
ADBBV
ADEZE
ADJOM
ADMUD
ADTZH
AEBSH
AECPX
AEFWE
AEKER
AENEX
AFKWA
AFTJW
AGHFR
AGUBO
AGYEJ
AHHHB
AHJVU
AHZHX
AIALX
AIEXJ
AIKHN
AITUG
AJBFU
AJOXV
ALMA_UNASSIGNED_HOLDINGS
AMFUW
AMRAJ
AOUOD
ASPBG
AVWKF
AXJTR
AZFZN
BJAXD
BKOJK
BLXMC
CS3
DU5
EBS
EFJIC
EFLBG
EJD
EO8
EO9
EP2
EP3
F0J
F5P
FDB
FEDTE
FGOYB
FIRID
FNPLU
FYGXN
G-Q
G8K
GBLVA
GBOLZ
HLZ
HVGLF
HZ~
IHE
J1W
JJJVA
KOM
LG9
M41
MO0
N9A
O-L
O9-
OAUVE
OZT
P-8
P-9
P2P
PC.
Q38
R2-
RIG
RNS
ROL
RPZ
SBC
SDF
SDG
SDP
SES
SEW
SPC
SPCBC
SST
SSV
SSZ
T5K
TN5
UHS
UNMZH
VOH
WUQ
XFK
XPP
ZMT
ZY4
~G-
AATTM
AAXKI
AAYWO
AAYXX
ABDPE
ABWVN
ACRPL
ACVFH
ADCNI
ADNMO
AEIPS
AEUPX
AFJKZ
AFPUW
AFXIZ
AGCQF
AGQPQ
AGRNS
AIGII
AIIUN
AKBMS
AKRWK
AKYEP
ANKPU
APXCP
BNPGV
CITATION
SSH
ID FETCH-LOGICAL-c306t-287aadc9ca14185bb6e08d1dc8a497e95c52857a2b9eabca1f77a057a6d598433
IEDL.DBID .~1
ISSN 0262-8856
IngestDate Tue Jul 01 00:48:14 EDT 2025
Thu Apr 24 23:12:50 EDT 2025
Fri Feb 23 02:23:39 EST 2024
IsPeerReviewed true
IsScholarly true
Issue 1
Keywords Video subjective quality
Video compression
Visual attention
Eye-tracking
Language English
License https://www.elsevier.com/tdm/userlicense/1.0
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c306t-287aadc9ca14185bb6e08d1dc8a497e95c52857a2b9eabca1f77a057a6d598433
PageCount 14
ParticipantIDs crossref_citationtrail_10_1016_j_imavis_2010_07_001
crossref_primary_10_1016_j_imavis_2010_07_001
elsevier_sciencedirect_doi_10_1016_j_imavis_2010_07_001
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2011
2011-1-00
PublicationDateYYYYMMDD 2011-01-01
PublicationDate_xml – year: 2011
  text: 2011
PublicationDecade 2010
PublicationTitle Image and vision computing
PublicationYear 2011
Publisher Elsevier B.V
Publisher_xml – name: Elsevier B.V
References Chi, Chen, Yeh, Jhu (bb0045) 2008; vol. 23, no. 2
Wandell (bb0015) 1995
Liu, Zheng, Ding, Yuan (bb0100) 2008
Li, Itti (bb0145) May, 2008
Malvar, He, Cutler (bb0195) 2004
Peters, Itti (bb0225) 2007
Tang, Chen, Yu, Tsai (bb0075) Feb. 2006; vol. 8
Kortum, Geisler (bb0020) 1996; vol. 2657
Rauschenbach, Schumann (bb0035) 1999; vol. 23, no. 6
Wiegand, Sullivan, Bjntegaard, Luthra (bb0010) July 2003; vol. 13
Lee, Bovik, Kim (bb0030) 1999
Lai, Wong, Lun (bb0060) Aug. 2002; vol. 1
Cerf, Harel, Einhauser, Koch (bb0055) 2008; vol. 20
“Tutorial: objective perceptual assessment of video quality: full reference television,”, ITU-T Technical tutorials, 2005.
Wolfe (bb0175) 1998; vol. 8
Itti, Koch (bb0095) Mar, 2001; vol. 2
Minoo, Nguyen (bb0080) Nov. 2005
Jiang, Ling (bb0125) 2006; vol. 16, no. 5
Wang, Ostermann, Zhang (bb0190) 2002
Sun, Ahmad, Li, Zhang (bb0135) 2006; vol. 8, no. 1
Weigand, Sullivan, Luthra (bb0005) 2003
Doulamis, Doulamis, Kalogera, Kollias (bb0025) 1998; vol. 8
Itti (bb0105) 2004; vol. 13, no. 10
Wang, Zhang, Li (bb0150) 2000
Vranjes, Rimac-Drlje, Zagar (bb0205) 2008
Ehinger, Hidalgo-Sotelo, Torralba, Oliva (bb0215) 2009; vol. 17, no. 6&7
Huang, Lin (bb0085) 2007; vol. 9, no. 6
Chen, Qiu, Lu, Zhu, Chen, Gu, Charles (bb0120) 2007
Lai, Gu, Wang, Ma, Zhang (bb0115) 2004
Watson, Hu, McGowan (bb0160) 2001; vol. 10, no. 1
Treisman, Gelade (bb0180) 1980; vol. 12
Itti, Koch (bb0185) Jan. 2001; vol. 10
Li, Itti (bb0220) 2009
Bamber (bb0200) 1975; vol. 12
Parkhurst, Niebur (bb0040) 2002; vol. 44, no. 4
Itti (bb0110) 2004; vol. 5292
Liu, Li, Soh (bb0140) 2008; vol. 18, no. 1
Itti, Koch, Niebur (bb0090) Nov. 1998; vol. 20
Watson (bb0210) 1998; vol. 3299
L. S. Karlsson, “Spatio-temporal pre-processing methods for region-of-interest video coding,” PhD dissertation, Sundsvall, Sweden, 2007.
Hershler, Hochstein (bb0050) 2005; vol. 45, no. 13
Webster, Jones, Pinson, Voran, Wolf, Webster, Jones, Pinson, Voran, Wolf (bb0155) 1993; vol. 1913
http://ilab.usc.edu/toolkit.
Tong, Rao (bb0065) July, 2006; vol. 15, no. 3
Lee, Pattechis, Bovik (bb0130) 2001; vol. 10, no. 7
Minoo (10.1016/j.imavis.2010.07.001_bb0080) 2005
Itti (10.1016/j.imavis.2010.07.001_bb0110) 2004; vol. 5292
Itti (10.1016/j.imavis.2010.07.001_bb0090) 1998; vol. 20
10.1016/j.imavis.2010.07.001_bb0165
Itti (10.1016/j.imavis.2010.07.001_bb0105) 2004; vol. 13, no. 10
Vranjes (10.1016/j.imavis.2010.07.001_bb0205) 2008
Chi (10.1016/j.imavis.2010.07.001_bb0045) 2008; vol. 23, no. 2
Hershler (10.1016/j.imavis.2010.07.001_bb0050) 2005; vol. 45, no. 13
Ehinger (10.1016/j.imavis.2010.07.001_bb0215) 2009; vol. 17, no. 6&7
Wiegand (10.1016/j.imavis.2010.07.001_bb0010) 2003; vol. 13
Itti (10.1016/j.imavis.2010.07.001_bb0185) 2001; vol. 10
Chen (10.1016/j.imavis.2010.07.001_bb0120) 2007
Wolfe (10.1016/j.imavis.2010.07.001_bb0175) 1998; vol. 8
Kortum (10.1016/j.imavis.2010.07.001_bb0020) 1996; vol. 2657
Tang (10.1016/j.imavis.2010.07.001_bb0075) 2006; vol. 8
Cerf (10.1016/j.imavis.2010.07.001_bb0055) 2008; vol. 20
Liu (10.1016/j.imavis.2010.07.001_bb0140) 2008; vol. 18, no. 1
Lai (10.1016/j.imavis.2010.07.001_bb0060) 2002; vol. 1
Peters (10.1016/j.imavis.2010.07.001_bb0225) 2007
Doulamis (10.1016/j.imavis.2010.07.001_bb0025) 1998; vol. 8
Lee (10.1016/j.imavis.2010.07.001_bb0030) 1999
Malvar (10.1016/j.imavis.2010.07.001_bb0195) 2004
Wandell (10.1016/j.imavis.2010.07.001_bb0015) 1995
10.1016/j.imavis.2010.07.001_bb0170
10.1016/j.imavis.2010.07.001_bb0070
Sun (10.1016/j.imavis.2010.07.001_bb0135) 2006; vol. 8, no. 1
Lee (10.1016/j.imavis.2010.07.001_bb0130) 2001; vol. 10, no. 7
Watson (10.1016/j.imavis.2010.07.001_bb0210) 1998; vol. 3299
Tong (10.1016/j.imavis.2010.07.001_bb0065) 2006; vol. 15, no. 3
Parkhurst (10.1016/j.imavis.2010.07.001_bb0040) 2002; vol. 44, no. 4
Weigand (10.1016/j.imavis.2010.07.001_bb0005) 2003
Lai (10.1016/j.imavis.2010.07.001_bb0115) 2004
Li (10.1016/j.imavis.2010.07.001_bb0220) 2009
Rauschenbach (10.1016/j.imavis.2010.07.001_bb0035) 1999; vol. 23, no. 6
Watson (10.1016/j.imavis.2010.07.001_bb0160) 2001; vol. 10, no. 1
Itti (10.1016/j.imavis.2010.07.001_bb0095) 2001; vol. 2
Liu (10.1016/j.imavis.2010.07.001_bb0100) 2008
Bamber (10.1016/j.imavis.2010.07.001_bb0200) 1975; vol. 12
Huang (10.1016/j.imavis.2010.07.001_bb0085) 2007; vol. 9, no. 6
Jiang (10.1016/j.imavis.2010.07.001_bb0125) 2006; vol. 16, no. 5
Webster (10.1016/j.imavis.2010.07.001_bb0155) 1993; vol. 1913
Treisman (10.1016/j.imavis.2010.07.001_bb0180) 1980; vol. 12
Wang (10.1016/j.imavis.2010.07.001_bb0150) 2000
Wang (10.1016/j.imavis.2010.07.001_bb0190) 2002
Li (10.1016/j.imavis.2010.07.001_bb0145) 2008
References_xml – volume: vol. 15, no. 3
  year: July, 2006
  ident: bb0065
  article-title: Region-of-interest based rate control for low-bit-rate video conferencing
  publication-title: Journal of Electronic Imaging
– volume: vol. 44, no. 4
  start-page: 611
  year: 2002
  end-page: 629
  ident: bb0040
  article-title: Variable-resolution displays: a theoretical, practical, and behavioral evalutation
  publication-title: Human Factors
– volume: vol. 9, no. 6
  start-page: 1113
  year: 2007
  end-page: 1124
  ident: bb0085
  article-title: A novel 4-D perceptual quantization modeling for H.264 bit-rate control
  publication-title: IEEE Trans. On Multimedia
– start-page: 485
  year: 2004
  end-page: 488
  ident: bb0195
  article-title: High-quality linear interporlation for demosaicing of bayer-patterned color images
  publication-title: Proc. ICASSP
– volume: vol. 8, no. 1
  start-page: 1
  year: 2006
  end-page: 10
  ident: bb0135
  article-title: Region-based rate control and bit allocation for wireless video transmission
  publication-title: IEEE Trans. on Multimedia
– year: May, 2008
  ident: bb0145
  article-title: Visual attention guided video compression
  publication-title: Proc. Vision Science Society Annual Meeting (VSS08)
– volume: vol. 12
  start-page: 375
  year: 1975
  end-page: 387
  ident: bb0200
  article-title: The area above the ordinal dominance graph and the area below the receiver operating characteristic graph
  publication-title: Journal of Mathematical Psychology
– volume: vol. 20
  start-page: 241
  year: 2008
  end-page: 248
  ident: bb0055
  article-title: Predicting human gaze using low-level saliency combined with face detection
  publication-title: Advances in neural information processing systems
– year: 2007
  ident: bb0225
  article-title: Beyond bottom-up: incorporating task-dependent influences into a computational model of spatial attention
  publication-title: Proc. CVPR
– start-page: 1
  year: 2008
  end-page: 4
  ident: bb0100
  article-title: Video attention: learning to detect a salient object sequence
  publication-title: Proc. ICPR
– volume: vol. 16, no. 5
  start-page: 663
  year: 2006
  end-page: 669
  ident: bb0125
  article-title: On Lagrange multiplier and quantizer adjustment for H.264 frame layer video rate control
  publication-title: IEEE Trans. Circuits and Syst. Video Technol
– reference: “Tutorial: objective perceptual assessment of video quality: full reference television,”, ITU-T Technical tutorials, 2005.
– start-page: 90
  year: 1999
  end-page: 94
  ident: bb0030
  article-title: Low delay foveated visual communications over wireless channels
  publication-title: Proc. IEEE Int. Conf. Image Processing
– year: 2003
  ident: bb0005
  article-title: Draft ITU-T recommendation H.264 and final draft international standard 14496-10 advanced video coding
  publication-title: Joint Video Teams of ISO/IECJTCI/SC29/WG11 and ITU-T SG/16/Q.6Doc.JVT-G050r, Geneva, Switzerland
– year: 2002
  ident: bb0190
  article-title: Video Processing and Communication
  publication-title: Pearson Education
– volume: vol. 10, no. 7
  start-page: 977
  year: 2001
  end-page: 992
  ident: bb0130
  article-title: Foveated video compression with optimal rate control
  publication-title: IEEE Trans. on Image Processing
– volume: vol. 10, no. 1
  start-page: 20
  year: 2001
  end-page: 29
  ident: bb0160
  article-title: Digital video quality metric based on human vision
  publication-title: Journal of Electronic Imaging
– volume: vol. 18, no. 1
  start-page: 134
  year: 2008
  end-page: 139
  ident: bb0140
  article-title: Region-of-interest based resource allocation for conversational video communications of H.264/AVC
  publication-title: IEEE Trans. Circuits and Syst. Video Technol
– volume: vol. 2657
  start-page: 350
  year: 1996
  end-page: 360
  ident: bb0020
  article-title: Implementation of a foveated image coding system for bandwidth reduction of video images
  publication-title: Proc. SPIE
– reference: L. S. Karlsson, “Spatio-temporal pre-processing methods for region-of-interest video coding,” PhD dissertation, Sundsvall, Sweden, 2007.
– volume: vol. 12
  start-page: 97
  year: 1980
  end-page: 136
  ident: bb0180
  article-title: A feature-integration theory of attention
  publication-title: Cognition Psychology
– volume: vol. 23, no. 6
  start-page: 857
  year: 1999
  end-page: 866
  ident: bb0035
  article-title: Demand-driven image transmission with levels of detail and region s of interest
  publication-title: Comput. Graph
– volume: vol. 23, no. 2
  start-page: 127
  year: 2008
  end-page: 142
  ident: bb0045
  article-title: Region-of-interest video coding based on rate and distortion variations for H.263+
  publication-title: Image Communication
– year: 2009
  ident: bb0220
  article-title: Gist based top-down templates for gaze prediction
  publication-title: Proc. Vision Science Society Annual Meeting
– volume: vol. 3299
  start-page: 139
  year: 1998
  end-page: 147
  ident: bb0210
  article-title: Toward a perceptual video quality metric
  publication-title: Human Vision, Visual Processing, and Digital Display
– volume: vol. 8
  start-page: 11
  year: Feb. 2006
  end-page: 18
  ident: bb0075
  article-title: Visual sensitivity guided bit allocation for video coding
  publication-title: IEEE Trans. Multimedia
– start-page: 3634
  year: 2007
  end-page: 3638
  ident: bb0120
  article-title: Improving video coding at scene cuts using attention based adaptive bit allocation
  publication-title: Proc. ISCAS
– volume: vol. 8
  start-page: 303
  year: 1998
  end-page: 304
  ident: bb0175
  article-title: Visual memory: what do you know about what you saw?
  publication-title: Current Biology
– volume: vol. 8
  start-page: 928
  year: 1998
  end-page: 934
  ident: bb0025
  article-title: Improving the performance of MPEG coders using adaptive regions of interest
  publication-title: IEEE Trans. On Circuits syst. Video Technol
– volume: vol. 45, no. 13
  start-page: 1707
  year: 2005
  end-page: 1724
  ident: bb0050
  article-title: At first sight: a high-level pop out effects for faces
  publication-title: Vision Research
– volume: vol. 1913
  start-page: 15
  year: 1993
  end-page: 26
  ident: bb0155
  article-title: Objective video quality assessment system based on human perception
  publication-title: Proc. SPIE
– start-page: 791
  year: 2000
  end-page: 794
  ident: bb0150
  article-title: Objective quality evaluation of digital video
  publication-title: Proc. PCCAS
– volume: vol. 13, no. 10
  start-page: 1304
  year: 2004
  end-page: 1318
  ident: bb0105
  article-title: Automatic foveation for video compression using a neurobiological model of visual attention
  publication-title: IEEE Trans. on Image Processing
– year: 2008
  ident: bb0205
  article-title: Subjective and objective quality evaluation of the H.264/AVC coded video
  publication-title: Proc. Systems, Signals and Image Processing
– year: 1995
  ident: bb0015
  article-title: Foundations of Vision
– year: Nov. 2005
  ident: bb0080
  article-title: Perceptual video coding with H.264
  publication-title: Proc. 39th Asilomar Conf. Signals, Systems, and Computers
– volume: vol. 5292
  start-page: 272
  year: 2004
  end-page: 283
  ident: bb0110
  article-title: Automatic attention-based prioritization of unconstrained video for compression
  publication-title: Proc. SPIE Human Vision and Electronic Imaging
– volume: vol. 13
  start-page: 560
  year: July 2003
  end-page: 576
  ident: bb0010
  article-title: Overview of the H.264/AVC video coding standard
  publication-title: IEEE Trans. Circuits and Syst. Video Technol
– year: 2004
  ident: bb0115
  article-title: A content-based bit allocation model for video streaming
  publication-title: Proc. IEEE international Conference on Multimedia and Expo
– volume: vol. 20
  start-page: 1254
  year: Nov. 1998
  end-page: 1259
  ident: bb0090
  article-title: A model of saliency-based visual attention for rapid scene analysis
  publication-title: IEEE Transactions on Pattern Analysis and Machine Intelligence
– volume: vol. 10
  start-page: 161
  year: Jan. 2001
  end-page: 169
  ident: bb0185
  article-title: Feature combination strategies for saliency-based visual attention systems
  publication-title: Journal of Electronic Imaging
– reference: http://ilab.usc.edu/toolkit.
– volume: vol. 1
  start-page: 656
  year: Aug. 2002
  end-page: 659
  ident: bb0060
  article-title: A rate control algorithm using human visual system for video conferencing systems
  publication-title: Proc. Int. Conf. Signal Processing
– volume: vol. 2
  start-page: 194
  year: Mar, 2001
  end-page: 203
  ident: bb0095
  article-title: Computational modeling of visual attention
  publication-title: Nature Reviews, Neuroscience
– volume: vol. 17, no. 6&7
  start-page: 945
  year: 2009
  end-page: 978
  ident: bb0215
  article-title: Modeling search for people in 900 scenes: a combined source model of eye guidance
  publication-title: Visual Cognition
– volume: vol. 5292
  start-page: 272
  year: 2004
  ident: 10.1016/j.imavis.2010.07.001_bb0110
  article-title: Automatic attention-based prioritization of unconstrained video for compression
– volume: vol. 13, no. 10
  start-page: 1304
  year: 2004
  ident: 10.1016/j.imavis.2010.07.001_bb0105
  article-title: Automatic foveation for video compression using a neurobiological model of visual attention
– volume: vol. 17, no. 6&7
  start-page: 945
  year: 2009
  ident: 10.1016/j.imavis.2010.07.001_bb0215
  article-title: Modeling search for people in 900 scenes: a combined source model of eye guidance
– volume: vol. 10
  start-page: 161
  year: 2001
  ident: 10.1016/j.imavis.2010.07.001_bb0185
  article-title: Feature combination strategies for saliency-based visual attention systems
– volume: vol. 12
  start-page: 375
  year: 1975
  ident: 10.1016/j.imavis.2010.07.001_bb0200
  article-title: The area above the ordinal dominance graph and the area below the receiver operating characteristic graph
– year: 2009
  ident: 10.1016/j.imavis.2010.07.001_bb0220
  article-title: Gist based top-down templates for gaze prediction
– ident: 10.1016/j.imavis.2010.07.001_bb0070
– year: 2002
  ident: 10.1016/j.imavis.2010.07.001_bb0190
  article-title: Video Processing and Communication
– start-page: 791
  year: 2000
  ident: 10.1016/j.imavis.2010.07.001_bb0150
  article-title: Objective quality evaluation of digital video
– start-page: 3634
  year: 2007
  ident: 10.1016/j.imavis.2010.07.001_bb0120
  article-title: Improving video coding at scene cuts using attention based adaptive bit allocation
– volume: vol. 8
  start-page: 303
  year: 1998
  ident: 10.1016/j.imavis.2010.07.001_bb0175
  article-title: Visual memory: what do you know about what you saw?
– ident: 10.1016/j.imavis.2010.07.001_bb0170
– volume: vol. 1
  start-page: 656
  year: 2002
  ident: 10.1016/j.imavis.2010.07.001_bb0060
  article-title: A rate control algorithm using human visual system for video conferencing systems
– volume: vol. 44, no. 4
  start-page: 611
  year: 2002
  ident: 10.1016/j.imavis.2010.07.001_bb0040
  article-title: Variable-resolution displays: a theoretical, practical, and behavioral evalutation
– start-page: 90
  year: 1999
  ident: 10.1016/j.imavis.2010.07.001_bb0030
  article-title: Low delay foveated visual communications over wireless channels
– start-page: 1
  year: 2008
  ident: 10.1016/j.imavis.2010.07.001_bb0100
  article-title: Video attention: learning to detect a salient object sequence
– year: 2005
  ident: 10.1016/j.imavis.2010.07.001_bb0080
  article-title: Perceptual video coding with H.264
– volume: vol. 9, no. 6
  start-page: 1113
  year: 2007
  ident: 10.1016/j.imavis.2010.07.001_bb0085
  article-title: A novel 4-D perceptual quantization modeling for H.264 bit-rate control
– volume: vol. 13
  start-page: 560
  year: 2003
  ident: 10.1016/j.imavis.2010.07.001_bb0010
  article-title: Overview of the H.264/AVC video coding standard
– year: 2008
  ident: 10.1016/j.imavis.2010.07.001_bb0145
  article-title: Visual attention guided video compression
– volume: vol. 23, no. 2
  start-page: 127
  year: 2008
  ident: 10.1016/j.imavis.2010.07.001_bb0045
  article-title: Region-of-interest video coding based on rate and distortion variations for H.263+
– year: 2004
  ident: 10.1016/j.imavis.2010.07.001_bb0115
  article-title: A content-based bit allocation model for video streaming
– volume: vol. 3299
  start-page: 139
  year: 1998
  ident: 10.1016/j.imavis.2010.07.001_bb0210
  article-title: Toward a perceptual video quality metric
– year: 2007
  ident: 10.1016/j.imavis.2010.07.001_bb0225
  article-title: Beyond bottom-up: incorporating task-dependent influences into a computational model of spatial attention
– volume: vol. 20
  start-page: 241
  year: 2008
  ident: 10.1016/j.imavis.2010.07.001_bb0055
  article-title: Predicting human gaze using low-level saliency combined with face detection
– volume: vol. 20
  start-page: 1254
  year: 1998
  ident: 10.1016/j.imavis.2010.07.001_bb0090
  article-title: A model of saliency-based visual attention for rapid scene analysis
– volume: vol. 18, no. 1
  start-page: 134
  year: 2008
  ident: 10.1016/j.imavis.2010.07.001_bb0140
  article-title: Region-of-interest based resource allocation for conversational video communications of H.264/AVC
– start-page: 485
  year: 2004
  ident: 10.1016/j.imavis.2010.07.001_bb0195
  article-title: High-quality linear interporlation for demosaicing of bayer-patterned color images
– volume: vol. 10, no. 1
  start-page: 20
  year: 2001
  ident: 10.1016/j.imavis.2010.07.001_bb0160
  article-title: Digital video quality metric based on human vision
– volume: vol. 2657
  start-page: 350
  year: 1996
  ident: 10.1016/j.imavis.2010.07.001_bb0020
  article-title: Implementation of a foveated image coding system for bandwidth reduction of video images
– year: 2008
  ident: 10.1016/j.imavis.2010.07.001_bb0205
  article-title: Subjective and objective quality evaluation of the H.264/AVC coded video
– year: 1995
  ident: 10.1016/j.imavis.2010.07.001_bb0015
– volume: vol. 15, no. 3
  year: 2006
  ident: 10.1016/j.imavis.2010.07.001_bb0065
  article-title: Region-of-interest based rate control for low-bit-rate video conferencing
– volume: vol. 2
  start-page: 194
  year: 2001
  ident: 10.1016/j.imavis.2010.07.001_bb0095
  article-title: Computational modeling of visual attention
– volume: vol. 10, no. 7
  start-page: 977
  year: 2001
  ident: 10.1016/j.imavis.2010.07.001_bb0130
  article-title: Foveated video compression with optimal rate control
– volume: vol. 8
  start-page: 928
  year: 1998
  ident: 10.1016/j.imavis.2010.07.001_bb0025
  article-title: Improving the performance of MPEG coders using adaptive regions of interest
– ident: 10.1016/j.imavis.2010.07.001_bb0165
– volume: vol. 16, no. 5
  start-page: 663
  year: 2006
  ident: 10.1016/j.imavis.2010.07.001_bb0125
  article-title: On Lagrange multiplier and quantizer adjustment for H.264 frame layer video rate control
– volume: vol. 8, no. 1
  start-page: 1
  year: 2006
  ident: 10.1016/j.imavis.2010.07.001_bb0135
  article-title: Region-based rate control and bit allocation for wireless video transmission
– volume: vol. 1913
  start-page: 15
  year: 1993
  ident: 10.1016/j.imavis.2010.07.001_bb0155
  article-title: Objective video quality assessment system based on human perception
– year: 2003
  ident: 10.1016/j.imavis.2010.07.001_bb0005
  article-title: Draft ITU-T recommendation H.264 and final draft international standard 14496-10 advanced video coding
– volume: vol. 45, no. 13
  start-page: 1707
  year: 2005
  ident: 10.1016/j.imavis.2010.07.001_bb0050
  article-title: At first sight: a high-level pop out effects for faces
– volume: vol. 23, no. 6
  start-page: 857
  year: 1999
  ident: 10.1016/j.imavis.2010.07.001_bb0035
  article-title: Demand-driven image transmission with levels of detail and region s of interest
– volume: vol. 8
  start-page: 11
  year: 2006
  ident: 10.1016/j.imavis.2010.07.001_bb0075
  article-title: Visual sensitivity guided bit allocation for video coding
– volume: vol. 12
  start-page: 97
  year: 1980
  ident: 10.1016/j.imavis.2010.07.001_bb0180
  article-title: A feature-integration theory of attention
SSID ssj0007079
Score 2.4343104
Snippet A visual attention-based bit allocation strategy for video compression is proposed. Saliency-based attention prediction is used to detect interesting regions...
SourceID crossref
elsevier
SourceType Enrichment Source
Index Database
Publisher
StartPage 1
SubjectTerms Eye-tracking
Video compression
Video subjective quality
Visual attention
Title Visual attention guided bit allocation in video compression
URI https://dx.doi.org/10.1016/j.imavis.2010.07.001
Volume 29
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LSwMxEA5FL3rwURXro-TgNXYf2U0WT6VYqmIvWultyUtZ0W2x26u_3cwmqxVEQdjTkoEwJDPfJN98Qegspqk2Jo1IoDNDKDOacKpjYrEx5VqlTIfQjXw7TkcTej1Npi00aHphgFbpY7-L6XW09n963pu9eVH07mz1EHEO-R8KGw6Kn5QyWOXn7180D1CAc-csdufb0U37XM3xKl6hld8TvEDLMPw5Pa2knOEO2vJYEffddHZRy5RttO1xI_a7ctFGmyuignvo4qFYLK0ZCGfWVEb8tCy0NZBFheGa3R3S4aLE0IM3w8Aqd2zYch9Nhpf3gxHxTyQQZbF-RWy9I4RWmRIhqNBImZqA61ArLmjGTJaoJOIJE5HMjJB21CNjwkI0keok4zSOD9BaOSvNIcKBltLWyTJSMqa23hahLVYSKSIuAQToDoobz-TK64fDMxYveUMUe86dP3PwZx7AxXbYQeTTau70M_4Yzxqn59_WQW5D_K-WR_-2PEYb7qQYvhO0Vr0tzamFGpXs1mupi9b7Vzej8QfgKdPE
linkProvider Elsevier
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV09T8MwED2VMgADHwVE-czAGtokTuKICVVUBdoutKibZccGBUFa0XTlt-OLHSgSAgkpU-STrJPv_M5-9wxwHpBIKhX5blsmyiWxki4lMnA1NiZUplEsPexGHgyj3pjcTsJJDTpVLwzSKm3uNzm9zNb2T8t6szXLsta9rh58SnH_x8KGBiuwSnT44jMGF-9fPA-UgDMHLTr09fCqf64keWWv2MtvGV4oZuj9vD8t7Tndbdi0YNG5MvPZgZrKG7BlgaNjw3LegI0lVcFduHzI5gtthsqZJZfReVpkUhuIrHDwnt2c0jlZ7mAT3tRBWrmhw-Z7MO5ejzo9176R4KYa7BeuLng4l2mScg9laISIVJtKT6aUkyRWSZiGPg1j7otEcaFHPcYx1xiNRzJMKAmCfajn01wdgNOWQuhCWfipCIguuLmnq5VQcJ8KRAGyCUHlGZZaAXF8x-KFVUyxZ2b8ydCfrI03214T3E-rmRHQ-GN8XDmdfVsITOf4Xy0P_215Bmu90aDP-jfDuyNYN8fG-B1DvXhbqBONOwpxWq6rD9_i1VI
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Visual+attention+guided+bit+allocation+in+video+compression&rft.jtitle=Image+and+vision+computing&rft.au=Li%2C+Zhicheng&rft.au=Qin%2C+Shiyin&rft.au=Itti%2C+Laurent&rft.date=2011&rft.pub=Elsevier+B.V&rft.issn=0262-8856&rft.eissn=1872-8138&rft.volume=29&rft.issue=1&rft.spage=1&rft.epage=14&rft_id=info:doi/10.1016%2Fj.imavis.2010.07.001&rft.externalDocID=S0262885610001083
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0262-8856&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0262-8856&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0262-8856&client=summon