Visual attention guided bit allocation in video compression
A visual attention-based bit allocation strategy for video compression is proposed. Saliency-based attention prediction is used to detect interesting regions in video. From the top salient locations from the computed saliency map, a guidance map is generated to guide the bit allocation strategy thro...
Saved in:
Published in | Image and vision computing Vol. 29; no. 1; pp. 1 - 14 |
---|---|
Main Authors | , , |
Format | Journal Article |
Language | English |
Published |
Elsevier B.V
2011
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | A visual attention-based bit allocation strategy for video compression is proposed. Saliency-based attention prediction is used to detect interesting regions in video. From the top salient locations from the computed saliency map, a guidance map is generated to guide the bit allocation strategy through a new constrained global optimization approach, which can be solved in a closed form and independently of video frame content. Fifty video sequences (300 frames each) and eye-tracking data from 14 subjects were collected to evaluate both the accuracy of the attention prediction model and the subjective quality of the encoded video. Results show that the area under the curve of the guidance map is 0.773
±
0.002, significantly above chance (0.500). Using a new eye-tracking-weighted PSNR (EWPSNR) measure of subjective quality, more than 90% of the encoded video clips with the proposed method achieve better subjective quality compared to standard encoding with matched bit rate. The improvement in EWPSNR is up to over 2
dB and on average 0.79
dB. |
---|---|
AbstractList | A visual attention-based bit allocation strategy for video compression is proposed. Saliency-based attention prediction is used to detect interesting regions in video. From the top salient locations from the computed saliency map, a guidance map is generated to guide the bit allocation strategy through a new constrained global optimization approach, which can be solved in a closed form and independently of video frame content. Fifty video sequences (300 frames each) and eye-tracking data from 14 subjects were collected to evaluate both the accuracy of the attention prediction model and the subjective quality of the encoded video. Results show that the area under the curve of the guidance map is 0.773
±
0.002, significantly above chance (0.500). Using a new eye-tracking-weighted PSNR (EWPSNR) measure of subjective quality, more than 90% of the encoded video clips with the proposed method achieve better subjective quality compared to standard encoding with matched bit rate. The improvement in EWPSNR is up to over 2
dB and on average 0.79
dB. |
Author | Qin, Shiyin Li, Zhicheng Itti, Laurent |
Author_xml | – sequence: 1 givenname: Zhicheng surname: Li fullname: Li, Zhicheng organization: School of Automation Science and Electrical Engineering, Beihang University, Beijing, China – sequence: 2 givenname: Shiyin surname: Qin fullname: Qin, Shiyin organization: School of Automation Science and Electrical Engineering, Beihang University, Beijing, China – sequence: 3 givenname: Laurent surname: Itti fullname: Itti, Laurent email: itti@pollux.usc.edu organization: Computer Science Department, University of Southern California, Los Angeles, CA, USA |
BookMark | eNqFkN1KAzEQhYMo2FbfwIt9gV0n-5esgiDFPyh4o96G2WQqKdtNSdKCb2_aeuWFXs3wcc5h5kzZ6ehGYuyKQ8GBt9erwq5xZ0NRQkIgCgB-wiZcijKXvJKnbAJlm3bZtOdsGsIKAASIbsJuP2zY4pBhjDRG68bsc2sNmay3McNhcBoP1I7ZLnGXabfeeAohwQt2tsQh0OXPnLH3x4e3-XO-eH16md8vcl1BG_NSCkSjO4285rLp-5ZAGm60xLoT1DW6KWUjsOw7wj6plkIgJNCappN1Vc3YzTFXexeCp6XSNh7Oih7toDiofQ1qpY41qH0NCoRKNSRz_cu88Unmv_6z3R1tlB7bWfIqaEujJmM96aiMs38HfAOdLHx5 |
CitedBy_id | crossref_primary_10_1016_j_patcog_2018_02_004 crossref_primary_10_1109_TBC_2018_2795459 crossref_primary_10_1016_j_image_2015_04_011 crossref_primary_10_1109_ACCESS_2019_2960807 crossref_primary_10_1108_AEAT_10_2012_0164 crossref_primary_10_1007_s11042_020_08686_z crossref_primary_10_1371_journal_pone_0150673 crossref_primary_10_1080_02564602_2016_1231023 crossref_primary_10_4304_jsw_7_11_2591_2598 crossref_primary_10_1007_s10514_018_9752_3 crossref_primary_10_1109_ACCESS_2023_3286577 crossref_primary_10_1007_s11042_015_2465_0 crossref_primary_10_3389_fncom_2016_00124 crossref_primary_10_1016_j_neucom_2020_06_003 crossref_primary_10_1016_j_image_2017_09_007 crossref_primary_10_1109_TIP_2017_2722238 crossref_primary_10_1109_JSEN_2019_2899102 crossref_primary_10_1007_s12243_012_0352_5 crossref_primary_10_1016_j_eswa_2020_113654 crossref_primary_10_1109_MPRV_2021_3051309 crossref_primary_10_1117_1_JEI_25_6_061626 crossref_primary_10_1016_j_jvcir_2017_08_003 crossref_primary_10_1109_TCSVT_2016_2595324 crossref_primary_10_7498_aps_66_109501 crossref_primary_10_1109_JSTSP_2011_2165199 crossref_primary_10_1007_s11263_020_01371_6 crossref_primary_10_1109_TCSVT_2023_3342903 crossref_primary_10_5594_JMI_2022_3160541 crossref_primary_10_1109_TCSVT_2015_2450175 crossref_primary_10_1016_j_jvcir_2018_01_014 crossref_primary_10_1007_s12559_016_9406_8 crossref_primary_10_1109_TIP_2013_2247409 crossref_primary_10_1109_TCE_2011_6131159 crossref_primary_10_1109_TPAMI_2019_2924417 crossref_primary_10_1016_j_image_2013_07_003 crossref_primary_10_1016_j_dib_2019_103991 crossref_primary_10_1109_ACCESS_2021_3110292 crossref_primary_10_1109_ACCESS_2018_2826562 crossref_primary_10_3390_electronics12030680 crossref_primary_10_1109_TCSVT_2015_2389491 crossref_primary_10_1016_j_imavis_2020_104001 crossref_primary_10_1109_ACCESS_2018_2843384 crossref_primary_10_1109_JSTSP_2016_2634458 crossref_primary_10_1016_j_engappai_2024_109806 crossref_primary_10_1002_cav_2287 crossref_primary_10_1007_s11045_018_0610_4 crossref_primary_10_1016_j_patcog_2017_09_023 crossref_primary_10_1016_j_image_2015_04_007 crossref_primary_10_1109_TIP_2019_2960869 crossref_primary_10_1007_s11277_016_3704_z crossref_primary_10_1371_journal_pcbi_1011512 crossref_primary_10_3390_e22101174 crossref_primary_10_1145_3369110 crossref_primary_10_1017_ATSIP_2013_5 crossref_primary_10_1007_s11042_017_4914_4 crossref_primary_10_1109_TMM_2019_2928494 crossref_primary_10_1007_s11042_017_4725_7 crossref_primary_10_1016_j_imavis_2016_07_007 crossref_primary_10_1016_j_imavis_2020_103964 crossref_primary_10_1109_ACCESS_2024_3394222 crossref_primary_10_1016_j_neucom_2012_08_029 crossref_primary_10_1109_TCSVT_2015_2474075 crossref_primary_10_1007_s11042_017_5334_1 crossref_primary_10_1109_TIP_2011_2165292 crossref_primary_10_1016_j_ijleo_2016_05_027 crossref_primary_10_1016_j_image_2013_01_001 crossref_primary_10_1109_TMM_2013_2264655 crossref_primary_10_4304_jsw_8_10_2541_2548 crossref_primary_10_1109_TIP_2013_2279941 crossref_primary_10_1007_s12559_014_9246_3 crossref_primary_10_1007_s11263_023_01950_3 crossref_primary_10_1016_j_ins_2017_01_019 crossref_primary_10_1109_TIP_2012_2233485 crossref_primary_10_1175_JTECH_D_16_0092_1 crossref_primary_10_1109_ACCESS_2021_3050489 crossref_primary_10_1007_s11042_015_3054_y crossref_primary_10_1007_s11042_016_4124_5 crossref_primary_10_1109_TCSVT_2014_2308642 crossref_primary_10_1109_JSTSP_2012_2215006 crossref_primary_10_1109_TNNLS_2016_2522440 crossref_primary_10_3390_rs10040652 crossref_primary_10_1109_TGRS_2024_3479190 crossref_primary_10_1109_TMM_2017_2743987 crossref_primary_10_1109_JSAC_2022_3223408 crossref_primary_10_1080_13506285_2012_667456 crossref_primary_10_1109_LGRS_2013_2253443 crossref_primary_10_1145_3129289 crossref_primary_10_1109_TBC_2017_2781118 crossref_primary_10_1109_TIP_2013_2282897 crossref_primary_10_1109_JSTSP_2014_2313717 crossref_primary_10_1016_j_patrec_2013_06_004 crossref_primary_10_1007_s10462_012_9385_4 crossref_primary_10_1109_TIP_2018_2837106 crossref_primary_10_1016_j_knosys_2015_02_028 crossref_primary_10_1007_s13319_017_0121_3 crossref_primary_10_1016_j_image_2015_07_002 crossref_primary_10_1007_s00521_022_06895_1 crossref_primary_10_1109_ACCESS_2018_2876427 crossref_primary_10_1109_TCSVT_2019_2911396 crossref_primary_10_1109_TCSVT_2022_3172971 crossref_primary_10_1109_TIP_2018_2861217 crossref_primary_10_1109_TMM_2017_2721544 crossref_primary_10_1155_2014_343860 crossref_primary_10_1109_ACCESS_2018_2883967 crossref_primary_10_1109_TCSVT_2024_3419910 crossref_primary_10_1016_j_ijleo_2016_11_191 crossref_primary_10_1109_TCSVT_2021_3056134 crossref_primary_10_1007_s11045_015_0347_2 crossref_primary_10_1109_JSTSP_2014_2314864 crossref_primary_10_1109_TCSVT_2018_2886277 crossref_primary_10_1109_TIP_2021_3091909 crossref_primary_10_1016_j_neucom_2017_08_054 crossref_primary_10_1016_j_jvcir_2013_02_007 crossref_primary_10_1109_TIP_2014_2307434 crossref_primary_10_1017_ATSIP_2015_4 crossref_primary_10_1109_TMM_2023_3327886 crossref_primary_10_1167_jov_24_12_11 |
ContentType | Journal Article |
Copyright | 2010 Elsevier B.V. |
Copyright_xml | – notice: 2010 Elsevier B.V. |
DBID | AAYXX CITATION |
DOI | 10.1016/j.imavis.2010.07.001 |
DatabaseName | CrossRef |
DatabaseTitle | CrossRef |
DatabaseTitleList | |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Applied Sciences Engineering |
EISSN | 1872-8138 |
EndPage | 14 |
ExternalDocumentID | 10_1016_j_imavis_2010_07_001 S0262885610001083 |
GroupedDBID | --K --M .~1 0R~ 1B1 1~. 1~5 29I 4.4 457 4G. 5GY 5VS 7-5 71M 8P~ 9JN AABNK AACTN AAEDT AAEDW AAIAV AAIKJ AAKOC AALRI AAOAW AAQFI AAQXK AAXUO AAYFN ABBOA ABFNM ABFRF ABJNI ABMAC ABOCM ABTAH ABXDB ABYKQ ACDAQ ACGFO ACGFS ACNNM ACRLP ACZNC ADBBV ADEZE ADJOM ADMUD ADTZH AEBSH AECPX AEFWE AEKER AENEX AFKWA AFTJW AGHFR AGUBO AGYEJ AHHHB AHJVU AHZHX AIALX AIEXJ AIKHN AITUG AJBFU AJOXV ALMA_UNASSIGNED_HOLDINGS AMFUW AMRAJ AOUOD ASPBG AVWKF AXJTR AZFZN BJAXD BKOJK BLXMC CS3 DU5 EBS EFJIC EFLBG EJD EO8 EO9 EP2 EP3 F0J F5P FDB FEDTE FGOYB FIRID FNPLU FYGXN G-Q G8K GBLVA GBOLZ HLZ HVGLF HZ~ IHE J1W JJJVA KOM LG9 M41 MO0 N9A O-L O9- OAUVE OZT P-8 P-9 P2P PC. Q38 R2- RIG RNS ROL RPZ SBC SDF SDG SDP SES SEW SPC SPCBC SST SSV SSZ T5K TN5 UHS UNMZH VOH WUQ XFK XPP ZMT ZY4 ~G- AATTM AAXKI AAYWO AAYXX ABDPE ABWVN ACRPL ACVFH ADCNI ADNMO AEIPS AEUPX AFJKZ AFPUW AFXIZ AGCQF AGQPQ AGRNS AIGII AIIUN AKBMS AKRWK AKYEP ANKPU APXCP BNPGV CITATION SSH |
ID | FETCH-LOGICAL-c306t-287aadc9ca14185bb6e08d1dc8a497e95c52857a2b9eabca1f77a057a6d598433 |
IEDL.DBID | .~1 |
ISSN | 0262-8856 |
IngestDate | Tue Jul 01 00:48:14 EDT 2025 Thu Apr 24 23:12:50 EDT 2025 Fri Feb 23 02:23:39 EST 2024 |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 1 |
Keywords | Video subjective quality Video compression Visual attention Eye-tracking |
Language | English |
License | https://www.elsevier.com/tdm/userlicense/1.0 |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c306t-287aadc9ca14185bb6e08d1dc8a497e95c52857a2b9eabca1f77a057a6d598433 |
PageCount | 14 |
ParticipantIDs | crossref_citationtrail_10_1016_j_imavis_2010_07_001 crossref_primary_10_1016_j_imavis_2010_07_001 elsevier_sciencedirect_doi_10_1016_j_imavis_2010_07_001 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | 2011 2011-1-00 |
PublicationDateYYYYMMDD | 2011-01-01 |
PublicationDate_xml | – year: 2011 text: 2011 |
PublicationDecade | 2010 |
PublicationTitle | Image and vision computing |
PublicationYear | 2011 |
Publisher | Elsevier B.V |
Publisher_xml | – name: Elsevier B.V |
References | Chi, Chen, Yeh, Jhu (bb0045) 2008; vol. 23, no. 2 Wandell (bb0015) 1995 Liu, Zheng, Ding, Yuan (bb0100) 2008 Li, Itti (bb0145) May, 2008 Malvar, He, Cutler (bb0195) 2004 Peters, Itti (bb0225) 2007 Tang, Chen, Yu, Tsai (bb0075) Feb. 2006; vol. 8 Kortum, Geisler (bb0020) 1996; vol. 2657 Rauschenbach, Schumann (bb0035) 1999; vol. 23, no. 6 Wiegand, Sullivan, Bjntegaard, Luthra (bb0010) July 2003; vol. 13 Lee, Bovik, Kim (bb0030) 1999 Lai, Wong, Lun (bb0060) Aug. 2002; vol. 1 Cerf, Harel, Einhauser, Koch (bb0055) 2008; vol. 20 “Tutorial: objective perceptual assessment of video quality: full reference television,”, ITU-T Technical tutorials, 2005. Wolfe (bb0175) 1998; vol. 8 Itti, Koch (bb0095) Mar, 2001; vol. 2 Minoo, Nguyen (bb0080) Nov. 2005 Jiang, Ling (bb0125) 2006; vol. 16, no. 5 Wang, Ostermann, Zhang (bb0190) 2002 Sun, Ahmad, Li, Zhang (bb0135) 2006; vol. 8, no. 1 Weigand, Sullivan, Luthra (bb0005) 2003 Doulamis, Doulamis, Kalogera, Kollias (bb0025) 1998; vol. 8 Itti (bb0105) 2004; vol. 13, no. 10 Wang, Zhang, Li (bb0150) 2000 Vranjes, Rimac-Drlje, Zagar (bb0205) 2008 Ehinger, Hidalgo-Sotelo, Torralba, Oliva (bb0215) 2009; vol. 17, no. 6&7 Huang, Lin (bb0085) 2007; vol. 9, no. 6 Chen, Qiu, Lu, Zhu, Chen, Gu, Charles (bb0120) 2007 Lai, Gu, Wang, Ma, Zhang (bb0115) 2004 Watson, Hu, McGowan (bb0160) 2001; vol. 10, no. 1 Treisman, Gelade (bb0180) 1980; vol. 12 Itti, Koch (bb0185) Jan. 2001; vol. 10 Li, Itti (bb0220) 2009 Bamber (bb0200) 1975; vol. 12 Parkhurst, Niebur (bb0040) 2002; vol. 44, no. 4 Itti (bb0110) 2004; vol. 5292 Liu, Li, Soh (bb0140) 2008; vol. 18, no. 1 Itti, Koch, Niebur (bb0090) Nov. 1998; vol. 20 Watson (bb0210) 1998; vol. 3299 L. S. Karlsson, “Spatio-temporal pre-processing methods for region-of-interest video coding,” PhD dissertation, Sundsvall, Sweden, 2007. Hershler, Hochstein (bb0050) 2005; vol. 45, no. 13 Webster, Jones, Pinson, Voran, Wolf, Webster, Jones, Pinson, Voran, Wolf (bb0155) 1993; vol. 1913 http://ilab.usc.edu/toolkit. Tong, Rao (bb0065) July, 2006; vol. 15, no. 3 Lee, Pattechis, Bovik (bb0130) 2001; vol. 10, no. 7 Minoo (10.1016/j.imavis.2010.07.001_bb0080) 2005 Itti (10.1016/j.imavis.2010.07.001_bb0110) 2004; vol. 5292 Itti (10.1016/j.imavis.2010.07.001_bb0090) 1998; vol. 20 10.1016/j.imavis.2010.07.001_bb0165 Itti (10.1016/j.imavis.2010.07.001_bb0105) 2004; vol. 13, no. 10 Vranjes (10.1016/j.imavis.2010.07.001_bb0205) 2008 Chi (10.1016/j.imavis.2010.07.001_bb0045) 2008; vol. 23, no. 2 Hershler (10.1016/j.imavis.2010.07.001_bb0050) 2005; vol. 45, no. 13 Ehinger (10.1016/j.imavis.2010.07.001_bb0215) 2009; vol. 17, no. 6&7 Wiegand (10.1016/j.imavis.2010.07.001_bb0010) 2003; vol. 13 Itti (10.1016/j.imavis.2010.07.001_bb0185) 2001; vol. 10 Chen (10.1016/j.imavis.2010.07.001_bb0120) 2007 Wolfe (10.1016/j.imavis.2010.07.001_bb0175) 1998; vol. 8 Kortum (10.1016/j.imavis.2010.07.001_bb0020) 1996; vol. 2657 Tang (10.1016/j.imavis.2010.07.001_bb0075) 2006; vol. 8 Cerf (10.1016/j.imavis.2010.07.001_bb0055) 2008; vol. 20 Liu (10.1016/j.imavis.2010.07.001_bb0140) 2008; vol. 18, no. 1 Lai (10.1016/j.imavis.2010.07.001_bb0060) 2002; vol. 1 Peters (10.1016/j.imavis.2010.07.001_bb0225) 2007 Doulamis (10.1016/j.imavis.2010.07.001_bb0025) 1998; vol. 8 Lee (10.1016/j.imavis.2010.07.001_bb0030) 1999 Malvar (10.1016/j.imavis.2010.07.001_bb0195) 2004 Wandell (10.1016/j.imavis.2010.07.001_bb0015) 1995 10.1016/j.imavis.2010.07.001_bb0170 10.1016/j.imavis.2010.07.001_bb0070 Sun (10.1016/j.imavis.2010.07.001_bb0135) 2006; vol. 8, no. 1 Lee (10.1016/j.imavis.2010.07.001_bb0130) 2001; vol. 10, no. 7 Watson (10.1016/j.imavis.2010.07.001_bb0210) 1998; vol. 3299 Tong (10.1016/j.imavis.2010.07.001_bb0065) 2006; vol. 15, no. 3 Parkhurst (10.1016/j.imavis.2010.07.001_bb0040) 2002; vol. 44, no. 4 Weigand (10.1016/j.imavis.2010.07.001_bb0005) 2003 Lai (10.1016/j.imavis.2010.07.001_bb0115) 2004 Li (10.1016/j.imavis.2010.07.001_bb0220) 2009 Rauschenbach (10.1016/j.imavis.2010.07.001_bb0035) 1999; vol. 23, no. 6 Watson (10.1016/j.imavis.2010.07.001_bb0160) 2001; vol. 10, no. 1 Itti (10.1016/j.imavis.2010.07.001_bb0095) 2001; vol. 2 Liu (10.1016/j.imavis.2010.07.001_bb0100) 2008 Bamber (10.1016/j.imavis.2010.07.001_bb0200) 1975; vol. 12 Huang (10.1016/j.imavis.2010.07.001_bb0085) 2007; vol. 9, no. 6 Jiang (10.1016/j.imavis.2010.07.001_bb0125) 2006; vol. 16, no. 5 Webster (10.1016/j.imavis.2010.07.001_bb0155) 1993; vol. 1913 Treisman (10.1016/j.imavis.2010.07.001_bb0180) 1980; vol. 12 Wang (10.1016/j.imavis.2010.07.001_bb0150) 2000 Wang (10.1016/j.imavis.2010.07.001_bb0190) 2002 Li (10.1016/j.imavis.2010.07.001_bb0145) 2008 |
References_xml | – volume: vol. 15, no. 3 year: July, 2006 ident: bb0065 article-title: Region-of-interest based rate control for low-bit-rate video conferencing publication-title: Journal of Electronic Imaging – volume: vol. 44, no. 4 start-page: 611 year: 2002 end-page: 629 ident: bb0040 article-title: Variable-resolution displays: a theoretical, practical, and behavioral evalutation publication-title: Human Factors – volume: vol. 9, no. 6 start-page: 1113 year: 2007 end-page: 1124 ident: bb0085 article-title: A novel 4-D perceptual quantization modeling for H.264 bit-rate control publication-title: IEEE Trans. On Multimedia – start-page: 485 year: 2004 end-page: 488 ident: bb0195 article-title: High-quality linear interporlation for demosaicing of bayer-patterned color images publication-title: Proc. ICASSP – volume: vol. 8, no. 1 start-page: 1 year: 2006 end-page: 10 ident: bb0135 article-title: Region-based rate control and bit allocation for wireless video transmission publication-title: IEEE Trans. on Multimedia – year: May, 2008 ident: bb0145 article-title: Visual attention guided video compression publication-title: Proc. Vision Science Society Annual Meeting (VSS08) – volume: vol. 12 start-page: 375 year: 1975 end-page: 387 ident: bb0200 article-title: The area above the ordinal dominance graph and the area below the receiver operating characteristic graph publication-title: Journal of Mathematical Psychology – volume: vol. 20 start-page: 241 year: 2008 end-page: 248 ident: bb0055 article-title: Predicting human gaze using low-level saliency combined with face detection publication-title: Advances in neural information processing systems – year: 2007 ident: bb0225 article-title: Beyond bottom-up: incorporating task-dependent influences into a computational model of spatial attention publication-title: Proc. CVPR – start-page: 1 year: 2008 end-page: 4 ident: bb0100 article-title: Video attention: learning to detect a salient object sequence publication-title: Proc. ICPR – volume: vol. 16, no. 5 start-page: 663 year: 2006 end-page: 669 ident: bb0125 article-title: On Lagrange multiplier and quantizer adjustment for H.264 frame layer video rate control publication-title: IEEE Trans. Circuits and Syst. Video Technol – reference: “Tutorial: objective perceptual assessment of video quality: full reference television,”, ITU-T Technical tutorials, 2005. – start-page: 90 year: 1999 end-page: 94 ident: bb0030 article-title: Low delay foveated visual communications over wireless channels publication-title: Proc. IEEE Int. Conf. Image Processing – year: 2003 ident: bb0005 article-title: Draft ITU-T recommendation H.264 and final draft international standard 14496-10 advanced video coding publication-title: Joint Video Teams of ISO/IECJTCI/SC29/WG11 and ITU-T SG/16/Q.6Doc.JVT-G050r, Geneva, Switzerland – year: 2002 ident: bb0190 article-title: Video Processing and Communication publication-title: Pearson Education – volume: vol. 10, no. 7 start-page: 977 year: 2001 end-page: 992 ident: bb0130 article-title: Foveated video compression with optimal rate control publication-title: IEEE Trans. on Image Processing – volume: vol. 10, no. 1 start-page: 20 year: 2001 end-page: 29 ident: bb0160 article-title: Digital video quality metric based on human vision publication-title: Journal of Electronic Imaging – volume: vol. 18, no. 1 start-page: 134 year: 2008 end-page: 139 ident: bb0140 article-title: Region-of-interest based resource allocation for conversational video communications of H.264/AVC publication-title: IEEE Trans. Circuits and Syst. Video Technol – volume: vol. 2657 start-page: 350 year: 1996 end-page: 360 ident: bb0020 article-title: Implementation of a foveated image coding system for bandwidth reduction of video images publication-title: Proc. SPIE – reference: L. S. Karlsson, “Spatio-temporal pre-processing methods for region-of-interest video coding,” PhD dissertation, Sundsvall, Sweden, 2007. – volume: vol. 12 start-page: 97 year: 1980 end-page: 136 ident: bb0180 article-title: A feature-integration theory of attention publication-title: Cognition Psychology – volume: vol. 23, no. 6 start-page: 857 year: 1999 end-page: 866 ident: bb0035 article-title: Demand-driven image transmission with levels of detail and region s of interest publication-title: Comput. Graph – volume: vol. 23, no. 2 start-page: 127 year: 2008 end-page: 142 ident: bb0045 article-title: Region-of-interest video coding based on rate and distortion variations for H.263+ publication-title: Image Communication – year: 2009 ident: bb0220 article-title: Gist based top-down templates for gaze prediction publication-title: Proc. Vision Science Society Annual Meeting – volume: vol. 3299 start-page: 139 year: 1998 end-page: 147 ident: bb0210 article-title: Toward a perceptual video quality metric publication-title: Human Vision, Visual Processing, and Digital Display – volume: vol. 8 start-page: 11 year: Feb. 2006 end-page: 18 ident: bb0075 article-title: Visual sensitivity guided bit allocation for video coding publication-title: IEEE Trans. Multimedia – start-page: 3634 year: 2007 end-page: 3638 ident: bb0120 article-title: Improving video coding at scene cuts using attention based adaptive bit allocation publication-title: Proc. ISCAS – volume: vol. 8 start-page: 303 year: 1998 end-page: 304 ident: bb0175 article-title: Visual memory: what do you know about what you saw? publication-title: Current Biology – volume: vol. 8 start-page: 928 year: 1998 end-page: 934 ident: bb0025 article-title: Improving the performance of MPEG coders using adaptive regions of interest publication-title: IEEE Trans. On Circuits syst. Video Technol – volume: vol. 45, no. 13 start-page: 1707 year: 2005 end-page: 1724 ident: bb0050 article-title: At first sight: a high-level pop out effects for faces publication-title: Vision Research – volume: vol. 1913 start-page: 15 year: 1993 end-page: 26 ident: bb0155 article-title: Objective video quality assessment system based on human perception publication-title: Proc. SPIE – start-page: 791 year: 2000 end-page: 794 ident: bb0150 article-title: Objective quality evaluation of digital video publication-title: Proc. PCCAS – volume: vol. 13, no. 10 start-page: 1304 year: 2004 end-page: 1318 ident: bb0105 article-title: Automatic foveation for video compression using a neurobiological model of visual attention publication-title: IEEE Trans. on Image Processing – year: 2008 ident: bb0205 article-title: Subjective and objective quality evaluation of the H.264/AVC coded video publication-title: Proc. Systems, Signals and Image Processing – year: 1995 ident: bb0015 article-title: Foundations of Vision – year: Nov. 2005 ident: bb0080 article-title: Perceptual video coding with H.264 publication-title: Proc. 39th Asilomar Conf. Signals, Systems, and Computers – volume: vol. 5292 start-page: 272 year: 2004 end-page: 283 ident: bb0110 article-title: Automatic attention-based prioritization of unconstrained video for compression publication-title: Proc. SPIE Human Vision and Electronic Imaging – volume: vol. 13 start-page: 560 year: July 2003 end-page: 576 ident: bb0010 article-title: Overview of the H.264/AVC video coding standard publication-title: IEEE Trans. Circuits and Syst. Video Technol – year: 2004 ident: bb0115 article-title: A content-based bit allocation model for video streaming publication-title: Proc. IEEE international Conference on Multimedia and Expo – volume: vol. 20 start-page: 1254 year: Nov. 1998 end-page: 1259 ident: bb0090 article-title: A model of saliency-based visual attention for rapid scene analysis publication-title: IEEE Transactions on Pattern Analysis and Machine Intelligence – volume: vol. 10 start-page: 161 year: Jan. 2001 end-page: 169 ident: bb0185 article-title: Feature combination strategies for saliency-based visual attention systems publication-title: Journal of Electronic Imaging – reference: http://ilab.usc.edu/toolkit. – volume: vol. 1 start-page: 656 year: Aug. 2002 end-page: 659 ident: bb0060 article-title: A rate control algorithm using human visual system for video conferencing systems publication-title: Proc. Int. Conf. Signal Processing – volume: vol. 2 start-page: 194 year: Mar, 2001 end-page: 203 ident: bb0095 article-title: Computational modeling of visual attention publication-title: Nature Reviews, Neuroscience – volume: vol. 17, no. 6&7 start-page: 945 year: 2009 end-page: 978 ident: bb0215 article-title: Modeling search for people in 900 scenes: a combined source model of eye guidance publication-title: Visual Cognition – volume: vol. 5292 start-page: 272 year: 2004 ident: 10.1016/j.imavis.2010.07.001_bb0110 article-title: Automatic attention-based prioritization of unconstrained video for compression – volume: vol. 13, no. 10 start-page: 1304 year: 2004 ident: 10.1016/j.imavis.2010.07.001_bb0105 article-title: Automatic foveation for video compression using a neurobiological model of visual attention – volume: vol. 17, no. 6&7 start-page: 945 year: 2009 ident: 10.1016/j.imavis.2010.07.001_bb0215 article-title: Modeling search for people in 900 scenes: a combined source model of eye guidance – volume: vol. 10 start-page: 161 year: 2001 ident: 10.1016/j.imavis.2010.07.001_bb0185 article-title: Feature combination strategies for saliency-based visual attention systems – volume: vol. 12 start-page: 375 year: 1975 ident: 10.1016/j.imavis.2010.07.001_bb0200 article-title: The area above the ordinal dominance graph and the area below the receiver operating characteristic graph – year: 2009 ident: 10.1016/j.imavis.2010.07.001_bb0220 article-title: Gist based top-down templates for gaze prediction – ident: 10.1016/j.imavis.2010.07.001_bb0070 – year: 2002 ident: 10.1016/j.imavis.2010.07.001_bb0190 article-title: Video Processing and Communication – start-page: 791 year: 2000 ident: 10.1016/j.imavis.2010.07.001_bb0150 article-title: Objective quality evaluation of digital video – start-page: 3634 year: 2007 ident: 10.1016/j.imavis.2010.07.001_bb0120 article-title: Improving video coding at scene cuts using attention based adaptive bit allocation – volume: vol. 8 start-page: 303 year: 1998 ident: 10.1016/j.imavis.2010.07.001_bb0175 article-title: Visual memory: what do you know about what you saw? – ident: 10.1016/j.imavis.2010.07.001_bb0170 – volume: vol. 1 start-page: 656 year: 2002 ident: 10.1016/j.imavis.2010.07.001_bb0060 article-title: A rate control algorithm using human visual system for video conferencing systems – volume: vol. 44, no. 4 start-page: 611 year: 2002 ident: 10.1016/j.imavis.2010.07.001_bb0040 article-title: Variable-resolution displays: a theoretical, practical, and behavioral evalutation – start-page: 90 year: 1999 ident: 10.1016/j.imavis.2010.07.001_bb0030 article-title: Low delay foveated visual communications over wireless channels – start-page: 1 year: 2008 ident: 10.1016/j.imavis.2010.07.001_bb0100 article-title: Video attention: learning to detect a salient object sequence – year: 2005 ident: 10.1016/j.imavis.2010.07.001_bb0080 article-title: Perceptual video coding with H.264 – volume: vol. 9, no. 6 start-page: 1113 year: 2007 ident: 10.1016/j.imavis.2010.07.001_bb0085 article-title: A novel 4-D perceptual quantization modeling for H.264 bit-rate control – volume: vol. 13 start-page: 560 year: 2003 ident: 10.1016/j.imavis.2010.07.001_bb0010 article-title: Overview of the H.264/AVC video coding standard – year: 2008 ident: 10.1016/j.imavis.2010.07.001_bb0145 article-title: Visual attention guided video compression – volume: vol. 23, no. 2 start-page: 127 year: 2008 ident: 10.1016/j.imavis.2010.07.001_bb0045 article-title: Region-of-interest video coding based on rate and distortion variations for H.263+ – year: 2004 ident: 10.1016/j.imavis.2010.07.001_bb0115 article-title: A content-based bit allocation model for video streaming – volume: vol. 3299 start-page: 139 year: 1998 ident: 10.1016/j.imavis.2010.07.001_bb0210 article-title: Toward a perceptual video quality metric – year: 2007 ident: 10.1016/j.imavis.2010.07.001_bb0225 article-title: Beyond bottom-up: incorporating task-dependent influences into a computational model of spatial attention – volume: vol. 20 start-page: 241 year: 2008 ident: 10.1016/j.imavis.2010.07.001_bb0055 article-title: Predicting human gaze using low-level saliency combined with face detection – volume: vol. 20 start-page: 1254 year: 1998 ident: 10.1016/j.imavis.2010.07.001_bb0090 article-title: A model of saliency-based visual attention for rapid scene analysis – volume: vol. 18, no. 1 start-page: 134 year: 2008 ident: 10.1016/j.imavis.2010.07.001_bb0140 article-title: Region-of-interest based resource allocation for conversational video communications of H.264/AVC – start-page: 485 year: 2004 ident: 10.1016/j.imavis.2010.07.001_bb0195 article-title: High-quality linear interporlation for demosaicing of bayer-patterned color images – volume: vol. 10, no. 1 start-page: 20 year: 2001 ident: 10.1016/j.imavis.2010.07.001_bb0160 article-title: Digital video quality metric based on human vision – volume: vol. 2657 start-page: 350 year: 1996 ident: 10.1016/j.imavis.2010.07.001_bb0020 article-title: Implementation of a foveated image coding system for bandwidth reduction of video images – year: 2008 ident: 10.1016/j.imavis.2010.07.001_bb0205 article-title: Subjective and objective quality evaluation of the H.264/AVC coded video – year: 1995 ident: 10.1016/j.imavis.2010.07.001_bb0015 – volume: vol. 15, no. 3 year: 2006 ident: 10.1016/j.imavis.2010.07.001_bb0065 article-title: Region-of-interest based rate control for low-bit-rate video conferencing – volume: vol. 2 start-page: 194 year: 2001 ident: 10.1016/j.imavis.2010.07.001_bb0095 article-title: Computational modeling of visual attention – volume: vol. 10, no. 7 start-page: 977 year: 2001 ident: 10.1016/j.imavis.2010.07.001_bb0130 article-title: Foveated video compression with optimal rate control – volume: vol. 8 start-page: 928 year: 1998 ident: 10.1016/j.imavis.2010.07.001_bb0025 article-title: Improving the performance of MPEG coders using adaptive regions of interest – ident: 10.1016/j.imavis.2010.07.001_bb0165 – volume: vol. 16, no. 5 start-page: 663 year: 2006 ident: 10.1016/j.imavis.2010.07.001_bb0125 article-title: On Lagrange multiplier and quantizer adjustment for H.264 frame layer video rate control – volume: vol. 8, no. 1 start-page: 1 year: 2006 ident: 10.1016/j.imavis.2010.07.001_bb0135 article-title: Region-based rate control and bit allocation for wireless video transmission – volume: vol. 1913 start-page: 15 year: 1993 ident: 10.1016/j.imavis.2010.07.001_bb0155 article-title: Objective video quality assessment system based on human perception – year: 2003 ident: 10.1016/j.imavis.2010.07.001_bb0005 article-title: Draft ITU-T recommendation H.264 and final draft international standard 14496-10 advanced video coding – volume: vol. 45, no. 13 start-page: 1707 year: 2005 ident: 10.1016/j.imavis.2010.07.001_bb0050 article-title: At first sight: a high-level pop out effects for faces – volume: vol. 23, no. 6 start-page: 857 year: 1999 ident: 10.1016/j.imavis.2010.07.001_bb0035 article-title: Demand-driven image transmission with levels of detail and region s of interest – volume: vol. 8 start-page: 11 year: 2006 ident: 10.1016/j.imavis.2010.07.001_bb0075 article-title: Visual sensitivity guided bit allocation for video coding – volume: vol. 12 start-page: 97 year: 1980 ident: 10.1016/j.imavis.2010.07.001_bb0180 article-title: A feature-integration theory of attention |
SSID | ssj0007079 |
Score | 2.4343104 |
Snippet | A visual attention-based bit allocation strategy for video compression is proposed. Saliency-based attention prediction is used to detect interesting regions... |
SourceID | crossref elsevier |
SourceType | Enrichment Source Index Database Publisher |
StartPage | 1 |
SubjectTerms | Eye-tracking Video compression Video subjective quality Visual attention |
Title | Visual attention guided bit allocation in video compression |
URI | https://dx.doi.org/10.1016/j.imavis.2010.07.001 |
Volume | 29 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LSwMxEA5FL3rwURXro-TgNXYf2U0WT6VYqmIvWultyUtZ0W2x26u_3cwmqxVEQdjTkoEwJDPfJN98Qegspqk2Jo1IoDNDKDOacKpjYrEx5VqlTIfQjXw7TkcTej1Npi00aHphgFbpY7-L6XW09n963pu9eVH07mz1EHEO-R8KGw6Kn5QyWOXn7180D1CAc-csdufb0U37XM3xKl6hld8TvEDLMPw5Pa2knOEO2vJYEffddHZRy5RttO1xI_a7ctFGmyuignvo4qFYLK0ZCGfWVEb8tCy0NZBFheGa3R3S4aLE0IM3w8Aqd2zYch9Nhpf3gxHxTyQQZbF-RWy9I4RWmRIhqNBImZqA61ArLmjGTJaoJOIJE5HMjJB21CNjwkI0keok4zSOD9BaOSvNIcKBltLWyTJSMqa23hahLVYSKSIuAQToDoobz-TK64fDMxYveUMUe86dP3PwZx7AxXbYQeTTau70M_4Yzxqn59_WQW5D_K-WR_-2PEYb7qQYvhO0Vr0tzamFGpXs1mupi9b7Vzej8QfgKdPE |
linkProvider | Elsevier |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV09T8MwED2VMgADHwVE-czAGtokTuKICVVUBdoutKibZccGBUFa0XTlt-OLHSgSAgkpU-STrJPv_M5-9wxwHpBIKhX5blsmyiWxki4lMnA1NiZUplEsPexGHgyj3pjcTsJJDTpVLwzSKm3uNzm9zNb2T8t6szXLsta9rh58SnH_x8KGBiuwSnT44jMGF-9fPA-UgDMHLTr09fCqf64keWWv2MtvGV4oZuj9vD8t7Tndbdi0YNG5MvPZgZrKG7BlgaNjw3LegI0lVcFduHzI5gtthsqZJZfReVpkUhuIrHDwnt2c0jlZ7mAT3tRBWrmhw-Z7MO5ejzo9176R4KYa7BeuLng4l2mScg9laISIVJtKT6aUkyRWSZiGPg1j7otEcaFHPcYx1xiNRzJMKAmCfajn01wdgNOWQuhCWfipCIguuLmnq5VQcJ8KRAGyCUHlGZZaAXF8x-KFVUyxZ2b8ydCfrI03214T3E-rmRHQ-GN8XDmdfVsITOf4Xy0P_215Bmu90aDP-jfDuyNYN8fG-B1DvXhbqBONOwpxWq6rD9_i1VI |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Visual+attention+guided+bit+allocation+in+video+compression&rft.jtitle=Image+and+vision+computing&rft.au=Li%2C+Zhicheng&rft.au=Qin%2C+Shiyin&rft.au=Itti%2C+Laurent&rft.date=2011&rft.pub=Elsevier+B.V&rft.issn=0262-8856&rft.eissn=1872-8138&rft.volume=29&rft.issue=1&rft.spage=1&rft.epage=14&rft_id=info:doi/10.1016%2Fj.imavis.2010.07.001&rft.externalDocID=S0262885610001083 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0262-8856&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0262-8856&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0262-8856&client=summon |