画像認識コンペティションの新展開
Saved in:
Published in | 映像情報メディア学会誌 Vol. 73; no. 6; pp. 1084 - 1089 |
---|---|
Main Author | |
Format | Journal Article |
Language | Japanese |
Published |
一般社団法人 映像情報メディア学会
2019
|
Online Access | Get full text |
ISSN | 1342-6907 1881-6908 |
DOI | 10.3169/itej.73.1084 |
Cover
Author | 中山, 英樹 |
---|---|
Author_xml | – sequence: 1 fullname: 中山, 英樹 organization: 東京大学大学院情報理工学系研究科創造情報学准教授 |
BookMark | eNo9T81Kw0AYXKSCtfbmayTut_97Ein-QcGLnpdNstGEWiXJxaMNeBHxUot49SAIiuLBg4-zVOtb2KAIw8wwAwOzjFrDk6FDaBVwSEHotaxyeShpCFixBdQGpSAQGqvW3FNGGi-XULcsswhjCiCAqzbCX-OPaX09e7yaPd360Zuv57jz9YUf3fvRu68fmuT8-XPyMn29-Z5crqDF1A5K1_3TDjrY2tzv7QT9ve3d3kY_yAlTNEgIsVIwm6jUCZdoUJIksU6YYKBpajEXXAOoNAVHLRGYSxITzbWMothyRzto_Xc3Lyt76MxpkR3b4szYosrigTPNXSOpEQ01n_-b-MgWJrf0B_smYdI |
ContentType | Journal Article |
Copyright | 2019 一般社団法人 映像情報メディア学会 |
Copyright_xml | – notice: 2019 一般社団法人 映像情報メディア学会 |
DOI | 10.3169/itej.73.1084 |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Engineering |
EISSN | 1881-6908 |
EndPage | 1089 |
ExternalDocumentID | article_itej_73_6_73_1084_article_char_ja |
GroupedDBID | ALMA_UNASSIGNED_HOLDINGS CS3 JSF KQ8 RJT |
ID | FETCH-LOGICAL-j2483-d22a764ad8fe6ed91872dc9d464193fa05659118ff1e3a260572c29597bbca5e3 |
ISSN | 1342-6907 |
IngestDate | Thu Nov 07 05:45:31 EST 2024 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | true |
Issue | 6 |
Language | Japanese |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-j2483-d22a764ad8fe6ed91872dc9d464193fa05659118ff1e3a260572c29597bbca5e3 |
OpenAccessLink | https://www.jstage.jst.go.jp/article/itej/73/6/73_1084/_article/-char/ja |
PageCount | 6 |
ParticipantIDs | jstage_primary_article_itej_73_6_73_1084_article_char_ja |
PublicationCentury | 2000 |
PublicationDate | 20190000 |
PublicationDateYYYYMMDD | 2019-01-01 |
PublicationDate_xml | – year: 2019 text: 20190000 |
PublicationDecade | 2010 |
PublicationTitle | 映像情報メディア学会誌 |
PublicationTitleAlternate | 映情学誌 |
PublicationYear | 2019 |
Publisher | 一般社団法人 映像情報メディア学会 |
Publisher_xml | – name: 一般社団法人 映像情報メディア学会 |
References | 32)S. Ren, K. He, R. Girshick and J. Sun: “Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks”, in Proc. of NIPS(2015 21)C. Sun, A. Shrivastava, S. Singh and A. Gupta: “Revisiting Unreasonable Effectiveness of Data in Deep Learning Era”, in Proc. of IEEE ICCV(2017 29)K. He, X. Zhang, S. Ren and J. Sun: “Deep Residual Learning for Image Recognition”, in Proc. of IEEE CVPR(2016 15)R. Krishna, Y. Zhu, O. Groth, J. Johnson, K. Hata, J. Kravitz, S. Chen, Y. Kalanditis, L.-J. Li, D.A. Shamma, M.S. Bernstein, L. Fei-Fei: “Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations”, International Journal of Computer Vision, 123, 1, pp 32-73(2017 22)D. Mahajan, R. Girshick, V. Ramanathan, K. He, M. Paluri, Y. Li, A. Bharambe, L. van der Maaten: “Exploring the Limits of Weakly Supervised Pretraining”, in Proc. of ECCV(2018 6)J. Xiao, J. Hays, K.A. Ehinger and A. Torralba: “SUN Database : Large-Scale Scene Recognition from Abbey to Zoo”, in Proc. of IEEE CVPR, pp.3485-3492(2010 8)A. Karpathy, G. Toderici, S. Shetty, T. Leung, R. Sukthankar and L. Fei-Fei: “Large-Scale Video Classification with Convolutional Neural Networks”, in Proc. of IEEE CVPR, pp.1725-1732(2014 20)A. Miech, D. Zhukov, J.-B. Alayrac, M. Tapaswi, I. Laptev and J. Sivic: “HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips”, in Proc. of IEEE ICCV (2019 23)O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, A.C. Berg and L. Fei-Fei: “ImageNet Large Scale Visual Recognition Challenge”, International Journal of Computer Vision, 115, 3, pp 211-252(2015 33)T.-Y. Lin, P. Doll_r, R. Girshick, K. He, B. Hariharan and S. Belongie: “Feature Pyramid Networks for Object Detection”, in Proc. of IEEE CVPR(2017 17)S. Abu-El-Haija, N. Kothari, J. Lee, P. Natsev, G. Toderici, B. Varadarajan, S. Vijayanarasimhan: “Youtube-8M: A large-scale video classification benchmark”, arXiv preprint arXiv:1609.08675(2016 10)A. Rohrbach, M. Rohrbach, N. Tandon and B. Schiele: “A dataset for Movie Description”, in Proc. of IEEE CVPR, pp.3202-3212(2015 30)J. Hu, L. Shen, S. Albanie, G. Sun and E. Wu: “Squeeze-and-Excitation Networks”, in Proc. of IEEE CVPR(2018 14)A. Das, S. Kottur, K. Gupta, A. Singh, D. Yadav, J.M. F. Moura, D. Parikh and D. Batra: “Visual Dialog”, in Proc. of IEEE CVPR(2017 19)W. Li, L. Wang, W. Li, E. Agustsson and L.V. Gool: “WebVision Database: Visual Learning and Understanding from Web Data”, arXiv preprint, .arXiv: 1708.02862(2017 9)F.C. Heilbron, V. Escorcia, B. Ghanem, J.C. Niebles and U. Norte: “ActivityNet : A Large-Scale Video Benchmark for Human Activity Understanding”, in Proc. of IEEE CVPR(2015 1)J. Ponce, T.L. Berg, M. Everingham, D.A. Forsyth, M. Hebert, S. Lazebnik, M. Marszalek, C. Schmid, B.C. Russell, A. Torralba, C.K.I. Williams, J. Zhang and A. Zisserman: “Dataset issues in object recognition”, Toward Category-Level Object Recognition, 4170, pp.29-48, Springer-Verlag Berlin Heidelberg(2006 4)T.-Y. Lin, M. Maire, S. Belongie, L.D. Bourdev, R.B. Girshick, J. Hays, P. Perona, D. Ramanan, P. Dollár and C.L. Zitnick: “Microsoft COCO: Common Objects in Context”, in Proc. of ECCV, pp.740-755(2014 31)A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, L. Kaiser and I. Polosukhin: “Attention is All You Need”, in Proc. of NIPS(2017 5)中山英樹:“深層畳み込みニューラルネットによる画像特徴抽出と転移学習”,信学技報, 115, 146. pp.55-59(2015 26)R. Girshick, J. Donahue, T. Darrell and J. Malik: “Rich feature hierarchies for accurate object detection and semantic segmentation”, in Proc. of IEEE CVPR(2014 18)B. Thomee, D.A. Shamma, G. Friedland, B. Elizalde, K. Ni, D. Poland, D. Borth and L.-J. Li: “YFCC100M: the new data in multimedia research”, Communications of the ACM, 59, 2, Pages 64-73(2016 24)M. Everingham, S.M.A. Eslami, L. Van Gool, C.K.I. Williams, J. Winn and A. Zisserman: “The Pascal Visual Object Classes Challenge: A Retrospective”, International Journal of Computer Vision, 111, pp.98-136(2014 11)J. Xu, T. Mei, T. Yao and Y. Rui: “MSR-VTT: A large video description dataset for bridging video and language”, in Proc. of IEEE CVPR(2016 28)K. He, X. Zhang, S. Ren and J. Sun: “Delving Deep into Rectifiers : Surpassing Human-Level Performance on ImageNet Classification”, in Proc. of IEEE ICCV(2015 13)Y. Zhu, O. Groth, M. Bernstein and L. Fei-Fei: “Visual7W: Grounded Question Answering in Images”, in Proc. of IEEE CVPR, pp.4995-5004(2016 27)S. Ioffe and C. Szegedy: “Batch Normalization : Accelerating Deep Network Training by Reducing Internal Covariate Shift”, in Proc. of ICML(2015 2)J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li and L. Fei-Fei: “ImageNet: A Large-scale Hierarchical Image Database”, in Proc. of IEEE CVPR (2009 7)B. Zhou, A. Lapedriza, J. Xiao, A. Torralba and A. Oliva: “Learning Deep Features for Scene Recognition using Places Database”, in Proc. of NIPS, pp.487-495(2014 3)C. Fellbaum: “WordNet: An electronic lexical database”, MIT Press(1998 16)I. Krasin, T. Duerig, N. Alldrin, V. Ferrari, S. Abu-El-Haija, A. Kuznetsova, H. Rom, J. Uijlings, S. Popov, A. Veit, S. Belongie, V. Gomes, A. Gupta, C. Sun, G. Chechik, D. Cai, Z. Feng, D. Narayanan and K. Murphy: “Openimages: A public dataset for large-scale multilabel and multi-class image classification”(2016),Dataset available from https://github.com/openimages 25)A. Krizhevsky, I. Sutskever and G.E. Hinton: “ImageNet Classification with Deep Convolutional Neural Networks”, in Proc. of NIPS, pp.1097-1105(2012 12)A. Agrawal, J. Lu, S. Antol, M. Mitchell, C.L. Zitnick, D. Batra and D. Parikh: “VQA: Visual Question Answering”, in Proc. of IEEE ICCV (2015 |
References_xml | – reference: 7)B. Zhou, A. Lapedriza, J. Xiao, A. Torralba and A. Oliva: “Learning Deep Features for Scene Recognition using Places Database”, in Proc. of NIPS, pp.487-495(2014) – reference: 23)O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, A.C. Berg and L. Fei-Fei: “ImageNet Large Scale Visual Recognition Challenge”, International Journal of Computer Vision, 115, 3, pp 211-252(2015) – reference: 8)A. Karpathy, G. Toderici, S. Shetty, T. Leung, R. Sukthankar and L. Fei-Fei: “Large-Scale Video Classification with Convolutional Neural Networks”, in Proc. of IEEE CVPR, pp.1725-1732(2014) – reference: 17)S. Abu-El-Haija, N. Kothari, J. Lee, P. Natsev, G. Toderici, B. Varadarajan, S. Vijayanarasimhan: “Youtube-8M: A large-scale video classification benchmark”, arXiv preprint arXiv:1609.08675(2016) – reference: 11)J. Xu, T. Mei, T. Yao and Y. Rui: “MSR-VTT: A large video description dataset for bridging video and language”, in Proc. of IEEE CVPR(2016) – reference: 22)D. Mahajan, R. Girshick, V. Ramanathan, K. He, M. Paluri, Y. Li, A. Bharambe, L. van der Maaten: “Exploring the Limits of Weakly Supervised Pretraining”, in Proc. of ECCV(2018) – reference: 3)C. Fellbaum: “WordNet: An electronic lexical database”, MIT Press(1998) – reference: 15)R. Krishna, Y. Zhu, O. Groth, J. Johnson, K. Hata, J. Kravitz, S. Chen, Y. Kalanditis, L.-J. Li, D.A. Shamma, M.S. Bernstein, L. Fei-Fei: “Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations”, International Journal of Computer Vision, 123, 1, pp 32-73(2017) – reference: 26)R. Girshick, J. Donahue, T. Darrell and J. Malik: “Rich feature hierarchies for accurate object detection and semantic segmentation”, in Proc. of IEEE CVPR(2014) – reference: 4)T.-Y. Lin, M. Maire, S. Belongie, L.D. Bourdev, R.B. Girshick, J. Hays, P. Perona, D. Ramanan, P. Dollár and C.L. Zitnick: “Microsoft COCO: Common Objects in Context”, in Proc. of ECCV, pp.740-755(2014) – reference: 14)A. Das, S. Kottur, K. Gupta, A. Singh, D. Yadav, J.M. F. Moura, D. Parikh and D. Batra: “Visual Dialog”, in Proc. of IEEE CVPR(2017) – reference: 19)W. Li, L. Wang, W. Li, E. Agustsson and L.V. Gool: “WebVision Database: Visual Learning and Understanding from Web Data”, arXiv preprint, .arXiv: 1708.02862(2017) – reference: 10)A. Rohrbach, M. Rohrbach, N. Tandon and B. Schiele: “A dataset for Movie Description”, in Proc. of IEEE CVPR, pp.3202-3212(2015) – reference: 6)J. Xiao, J. Hays, K.A. Ehinger and A. Torralba: “SUN Database : Large-Scale Scene Recognition from Abbey to Zoo”, in Proc. of IEEE CVPR, pp.3485-3492(2010) – reference: 13)Y. Zhu, O. Groth, M. Bernstein and L. Fei-Fei: “Visual7W: Grounded Question Answering in Images”, in Proc. of IEEE CVPR, pp.4995-5004(2016) – reference: 18)B. Thomee, D.A. Shamma, G. Friedland, B. Elizalde, K. Ni, D. Poland, D. Borth and L.-J. Li: “YFCC100M: the new data in multimedia research”, Communications of the ACM, 59, 2, Pages 64-73(2016) – reference: 12)A. Agrawal, J. Lu, S. Antol, M. Mitchell, C.L. Zitnick, D. Batra and D. Parikh: “VQA: Visual Question Answering”, in Proc. of IEEE ICCV (2015) – reference: 9)F.C. Heilbron, V. Escorcia, B. Ghanem, J.C. Niebles and U. Norte: “ActivityNet : A Large-Scale Video Benchmark for Human Activity Understanding”, in Proc. of IEEE CVPR(2015) – reference: 20)A. Miech, D. Zhukov, J.-B. Alayrac, M. Tapaswi, I. Laptev and J. Sivic: “HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips”, in Proc. of IEEE ICCV (2019) – reference: 33)T.-Y. Lin, P. Doll_r, R. Girshick, K. He, B. Hariharan and S. Belongie: “Feature Pyramid Networks for Object Detection”, in Proc. of IEEE CVPR(2017) – reference: 2)J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li and L. Fei-Fei: “ImageNet: A Large-scale Hierarchical Image Database”, in Proc. of IEEE CVPR (2009) – reference: 30)J. Hu, L. Shen, S. Albanie, G. Sun and E. Wu: “Squeeze-and-Excitation Networks”, in Proc. of IEEE CVPR(2018) – reference: 24)M. Everingham, S.M.A. Eslami, L. Van Gool, C.K.I. Williams, J. Winn and A. Zisserman: “The Pascal Visual Object Classes Challenge: A Retrospective”, International Journal of Computer Vision, 111, pp.98-136(2014) – reference: 1)J. Ponce, T.L. Berg, M. Everingham, D.A. Forsyth, M. Hebert, S. Lazebnik, M. Marszalek, C. Schmid, B.C. Russell, A. Torralba, C.K.I. Williams, J. Zhang and A. Zisserman: “Dataset issues in object recognition”, Toward Category-Level Object Recognition, 4170, pp.29-48, Springer-Verlag Berlin Heidelberg(2006) – reference: 27)S. Ioffe and C. Szegedy: “Batch Normalization : Accelerating Deep Network Training by Reducing Internal Covariate Shift”, in Proc. of ICML(2015) – reference: 21)C. Sun, A. Shrivastava, S. Singh and A. Gupta: “Revisiting Unreasonable Effectiveness of Data in Deep Learning Era”, in Proc. of IEEE ICCV(2017) – reference: 5)中山英樹:“深層畳み込みニューラルネットによる画像特徴抽出と転移学習”,信学技報, 115, 146. pp.55-59(2015) – reference: 32)S. Ren, K. He, R. Girshick and J. Sun: “Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks”, in Proc. of NIPS(2015) – reference: 16)I. Krasin, T. Duerig, N. Alldrin, V. Ferrari, S. Abu-El-Haija, A. Kuznetsova, H. Rom, J. Uijlings, S. Popov, A. Veit, S. Belongie, V. Gomes, A. Gupta, C. Sun, G. Chechik, D. Cai, Z. Feng, D. Narayanan and K. Murphy: “Openimages: A public dataset for large-scale multilabel and multi-class image classification”(2016),Dataset available from https://github.com/openimages – reference: 29)K. He, X. Zhang, S. Ren and J. Sun: “Deep Residual Learning for Image Recognition”, in Proc. of IEEE CVPR(2016) – reference: 25)A. Krizhevsky, I. Sutskever and G.E. Hinton: “ImageNet Classification with Deep Convolutional Neural Networks”, in Proc. of NIPS, pp.1097-1105(2012) – reference: 28)K. He, X. Zhang, S. Ren and J. Sun: “Delving Deep into Rectifiers : Surpassing Human-Level Performance on ImageNet Classification”, in Proc. of IEEE ICCV(2015) – reference: 31)A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, L. Kaiser and I. Polosukhin: “Attention is All You Need”, in Proc. of NIPS(2017) |
SSID | ssib003116158 ssib017172179 ssj0061382 ssib000937025 ssib002809428 ssib002484574 ssib001234188 ssib023167534 ssib056857217 ssib023157722 |
Score | 2.1740787 |
SourceID | jstage |
SourceType | Publisher |
StartPage | 1084 |
Title | 画像認識コンペティションの新展開 |
URI | https://www.jstage.jst.go.jp/article/itej/73/6/73_1084/_article/-char/ja |
Volume | 73 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
ispartofPNX | 映像情報メディア学会誌, 2019, Vol.73(6), pp.1084-1089 |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtR1Na9RQMNR60YP4id_04DtJ1rzv947JbkpRFIQWegtJNkH20IpsL97cBS8iXrSIVw-CoCgePPhzQrX-C2desruv0kMtCCFMJjPJfCR5M5O8SRDcEsOal6LQYVlTHoqSyrCwtoQbj1cwwOUFL_CN7v0Ham1D3N2Um0snHnlfLe2Mi1759NB5JcfxKuDArzhL9h88Oz8oIAAG_8IaPAzrI_mYpJpYQZKEpJIYTswqSQ2JY2IGDhgQa0gKeEYS7gC-AGzcAUZ1NDGfEetuV6wPclESpyRVxCqSRHjShBIrSWoRYxI_0nVkIEPkyaYcIBEDeODtzjIDjP5bkpg54gGJQUjQtO_EbnWc13fdHqduKxEscOEgmUncWRSJDUmsX-HwnqAdu4kcBxynj3aNAZniAW3ixFFoA9QV8DEst_-fgt5AwQULsbLQjqMtzhiKOOOPLu2PWrq7yB8qaNT-G68LO2DTHjakcaqwIyzkH6Oe5r0F14Em4d0lmCFZpnmmcIW02WwPTuLLRpBJnGRaU_z49d5DLxCHKNV_3wzxjKB-IzthhPQSBWYi6yeunGLmMN-mGisLi_e_kERISOOYtw2J6qIxoVRGIsMshlLYGdOVSjort1NW0BZ3fEtA2DiCJGr2AaaLCdfPBme6ZG4lbpU_FyyN8vPBaa_F54Ug-vX6x9701f7Hl_uf3jaTb80UlnfN9Hkzed9MvjfTD4h59vnn7pe9r29-7764GGyspuv9tbD7S0k4AqvwcMhYrpXIh6auVDW01Gg2LO1QKAHJUZ2DchIiClPXtOI5lg80K5mFRL4oylxW_FKwvLW9VV0OViQzqtKlLetcCB3JIrd5DcYucqVLiOOvBKZVN3vctqLJjuz3q8dnvRacwruyrTteD5bHT3aqGxCJj4ub7iL6A3r5p-s |
linkProvider | Colorado Alliance of Research Libraries |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=%E7%94%BB%E5%83%8F%E8%AA%8D%E8%AD%98%E3%82%B3%E3%83%B3%E3%83%9A%E3%83%86%E3%82%A3%E3%82%B7%E3%83%A7%E3%83%B3%E3%81%AE%E6%96%B0%E5%B1%95%E9%96%8B&rft.jtitle=%E6%98%A0%E5%83%8F%E6%83%85%E5%A0%B1%E3%83%A1%E3%83%87%E3%82%A3%E3%82%A2%E5%AD%A6%E4%BC%9A%E8%AA%8C&rft.au=%E4%B8%AD%E5%B1%B1%2C+%E8%8B%B1%E6%A8%B9&rft.date=2019&rft.pub=%E4%B8%80%E8%88%AC%E7%A4%BE%E5%9B%A3%E6%B3%95%E4%BA%BA+%E6%98%A0%E5%83%8F%E6%83%85%E5%A0%B1%E3%83%A1%E3%83%87%E3%82%A3%E3%82%A2%E5%AD%A6%E4%BC%9A&rft.issn=1342-6907&rft.eissn=1881-6908&rft.volume=73&rft.issue=6&rft.spage=1084&rft.epage=1089&rft_id=info:doi/10.3169%2Fitej.73.1084&rft.externalDocID=article_itej_73_6_73_1084_article_char_ja |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1342-6907&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1342-6907&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1342-6907&client=summon |