画像認識コンペティションの新展開

Saved in:

Bibliographic Details
Published in	映像情報メディア学会誌 Vol. 73; no. 6; pp. 1084 - 1089
Main Author	中山, 英樹
Format	Journal Article
Language	Japanese
Published	一般社団法人映像情報メディア学会 2019
Online Access	Get full text
ISSN	1342-6907 1881-6908
DOI	10.3169/itej.73.1084

Cover

Author	中山, 英樹
Author_xml	– sequence: 1 fullname: 中山, 英樹 organization: 東京大学大学院情報理工学系研究科創造情報学准教授
BookMark	eNo9T81Kw0AYXKSCtfbmayTut_97Ein-QcGLnpdNstGEWiXJxaMNeBHxUot49SAIiuLBg4-zVOtb2KAIw8wwAwOzjFrDk6FDaBVwSEHotaxyeShpCFixBdQGpSAQGqvW3FNGGi-XULcsswhjCiCAqzbCX-OPaX09e7yaPd360Zuv57jz9YUf3fvRu68fmuT8-XPyMn29-Z5crqDF1A5K1_3TDjrY2tzv7QT9ve3d3kY_yAlTNEgIsVIwm6jUCZdoUJIksU6YYKBpajEXXAOoNAVHLRGYSxITzbWMothyRzto_Xc3Lyt76MxpkR3b4szYosrigTPNXSOpEQ01n_-b-MgWJrf0B_smYdI
ContentType	Journal Article
Copyright	2019 一般社団法人映像情報メディア学会
Copyright_xml	– notice: 2019 一般社団法人映像情報メディア学会
DOI	10.3169/itej.73.1084
DeliveryMethod	fulltext_linktorsrc
Discipline	Engineering
EISSN	1881-6908
EndPage	1089
ExternalDocumentID	article_itej_73_6_73_1084_article_char_ja
GroupedDBID	ALMA_UNASSIGNED_HOLDINGS CS3 JSF KQ8 RJT
ID	FETCH-LOGICAL-j2483-d22a764ad8fe6ed91872dc9d464193fa05659118ff1e3a260572c29597bbca5e3
ISSN	1342-6907
IngestDate	Thu Nov 07 05:45:31 EST 2024
IsDoiOpenAccess	true
IsOpenAccess	true
IsPeerReviewed	false
IsScholarly	true
Issue	6
Language	Japanese
LinkModel	OpenURL
MergedId	FETCHMERGED-LOGICAL-j2483-d22a764ad8fe6ed91872dc9d464193fa05659118ff1e3a260572c29597bbca5e3
OpenAccessLink	https://www.jstage.jst.go.jp/article/itej/73/6/73_1084/_article/-char/ja
PageCount	6
ParticipantIDs	jstage_primary_article_itej_73_6_73_1084_article_char_ja
PublicationCentury	2000
PublicationDate	20190000
PublicationDateYYYYMMDD	2019-01-01
PublicationDate_xml	– year: 2019 text: 20190000
PublicationDecade	2010
PublicationTitle	映像情報メディア学会誌
PublicationTitleAlternate	映情学誌
PublicationYear	2019
Publisher	一般社団法人映像情報メディア学会
Publisher_xml	– name: 一般社団法人映像情報メディア学会
References	32）S. Ren, K. He, R. Girshick and J. Sun: “Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks”, in Proc. of NIPS（2015 21）C. Sun, A. Shrivastava, S. Singh and A. Gupta: “Revisiting Unreasonable Effectiveness of Data in Deep Learning Era”, in Proc. of IEEE ICCV（2017 29）K. He, X. Zhang, S. Ren and J. Sun: “Deep Residual Learning for Image Recognition”, in Proc. of IEEE CVPR（2016 15）R. Krishna, Y. Zhu, O. Groth, J. Johnson, K. Hata, J. Kravitz, S. Chen, Y. Kalanditis, L.-J. Li, D.A. Shamma, M.S. Bernstein, L. Fei-Fei: “Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations”, International Journal of Computer Vision, 123, 1, pp 32-73（2017 22）D. Mahajan, R. Girshick, V. Ramanathan, K. He, M. Paluri, Y. Li, A. Bharambe, L. van der Maaten: “Exploring the Limits of Weakly Supervised Pretraining”, in Proc. of ECCV（2018 6）J. Xiao, J. Hays, K.A. Ehinger and A. Torralba: “SUN Database : Large-Scale Scene Recognition from Abbey to Zoo”, in Proc. of IEEE CVPR, pp.3485-3492（2010 8）A. Karpathy, G. Toderici, S. Shetty, T. Leung, R. Sukthankar and L. Fei-Fei: “Large-Scale Video Classification with Convolutional Neural Networks”, in Proc. of IEEE CVPR, pp.1725-1732（2014 20）A. Miech, D. Zhukov, J.-B. Alayrac, M. Tapaswi, I. Laptev and J. Sivic: “HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips”, in Proc. of IEEE ICCV （2019 23）O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, A.C. Berg and L. Fei-Fei: “ImageNet Large Scale Visual Recognition Challenge”, International Journal of Computer Vision, 115, 3, pp 211-252（2015 33）T.-Y. Lin, P. Doll_r, R. Girshick, K. He, B. Hariharan and S. Belongie: “Feature Pyramid Networks for Object Detection”, in Proc. of IEEE CVPR（2017 17）S. Abu-El-Haija, N. Kothari, J. Lee, P. Natsev, G. Toderici, B. Varadarajan, S. Vijayanarasimhan: “Youtube-8M: A large-scale video classification benchmark”, arXiv preprint arXiv:1609.08675（2016 10）A. Rohrbach, M. Rohrbach, N. Tandon and B. Schiele: “A dataset for Movie Description”, in Proc. of IEEE CVPR, pp.3202-3212（2015 30）J. Hu, L. Shen, S. Albanie, G. Sun and E. Wu: “Squeeze-and-Excitation Networks”, in Proc. of IEEE CVPR（2018 14）A. Das, S. Kottur, K. Gupta, A. Singh, D. Yadav, J.M. F. Moura, D. Parikh and D. Batra: “Visual Dialog”, in Proc. of IEEE CVPR（2017 19）W. Li, L. Wang, W. Li, E. Agustsson and L.V. Gool: “WebVision Database: Visual Learning and Understanding from Web Data”, arXiv preprint, .arXiv: 1708.02862（2017 9）F.C. Heilbron, V. Escorcia, B. Ghanem, J.C. Niebles and U. Norte: “ActivityNet : A Large-Scale Video Benchmark for Human Activity Understanding”, in Proc. of IEEE CVPR（2015 1）J. Ponce, T.L. Berg, M. Everingham, D.A. Forsyth, M. Hebert, S. Lazebnik, M. Marszalek, C. Schmid, B.C. Russell, A. Torralba, C.K.I. Williams, J. Zhang and A. Zisserman: “Dataset issues in object recognition”, Toward Category-Level Object Recognition, 4170, pp.29-48, Springer-Verlag Berlin Heidelberg（2006 4）T.-Y. Lin, M. Maire, S. Belongie, L.D. Bourdev, R.B. Girshick, J. Hays, P. Perona, D. Ramanan, P. Dollár and C.L. Zitnick: “Microsoft COCO: Common Objects in Context”, in Proc. of ECCV, pp.740-755（2014 31）A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, L. Kaiser and I. Polosukhin: “Attention is All You Need”, in Proc. of NIPS（2017 5）中山英樹：“深層畳み込みニューラルネットによる画像特徴抽出と転移学習”，信学技報, 115, 146. pp.55-59（2015 26）R. Girshick, J. Donahue, T. Darrell and J. Malik: “Rich feature hierarchies for accurate object detection and semantic segmentation”, in Proc. of IEEE CVPR（2014 18）B. Thomee, D.A. Shamma, G. Friedland, B. Elizalde, K. Ni, D. Poland, D. Borth and L.-J. Li: “YFCC100M: the new data in multimedia research”, Communications of the ACM, 59, 2, Pages 64-73（2016 24）M. Everingham, S.M.A. Eslami, L. Van Gool, C.K.I. Williams, J. Winn and A. Zisserman: “The Pascal Visual Object Classes Challenge: A Retrospective”, International Journal of Computer Vision, 111, pp.98-136（2014 11）J. Xu, T. Mei, T. Yao and Y. Rui: “MSR-VTT: A large video description dataset for bridging video and language”, in Proc. of IEEE CVPR（2016 28）K. He, X. Zhang, S. Ren and J. Sun: “Delving Deep into Rectifiers : Surpassing Human-Level Performance on ImageNet Classification”, in Proc. of IEEE ICCV（2015 13）Y. Zhu, O. Groth, M. Bernstein and L. Fei-Fei: “Visual7W: Grounded Question Answering in Images”, in Proc. of IEEE CVPR, pp.4995-5004（2016 27）S. Ioffe and C. Szegedy: “Batch Normalization : Accelerating Deep Network Training by Reducing Internal Covariate Shift”, in Proc. of ICML（2015 2）J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li and L. Fei-Fei: “ImageNet: A Large-scale Hierarchical Image Database”, in Proc. of IEEE CVPR （2009 7）B. Zhou, A. Lapedriza, J. Xiao, A. Torralba and A. Oliva: “Learning Deep Features for Scene Recognition using Places Database”, in Proc. of NIPS, pp.487-495（2014 3）C. Fellbaum: “WordNet: An electronic lexical database”, MIT Press（1998 16）I. Krasin, T. Duerig, N. Alldrin, V. Ferrari, S. Abu-El-Haija, A. Kuznetsova, H. Rom, J. Uijlings, S. Popov, A. Veit, S. Belongie, V. Gomes, A. Gupta, C. Sun, G. Chechik, D. Cai, Z. Feng, D. Narayanan and K. Murphy: “Openimages: A public dataset for large-scale multilabel and multi-class image classification”（2016），Dataset available from https://github.com/openimages 25）A. Krizhevsky, I. Sutskever and G.E. Hinton: “ImageNet Classification with Deep Convolutional Neural Networks”, in Proc. of NIPS, pp.1097-1105（2012 12）A. Agrawal, J. Lu, S. Antol, M. Mitchell, C.L. Zitnick, D. Batra and D. Parikh: “VQA: Visual Question Answering”, in Proc. of IEEE ICCV （2015
References_xml	– reference: 7）B. Zhou, A. Lapedriza, J. Xiao, A. Torralba and A. Oliva: “Learning Deep Features for Scene Recognition using Places Database”, in Proc. of NIPS, pp.487-495（2014） – reference: 23）O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, A.C. Berg and L. Fei-Fei: “ImageNet Large Scale Visual Recognition Challenge”, International Journal of Computer Vision, 115, 3, pp 211-252（2015） – reference: 8）A. Karpathy, G. Toderici, S. Shetty, T. Leung, R. Sukthankar and L. Fei-Fei: “Large-Scale Video Classification with Convolutional Neural Networks”, in Proc. of IEEE CVPR, pp.1725-1732（2014） – reference: 17）S. Abu-El-Haija, N. Kothari, J. Lee, P. Natsev, G. Toderici, B. Varadarajan, S. Vijayanarasimhan: “Youtube-8M: A large-scale video classification benchmark”, arXiv preprint arXiv:1609.08675（2016） – reference: 11）J. Xu, T. Mei, T. Yao and Y. Rui: “MSR-VTT: A large video description dataset for bridging video and language”, in Proc. of IEEE CVPR（2016） – reference: 22）D. Mahajan, R. Girshick, V. Ramanathan, K. He, M. Paluri, Y. Li, A. Bharambe, L. van der Maaten: “Exploring the Limits of Weakly Supervised Pretraining”, in Proc. of ECCV（2018） – reference: 3）C. Fellbaum: “WordNet: An electronic lexical database”, MIT Press（1998） – reference: 15）R. Krishna, Y. Zhu, O. Groth, J. Johnson, K. Hata, J. Kravitz, S. Chen, Y. Kalanditis, L.-J. Li, D.A. Shamma, M.S. Bernstein, L. Fei-Fei: “Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations”, International Journal of Computer Vision, 123, 1, pp 32-73（2017） – reference: 26）R. Girshick, J. Donahue, T. Darrell and J. Malik: “Rich feature hierarchies for accurate object detection and semantic segmentation”, in Proc. of IEEE CVPR（2014） – reference: 4）T.-Y. Lin, M. Maire, S. Belongie, L.D. Bourdev, R.B. Girshick, J. Hays, P. Perona, D. Ramanan, P. Dollár and C.L. Zitnick: “Microsoft COCO: Common Objects in Context”, in Proc. of ECCV, pp.740-755（2014） – reference: 14）A. Das, S. Kottur, K. Gupta, A. Singh, D. Yadav, J.M. F. Moura, D. Parikh and D. Batra: “Visual Dialog”, in Proc. of IEEE CVPR（2017） – reference: 19）W. Li, L. Wang, W. Li, E. Agustsson and L.V. Gool: “WebVision Database: Visual Learning and Understanding from Web Data”, arXiv preprint, .arXiv: 1708.02862（2017） – reference: 10）A. Rohrbach, M. Rohrbach, N. Tandon and B. Schiele: “A dataset for Movie Description”, in Proc. of IEEE CVPR, pp.3202-3212（2015） – reference: 6）J. Xiao, J. Hays, K.A. Ehinger and A. Torralba: “SUN Database : Large-Scale Scene Recognition from Abbey to Zoo”, in Proc. of IEEE CVPR, pp.3485-3492（2010） – reference: 13）Y. Zhu, O. Groth, M. Bernstein and L. Fei-Fei: “Visual7W: Grounded Question Answering in Images”, in Proc. of IEEE CVPR, pp.4995-5004（2016） – reference: 18）B. Thomee, D.A. Shamma, G. Friedland, B. Elizalde, K. Ni, D. Poland, D. Borth and L.-J. Li: “YFCC100M: the new data in multimedia research”, Communications of the ACM, 59, 2, Pages 64-73（2016） – reference: 12）A. Agrawal, J. Lu, S. Antol, M. Mitchell, C.L. Zitnick, D. Batra and D. Parikh: “VQA: Visual Question Answering”, in Proc. of IEEE ICCV （2015） – reference: 9）F.C. Heilbron, V. Escorcia, B. Ghanem, J.C. Niebles and U. Norte: “ActivityNet : A Large-Scale Video Benchmark for Human Activity Understanding”, in Proc. of IEEE CVPR（2015） – reference: 20）A. Miech, D. Zhukov, J.-B. Alayrac, M. Tapaswi, I. Laptev and J. Sivic: “HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips”, in Proc. of IEEE ICCV （2019） – reference: 33）T.-Y. Lin, P. Doll_r, R. Girshick, K. He, B. Hariharan and S. Belongie: “Feature Pyramid Networks for Object Detection”, in Proc. of IEEE CVPR（2017） – reference: 2）J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li and L. Fei-Fei: “ImageNet: A Large-scale Hierarchical Image Database”, in Proc. of IEEE CVPR （2009） – reference: 30）J. Hu, L. Shen, S. Albanie, G. Sun and E. Wu: “Squeeze-and-Excitation Networks”, in Proc. of IEEE CVPR（2018） – reference: 24）M. Everingham, S.M.A. Eslami, L. Van Gool, C.K.I. Williams, J. Winn and A. Zisserman: “The Pascal Visual Object Classes Challenge: A Retrospective”, International Journal of Computer Vision, 111, pp.98-136（2014） – reference: 1）J. Ponce, T.L. Berg, M. Everingham, D.A. Forsyth, M. Hebert, S. Lazebnik, M. Marszalek, C. Schmid, B.C. Russell, A. Torralba, C.K.I. Williams, J. Zhang and A. Zisserman: “Dataset issues in object recognition”, Toward Category-Level Object Recognition, 4170, pp.29-48, Springer-Verlag Berlin Heidelberg（2006） – reference: 27）S. Ioffe and C. Szegedy: “Batch Normalization : Accelerating Deep Network Training by Reducing Internal Covariate Shift”, in Proc. of ICML（2015） – reference: 21）C. Sun, A. Shrivastava, S. Singh and A. Gupta: “Revisiting Unreasonable Effectiveness of Data in Deep Learning Era”, in Proc. of IEEE ICCV（2017） – reference: 5）中山英樹：“深層畳み込みニューラルネットによる画像特徴抽出と転移学習”，信学技報, 115, 146. pp.55-59（2015） – reference: 32）S. Ren, K. He, R. Girshick and J. Sun: “Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks”, in Proc. of NIPS（2015） – reference: 16）I. Krasin, T. Duerig, N. Alldrin, V. Ferrari, S. Abu-El-Haija, A. Kuznetsova, H. Rom, J. Uijlings, S. Popov, A. Veit, S. Belongie, V. Gomes, A. Gupta, C. Sun, G. Chechik, D. Cai, Z. Feng, D. Narayanan and K. Murphy: “Openimages: A public dataset for large-scale multilabel and multi-class image classification”（2016），Dataset available from https://github.com/openimages – reference: 29）K. He, X. Zhang, S. Ren and J. Sun: “Deep Residual Learning for Image Recognition”, in Proc. of IEEE CVPR（2016） – reference: 25）A. Krizhevsky, I. Sutskever and G.E. Hinton: “ImageNet Classification with Deep Convolutional Neural Networks”, in Proc. of NIPS, pp.1097-1105（2012） – reference: 28）K. He, X. Zhang, S. Ren and J. Sun: “Delving Deep into Rectifiers : Surpassing Human-Level Performance on ImageNet Classification”, in Proc. of IEEE ICCV（2015） – reference: 31）A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, L. Kaiser and I. Polosukhin: “Attention is All You Need”, in Proc. of NIPS（2017）
SSID	ssib003116158 ssib017172179 ssj0061382 ssib000937025 ssib002809428 ssib002484574 ssib001234188 ssib023167534 ssib056857217 ssib023157722
Score	2.1740787
SourceID	jstage
SourceType	Publisher
StartPage	1084
Title	画像認識コンペティションの新展開
URI	https://www.jstage.jst.go.jp/article/itej/73/6/73_1084/_article/-char/ja
Volume	73
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
ispartofPNX	映像情報メディア学会誌, 2019, Vol.73(6), pp.1084-1089
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtR1Na9RQMNR60YP4id_04DtJ1rzv947JbkpRFIQWegtJNkH20IpsL97cBS8iXrSIVw-CoCgePPhzQrX-C2desruv0kMtCCFMJjPJfCR5M5O8SRDcEsOal6LQYVlTHoqSyrCwtoQbj1cwwOUFL_CN7v0Ham1D3N2Um0snHnlfLe2Mi1759NB5JcfxKuDArzhL9h88Oz8oIAAG_8IaPAzrI_mYpJpYQZKEpJIYTswqSQ2JY2IGDhgQa0gKeEYS7gC-AGzcAUZ1NDGfEetuV6wPclESpyRVxCqSRHjShBIrSWoRYxI_0nVkIEPkyaYcIBEDeODtzjIDjP5bkpg54gGJQUjQtO_EbnWc13fdHqduKxEscOEgmUncWRSJDUmsX-HwnqAdu4kcBxynj3aNAZniAW3ixFFoA9QV8DEst_-fgt5AwQULsbLQjqMtzhiKOOOPLu2PWrq7yB8qaNT-G68LO2DTHjakcaqwIyzkH6Oe5r0F14Em4d0lmCFZpnmmcIW02WwPTuLLRpBJnGRaU_z49d5DLxCHKNV_3wzxjKB-IzthhPQSBWYi6yeunGLmMN-mGisLi_e_kERISOOYtw2J6qIxoVRGIsMshlLYGdOVSjort1NW0BZ3fEtA2DiCJGr2AaaLCdfPBme6ZG4lbpU_FyyN8vPBaa_F54Ug-vX6x9701f7Hl_uf3jaTb80UlnfN9Hkzed9MvjfTD4h59vnn7pe9r29-7764GGyspuv9tbD7S0k4AqvwcMhYrpXIh6auVDW01Gg2LO1QKAHJUZ2DchIiClPXtOI5lg80K5mFRL4oylxW_FKwvLW9VV0OViQzqtKlLetcCB3JIrd5DcYucqVLiOOvBKZVN3vctqLJjuz3q8dnvRacwruyrTteD5bHT3aqGxCJj4ub7iL6A3r5p-s
linkProvider	Colorado Alliance of Research Libraries
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=%E7%94%BB%E5%83%8F%E8%AA%8D%E8%AD%98%E3%82%B3%E3%83%B3%E3%83%9A%E3%83%86%E3%82%A3%E3%82%B7%E3%83%A7%E3%83%B3%E3%81%AE%E6%96%B0%E5%B1%95%E9%96%8B&rft.jtitle=%E6%98%A0%E5%83%8F%E6%83%85%E5%A0%B1%E3%83%A1%E3%83%87%E3%82%A3%E3%82%A2%E5%AD%A6%E4%BC%9A%E8%AA%8C&rft.au=%E4%B8%AD%E5%B1%B1%2C+%E8%8B%B1%E6%A8%B9&rft.date=2019&rft.pub=%E4%B8%80%E8%88%AC%E7%A4%BE%E5%9B%A3%E6%B3%95%E4%BA%BA+%E6%98%A0%E5%83%8F%E6%83%85%E5%A0%B1%E3%83%A1%E3%83%87%E3%82%A3%E3%82%A2%E5%AD%A6%E4%BC%9A&rft.issn=1342-6907&rft.eissn=1881-6908&rft.volume=73&rft.issue=6&rft.spage=1084&rft.epage=1089&rft_id=info:doi/10.3169%2Fitej.73.1084&rft.externalDocID=article_itej_73_6_73_1084_article_char_ja
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1342-6907&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1342-6907&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1342-6907&client=summon