画像認識コンペティションの新展開

Saved in:
Bibliographic Details
Published in映像情報メディア学会誌 Vol. 73; no. 6; pp. 1084 - 1089
Main Author 中山, 英樹
Format Journal Article
LanguageJapanese
Published 一般社団法人 映像情報メディア学会 2019
Online AccessGet full text
ISSN1342-6907
1881-6908
DOI10.3169/itej.73.1084

Cover

Author 中山, 英樹
Author_xml – sequence: 1
  fullname: 中山, 英樹
  organization: 東京大学大学院情報理工学系研究科創造情報学准教授
BookMark eNo9T81Kw0AYXKSCtfbmayTut_97Ein-QcGLnpdNstGEWiXJxaMNeBHxUot49SAIiuLBg4-zVOtb2KAIw8wwAwOzjFrDk6FDaBVwSEHotaxyeShpCFixBdQGpSAQGqvW3FNGGi-XULcsswhjCiCAqzbCX-OPaX09e7yaPd360Zuv57jz9YUf3fvRu68fmuT8-XPyMn29-Z5crqDF1A5K1_3TDjrY2tzv7QT9ve3d3kY_yAlTNEgIsVIwm6jUCZdoUJIksU6YYKBpajEXXAOoNAVHLRGYSxITzbWMothyRzto_Xc3Lyt76MxpkR3b4szYosrigTPNXSOpEQ01n_-b-MgWJrf0B_smYdI
ContentType Journal Article
Copyright 2019 一般社団法人 映像情報メディア学会
Copyright_xml – notice: 2019 一般社団法人 映像情報メディア学会
DOI 10.3169/itej.73.1084
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
EISSN 1881-6908
EndPage 1089
ExternalDocumentID article_itej_73_6_73_1084_article_char_ja
GroupedDBID ALMA_UNASSIGNED_HOLDINGS
CS3
JSF
KQ8
RJT
ID FETCH-LOGICAL-j2483-d22a764ad8fe6ed91872dc9d464193fa05659118ff1e3a260572c29597bbca5e3
ISSN 1342-6907
IngestDate Thu Nov 07 05:45:31 EST 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly true
Issue 6
Language Japanese
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-j2483-d22a764ad8fe6ed91872dc9d464193fa05659118ff1e3a260572c29597bbca5e3
OpenAccessLink https://www.jstage.jst.go.jp/article/itej/73/6/73_1084/_article/-char/ja
PageCount 6
ParticipantIDs jstage_primary_article_itej_73_6_73_1084_article_char_ja
PublicationCentury 2000
PublicationDate 20190000
PublicationDateYYYYMMDD 2019-01-01
PublicationDate_xml – year: 2019
  text: 20190000
PublicationDecade 2010
PublicationTitle 映像情報メディア学会誌
PublicationTitleAlternate 映情学誌
PublicationYear 2019
Publisher 一般社団法人 映像情報メディア学会
Publisher_xml – name: 一般社団法人 映像情報メディア学会
References 32)S. Ren, K. He, R. Girshick and J. Sun: “Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks”, in Proc. of NIPS(2015
21)C. Sun, A. Shrivastava, S. Singh and A. Gupta: “Revisiting Unreasonable Effectiveness of Data in Deep Learning Era”, in Proc. of IEEE ICCV(2017
29)K. He, X. Zhang, S. Ren and J. Sun: “Deep Residual Learning for Image Recognition”, in Proc. of IEEE CVPR(2016
15)R. Krishna, Y. Zhu, O. Groth, J. Johnson, K. Hata, J. Kravitz, S. Chen, Y. Kalanditis, L.-J. Li, D.A. Shamma, M.S. Bernstein, L. Fei-Fei: “Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations”, International Journal of Computer Vision, 123, 1, pp 32-73(2017
22)D. Mahajan, R. Girshick, V. Ramanathan, K. He, M. Paluri, Y. Li, A. Bharambe, L. van der Maaten: “Exploring the Limits of Weakly Supervised Pretraining”, in Proc. of ECCV(2018
6)J. Xiao, J. Hays, K.A. Ehinger and A. Torralba: “SUN Database : Large-Scale Scene Recognition from Abbey to Zoo”, in Proc. of IEEE CVPR, pp.3485-3492(2010
8)A. Karpathy, G. Toderici, S. Shetty, T. Leung, R. Sukthankar and L. Fei-Fei: “Large-Scale Video Classification with Convolutional Neural Networks”, in Proc. of IEEE CVPR, pp.1725-1732(2014
20)A. Miech, D. Zhukov, J.-B. Alayrac, M. Tapaswi, I. Laptev and J. Sivic: “HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips”, in Proc. of IEEE ICCV (2019
23)O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, A.C. Berg and L. Fei-Fei: “ImageNet Large Scale Visual Recognition Challenge”, International Journal of Computer Vision, 115, 3, pp 211-252(2015
33)T.-Y. Lin, P. Doll_r, R. Girshick, K. He, B. Hariharan and S. Belongie: “Feature Pyramid Networks for Object Detection”, in Proc. of IEEE CVPR(2017
17)S. Abu-El-Haija, N. Kothari, J. Lee, P. Natsev, G. Toderici, B. Varadarajan, S. Vijayanarasimhan: “Youtube-8M: A large-scale video classification benchmark”, arXiv preprint arXiv:1609.08675(2016
10)A. Rohrbach, M. Rohrbach, N. Tandon and B. Schiele: “A dataset for Movie Description”, in Proc. of IEEE CVPR, pp.3202-3212(2015
30)J. Hu, L. Shen, S. Albanie, G. Sun and E. Wu: “Squeeze-and-Excitation Networks”, in Proc. of IEEE CVPR(2018
14)A. Das, S. Kottur, K. Gupta, A. Singh, D. Yadav, J.M. F. Moura, D. Parikh and D. Batra: “Visual Dialog”, in Proc. of IEEE CVPR(2017
19)W. Li, L. Wang, W. Li, E. Agustsson and L.V. Gool: “WebVision Database: Visual Learning and Understanding from Web Data”, arXiv preprint, .arXiv: 1708.02862(2017
9)F.C. Heilbron, V. Escorcia, B. Ghanem, J.C. Niebles and U. Norte: “ActivityNet : A Large-Scale Video Benchmark for Human Activity Understanding”, in Proc. of IEEE CVPR(2015
1)J. Ponce, T.L. Berg, M. Everingham, D.A. Forsyth, M. Hebert, S. Lazebnik, M. Marszalek, C. Schmid, B.C. Russell, A. Torralba, C.K.I. Williams, J. Zhang and A. Zisserman: “Dataset issues in object recognition”, Toward Category-Level Object Recognition, 4170, pp.29-48, Springer-Verlag Berlin Heidelberg(2006
4)T.-Y. Lin, M. Maire, S. Belongie, L.D. Bourdev, R.B. Girshick, J. Hays, P. Perona, D. Ramanan, P. Dollár and C.L. Zitnick: “Microsoft COCO: Common Objects in Context”, in Proc. of ECCV, pp.740-755(2014
31)A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, L. Kaiser and I. Polosukhin: “Attention is All You Need”, in Proc. of NIPS(2017
5)中山英樹:“深層畳み込みニューラルネットによる画像特徴抽出と転移学習”,信学技報, 115, 146. pp.55-59(2015
26)R. Girshick, J. Donahue, T. Darrell and J. Malik: “Rich feature hierarchies for accurate object detection and semantic segmentation”, in Proc. of IEEE CVPR(2014
18)B. Thomee, D.A. Shamma, G. Friedland, B. Elizalde, K. Ni, D. Poland, D. Borth and L.-J. Li: “YFCC100M: the new data in multimedia research”, Communications of the ACM, 59, 2, Pages 64-73(2016
24)M. Everingham, S.M.A. Eslami, L. Van Gool, C.K.I. Williams, J. Winn and A. Zisserman: “The Pascal Visual Object Classes Challenge: A Retrospective”, International Journal of Computer Vision, 111, pp.98-136(2014
11)J. Xu, T. Mei, T. Yao and Y. Rui: “MSR-VTT: A large video description dataset for bridging video and language”, in Proc. of IEEE CVPR(2016
28)K. He, X. Zhang, S. Ren and J. Sun: “Delving Deep into Rectifiers : Surpassing Human-Level Performance on ImageNet Classification”, in Proc. of IEEE ICCV(2015
13)Y. Zhu, O. Groth, M. Bernstein and L. Fei-Fei: “Visual7W: Grounded Question Answering in Images”, in Proc. of IEEE CVPR, pp.4995-5004(2016
27)S. Ioffe and C. Szegedy: “Batch Normalization : Accelerating Deep Network Training by Reducing Internal Covariate Shift”, in Proc. of ICML(2015
2)J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li and L. Fei-Fei: “ImageNet: A Large-scale Hierarchical Image Database”, in Proc. of IEEE CVPR (2009
7)B. Zhou, A. Lapedriza, J. Xiao, A. Torralba and A. Oliva: “Learning Deep Features for Scene Recognition using Places Database”, in Proc. of NIPS, pp.487-495(2014
3)C. Fellbaum: “WordNet: An electronic lexical database”, MIT Press(1998
16)I. Krasin, T. Duerig, N. Alldrin, V. Ferrari, S. Abu-El-Haija, A. Kuznetsova, H. Rom, J. Uijlings, S. Popov, A. Veit, S. Belongie, V. Gomes, A. Gupta, C. Sun, G. Chechik, D. Cai, Z. Feng, D. Narayanan and K. Murphy: “Openimages: A public dataset for large-scale multilabel and multi-class image classification”(2016),Dataset available from https://github.com/openimages
25)A. Krizhevsky, I. Sutskever and G.E. Hinton: “ImageNet Classification with Deep Convolutional Neural Networks”, in Proc. of NIPS, pp.1097-1105(2012
12)A. Agrawal, J. Lu, S. Antol, M. Mitchell, C.L. Zitnick, D. Batra and D. Parikh: “VQA: Visual Question Answering”, in Proc. of IEEE ICCV (2015
References_xml – reference: 7)B. Zhou, A. Lapedriza, J. Xiao, A. Torralba and A. Oliva: “Learning Deep Features for Scene Recognition using Places Database”, in Proc. of NIPS, pp.487-495(2014)
– reference: 23)O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, A.C. Berg and L. Fei-Fei: “ImageNet Large Scale Visual Recognition Challenge”, International Journal of Computer Vision, 115, 3, pp 211-252(2015)
– reference: 8)A. Karpathy, G. Toderici, S. Shetty, T. Leung, R. Sukthankar and L. Fei-Fei: “Large-Scale Video Classification with Convolutional Neural Networks”, in Proc. of IEEE CVPR, pp.1725-1732(2014)
– reference: 17)S. Abu-El-Haija, N. Kothari, J. Lee, P. Natsev, G. Toderici, B. Varadarajan, S. Vijayanarasimhan: “Youtube-8M: A large-scale video classification benchmark”, arXiv preprint arXiv:1609.08675(2016)
– reference: 11)J. Xu, T. Mei, T. Yao and Y. Rui: “MSR-VTT: A large video description dataset for bridging video and language”, in Proc. of IEEE CVPR(2016)
– reference: 22)D. Mahajan, R. Girshick, V. Ramanathan, K. He, M. Paluri, Y. Li, A. Bharambe, L. van der Maaten: “Exploring the Limits of Weakly Supervised Pretraining”, in Proc. of ECCV(2018)
– reference: 3)C. Fellbaum: “WordNet: An electronic lexical database”, MIT Press(1998)
– reference: 15)R. Krishna, Y. Zhu, O. Groth, J. Johnson, K. Hata, J. Kravitz, S. Chen, Y. Kalanditis, L.-J. Li, D.A. Shamma, M.S. Bernstein, L. Fei-Fei: “Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations”, International Journal of Computer Vision, 123, 1, pp 32-73(2017)
– reference: 26)R. Girshick, J. Donahue, T. Darrell and J. Malik: “Rich feature hierarchies for accurate object detection and semantic segmentation”, in Proc. of IEEE CVPR(2014)
– reference: 4)T.-Y. Lin, M. Maire, S. Belongie, L.D. Bourdev, R.B. Girshick, J. Hays, P. Perona, D. Ramanan, P. Dollár and C.L. Zitnick: “Microsoft COCO: Common Objects in Context”, in Proc. of ECCV, pp.740-755(2014)
– reference: 14)A. Das, S. Kottur, K. Gupta, A. Singh, D. Yadav, J.M. F. Moura, D. Parikh and D. Batra: “Visual Dialog”, in Proc. of IEEE CVPR(2017)
– reference: 19)W. Li, L. Wang, W. Li, E. Agustsson and L.V. Gool: “WebVision Database: Visual Learning and Understanding from Web Data”, arXiv preprint, .arXiv: 1708.02862(2017)
– reference: 10)A. Rohrbach, M. Rohrbach, N. Tandon and B. Schiele: “A dataset for Movie Description”, in Proc. of IEEE CVPR, pp.3202-3212(2015)
– reference: 6)J. Xiao, J. Hays, K.A. Ehinger and A. Torralba: “SUN Database : Large-Scale Scene Recognition from Abbey to Zoo”, in Proc. of IEEE CVPR, pp.3485-3492(2010)
– reference: 13)Y. Zhu, O. Groth, M. Bernstein and L. Fei-Fei: “Visual7W: Grounded Question Answering in Images”, in Proc. of IEEE CVPR, pp.4995-5004(2016)
– reference: 18)B. Thomee, D.A. Shamma, G. Friedland, B. Elizalde, K. Ni, D. Poland, D. Borth and L.-J. Li: “YFCC100M: the new data in multimedia research”, Communications of the ACM, 59, 2, Pages 64-73(2016)
– reference: 12)A. Agrawal, J. Lu, S. Antol, M. Mitchell, C.L. Zitnick, D. Batra and D. Parikh: “VQA: Visual Question Answering”, in Proc. of IEEE ICCV (2015)
– reference: 9)F.C. Heilbron, V. Escorcia, B. Ghanem, J.C. Niebles and U. Norte: “ActivityNet : A Large-Scale Video Benchmark for Human Activity Understanding”, in Proc. of IEEE CVPR(2015)
– reference: 20)A. Miech, D. Zhukov, J.-B. Alayrac, M. Tapaswi, I. Laptev and J. Sivic: “HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips”, in Proc. of IEEE ICCV (2019)
– reference: 33)T.-Y. Lin, P. Doll_r, R. Girshick, K. He, B. Hariharan and S. Belongie: “Feature Pyramid Networks for Object Detection”, in Proc. of IEEE CVPR(2017)
– reference: 2)J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li and L. Fei-Fei: “ImageNet: A Large-scale Hierarchical Image Database”, in Proc. of IEEE CVPR (2009)
– reference: 30)J. Hu, L. Shen, S. Albanie, G. Sun and E. Wu: “Squeeze-and-Excitation Networks”, in Proc. of IEEE CVPR(2018)
– reference: 24)M. Everingham, S.M.A. Eslami, L. Van Gool, C.K.I. Williams, J. Winn and A. Zisserman: “The Pascal Visual Object Classes Challenge: A Retrospective”, International Journal of Computer Vision, 111, pp.98-136(2014)
– reference: 1)J. Ponce, T.L. Berg, M. Everingham, D.A. Forsyth, M. Hebert, S. Lazebnik, M. Marszalek, C. Schmid, B.C. Russell, A. Torralba, C.K.I. Williams, J. Zhang and A. Zisserman: “Dataset issues in object recognition”, Toward Category-Level Object Recognition, 4170, pp.29-48, Springer-Verlag Berlin Heidelberg(2006)
– reference: 27)S. Ioffe and C. Szegedy: “Batch Normalization : Accelerating Deep Network Training by Reducing Internal Covariate Shift”, in Proc. of ICML(2015)
– reference: 21)C. Sun, A. Shrivastava, S. Singh and A. Gupta: “Revisiting Unreasonable Effectiveness of Data in Deep Learning Era”, in Proc. of IEEE ICCV(2017)
– reference: 5)中山英樹:“深層畳み込みニューラルネットによる画像特徴抽出と転移学習”,信学技報, 115, 146. pp.55-59(2015)
– reference: 32)S. Ren, K. He, R. Girshick and J. Sun: “Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks”, in Proc. of NIPS(2015)
– reference: 16)I. Krasin, T. Duerig, N. Alldrin, V. Ferrari, S. Abu-El-Haija, A. Kuznetsova, H. Rom, J. Uijlings, S. Popov, A. Veit, S. Belongie, V. Gomes, A. Gupta, C. Sun, G. Chechik, D. Cai, Z. Feng, D. Narayanan and K. Murphy: “Openimages: A public dataset for large-scale multilabel and multi-class image classification”(2016),Dataset available from https://github.com/openimages
– reference: 29)K. He, X. Zhang, S. Ren and J. Sun: “Deep Residual Learning for Image Recognition”, in Proc. of IEEE CVPR(2016)
– reference: 25)A. Krizhevsky, I. Sutskever and G.E. Hinton: “ImageNet Classification with Deep Convolutional Neural Networks”, in Proc. of NIPS, pp.1097-1105(2012)
– reference: 28)K. He, X. Zhang, S. Ren and J. Sun: “Delving Deep into Rectifiers : Surpassing Human-Level Performance on ImageNet Classification”, in Proc. of IEEE ICCV(2015)
– reference: 31)A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, L. Kaiser and I. Polosukhin: “Attention is All You Need”, in Proc. of NIPS(2017)
SSID ssib003116158
ssib017172179
ssj0061382
ssib000937025
ssib002809428
ssib002484574
ssib001234188
ssib023167534
ssib056857217
ssib023157722
Score 2.1740787
SourceID jstage
SourceType Publisher
StartPage 1084
Title 画像認識コンペティションの新展開
URI https://www.jstage.jst.go.jp/article/itej/73/6/73_1084/_article/-char/ja
Volume 73
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
ispartofPNX 映像情報メディア学会誌, 2019, Vol.73(6), pp.1084-1089
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtR1Na9RQMNR60YP4id_04DtJ1rzv947JbkpRFIQWegtJNkH20IpsL97cBS8iXrSIVw-CoCgePPhzQrX-C2desruv0kMtCCFMJjPJfCR5M5O8SRDcEsOal6LQYVlTHoqSyrCwtoQbj1cwwOUFL_CN7v0Ham1D3N2Um0snHnlfLe2Mi1759NB5JcfxKuDArzhL9h88Oz8oIAAG_8IaPAzrI_mYpJpYQZKEpJIYTswqSQ2JY2IGDhgQa0gKeEYS7gC-AGzcAUZ1NDGfEetuV6wPclESpyRVxCqSRHjShBIrSWoRYxI_0nVkIEPkyaYcIBEDeODtzjIDjP5bkpg54gGJQUjQtO_EbnWc13fdHqduKxEscOEgmUncWRSJDUmsX-HwnqAdu4kcBxynj3aNAZniAW3ixFFoA9QV8DEst_-fgt5AwQULsbLQjqMtzhiKOOOPLu2PWrq7yB8qaNT-G68LO2DTHjakcaqwIyzkH6Oe5r0F14Em4d0lmCFZpnmmcIW02WwPTuLLRpBJnGRaU_z49d5DLxCHKNV_3wzxjKB-IzthhPQSBWYi6yeunGLmMN-mGisLi_e_kERISOOYtw2J6qIxoVRGIsMshlLYGdOVSjort1NW0BZ3fEtA2DiCJGr2AaaLCdfPBme6ZG4lbpU_FyyN8vPBaa_F54Ug-vX6x9701f7Hl_uf3jaTb80UlnfN9Hkzed9MvjfTD4h59vnn7pe9r29-7764GGyspuv9tbD7S0k4AqvwcMhYrpXIh6auVDW01Gg2LO1QKAHJUZ2DchIiClPXtOI5lg80K5mFRL4oylxW_FKwvLW9VV0OViQzqtKlLetcCB3JIrd5DcYucqVLiOOvBKZVN3vctqLJjuz3q8dnvRacwruyrTteD5bHT3aqGxCJj4ub7iL6A3r5p-s
linkProvider Colorado Alliance of Research Libraries
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=%E7%94%BB%E5%83%8F%E8%AA%8D%E8%AD%98%E3%82%B3%E3%83%B3%E3%83%9A%E3%83%86%E3%82%A3%E3%82%B7%E3%83%A7%E3%83%B3%E3%81%AE%E6%96%B0%E5%B1%95%E9%96%8B&rft.jtitle=%E6%98%A0%E5%83%8F%E6%83%85%E5%A0%B1%E3%83%A1%E3%83%87%E3%82%A3%E3%82%A2%E5%AD%A6%E4%BC%9A%E8%AA%8C&rft.au=%E4%B8%AD%E5%B1%B1%2C+%E8%8B%B1%E6%A8%B9&rft.date=2019&rft.pub=%E4%B8%80%E8%88%AC%E7%A4%BE%E5%9B%A3%E6%B3%95%E4%BA%BA+%E6%98%A0%E5%83%8F%E6%83%85%E5%A0%B1%E3%83%A1%E3%83%87%E3%82%A3%E3%82%A2%E5%AD%A6%E4%BC%9A&rft.issn=1342-6907&rft.eissn=1881-6908&rft.volume=73&rft.issue=6&rft.spage=1084&rft.epage=1089&rft_id=info:doi/10.3169%2Fitej.73.1084&rft.externalDocID=article_itej_73_6_73_1084_article_char_ja
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1342-6907&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1342-6907&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1342-6907&client=summon