Bugs.jar a large-scale, diverse dataset of real-world Java bugs

We present Bugs.jar, a large-scale dataset for research in automated debugging, patching, and testing of Java programs. Bugs.jar is comprised of 1,158 bugs and patches, drawn from 8 large, popular open-source Java projects, spanning 8 diverse and prominent application categories. It is an order of m...

Full description

Saved in:
Bibliographic Details
Published in2018 IEEE/ACM 15th International Conference on Mining Software Repositories (MSR) pp. 10 - 13
Main Authors Saha, Ripon K., Lyu, Yingjun, Lam, Wing, Yoshida, Hiroaki, Prasad, Mukul R.
Format Conference Proceeding
LanguageEnglish
Published New York, NY, USA ACM 28.05.2018
SeriesACM Conferences
Subjects
Online AccessGet full text
ISBN9781450357166
1450357164
ISSN2574-3864
DOI10.1145/3196398.3196473

Cover

Loading…
Abstract We present Bugs.jar, a large-scale dataset for research in automated debugging, patching, and testing of Java programs. Bugs.jar is comprised of 1,158 bugs and patches, drawn from 8 large, popular open-source Java projects, spanning 8 diverse and prominent application categories. It is an order of magnitude larger than Defects4J, the only other dataset in its class. We discuss the methodology used for constructing Bugs.jar, the representation of the dataset, several use-cases, and an illustration of three of the use-cases through the application of 3 specific tools on Bugs.jar, namely our own tool, Elixir, and two third-party tools, Ekstazi and JaCoCo.
AbstractList We present Bugs.jar, a large-scale dataset for research in automated debugging, patching, and testing of Java programs. Bugs.jar is comprised of 1,158 bugs and patches, drawn from 8 large, popular open-source Java projects, spanning 8 diverse and prominent application categories. It is an order of magnitude larger than Defects4J, the only other dataset in its class. We discuss the methodology used for constructing Bugs.jar, the representation of the dataset, several use-cases, and an illustration of three of the use-cases through the application of 3 specific tools on Bugs.jar, namely our own tool, Elixir, and two third-party tools, Ekstazi and JaCoCo.
We present Bugs.jar, a large-scale dataset for research in automated debugging, patching, and testing of Java programs. Bugs.jar is comprised of 1,158 bugs and patches, drawn from 8 large, popular opensource Java projects, spanning 8 diverse and prominent application categories. It is an order of magnitude larger than Defects4J, the only other dataset in its class. We discuss the methodology used for constructing Bugs.jar, the representation of the dataset, several use-cases, and an illustration of three of the use-cases through the application of 3 specific tools on Bugs.jar, namely our own tool, Elixir, and two third-party tools, Ekstazi and JaCoCo.
Author Prasad, Mukul R.
Lyu, Yingjun
Saha, Ripon K.
Lam, Wing
Yoshida, Hiroaki
Author_xml – sequence: 1
  givenname: Ripon K.
  surname: Saha
  fullname: Saha, Ripon K.
  organization: Fujitsu Laboratories of America, Inc
– sequence: 2
  givenname: Yingjun
  surname: Lyu
  fullname: Lyu, Yingjun
  organization: University of Southern California
– sequence: 3
  givenname: Wing
  surname: Lam
  fullname: Lam, Wing
  organization: University of Illinois at Urbana-Champaign
– sequence: 4
  givenname: Hiroaki
  surname: Yoshida
  fullname: Yoshida, Hiroaki
  organization: Fujitsu Laboratories of America, Inc
– sequence: 5
  givenname: Mukul R.
  surname: Prasad
  fullname: Prasad, Mukul R.
  organization: Fujitsu Laboratories of America, Inc
BookMark eNqNjztPwzAURs1LaimZGfgDLAm-vravPUJVHlIlFpgt39hBKbRBCQz8e1I1ExPTGY6-Tzrn4nTX7bIQlyArAG1uELxF76o9NeGRKDy5UUg0BNYei7kypEt0Vp_8cTNRDMNGSqms0wA0F7O777eh2sT-Qpw18WPIxcSFeL1fvSwfy_Xzw9Pydl1GpemrVISSkFFxkrEmZbRynKDBhjCBYq0ocTTZeO-TdTXnxJ6Tyc4nzZFxIa4Ov23OOXz27Tb2P8EZb8DSaKuDjfU2cNe9DwFk2GeHKTtM2YH7Njfj4PqfA_wFH5lSUA
CODEN IEEPAD
ContentType Conference Proceeding
Copyright 2018 ACM
Copyright_xml – notice: 2018 ACM
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1145/3196398.3196473
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE/IET Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList

Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISBN 9781450357166
1450357164
EISSN 2574-3864
EndPage 13
ExternalDocumentID 8595167
Genre orig-research
GroupedDBID 6IE
6IF
6IL
6IN
AAJGR
ABLEC
ACM
ADPZR
ALMA_UNASSIGNED_HOLDINGS
APO
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
GUFHI
IEGSK
LHSKQ
OCL
RIB
RIC
RIE
RIL
AAWTH
ADZIZ
CHZPO
ID FETCH-LOGICAL-a247t-273073b32bd0ac725428bd1f3f73d12b427dba5e5999d68cbedb9bd5e89d4bab3
IEDL.DBID RIE
ISBN 9781450357166
1450357164
IngestDate Wed Aug 27 02:59:18 EDT 2025
Fri Sep 13 11:04:49 EDT 2024
IsPeerReviewed false
IsScholarly true
Keywords reproducible bugs
Java programs
large-scale dataset
Language English
License Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Permissions@acm.org
LinkModel DirectLink
MeetingName ICSE '18: 40th International Conference on Software Engineering
MergedId FETCHMERGED-LOGICAL-a247t-273073b32bd0ac725428bd1f3f73d12b427dba5e5999d68cbedb9bd5e89d4bab3
PageCount 4
ParticipantIDs acm_books_10_1145_3196398_3196473
ieee_primary_8595167
acm_books_10_1145_3196398_3196473_brief
PublicationCentury 2000
PublicationDate 20180528
2018-May
PublicationDateYYYYMMDD 2018-05-28
2018-05-01
PublicationDate_xml – month: 05
  year: 2018
  text: 20180528
  day: 28
PublicationDecade 2010
PublicationPlace New York, NY, USA
PublicationPlace_xml – name: New York, NY, USA
PublicationSeriesTitle ACM Conferences
PublicationTitle 2018 IEEE/ACM 15th International Conference on Mining Software Repositories (MSR)
PublicationTitleAbbrev MSR
PublicationYear 2018
Publisher ACM
Publisher_xml – name: ACM
SSID ssj0002684117
ssj0003211714
Score 2.4990492
Snippet We present Bugs.jar, a large-scale dataset for research in automated debugging, patching, and testing of Java programs. Bugs.jar is comprised of 1,158 bugs and...
SourceID ieee
acm
SourceType Publisher
StartPage 10
SubjectTerms Computer bugs
Debugging
Java
Java Programs
Large Scale Dataset
Libraries
Reproducible Bugs
Software
Testing
Subtitle a large-scale, diverse dataset of real-world Java bugs
Title Bugs.jar
URI https://ieeexplore.ieee.org/document/8595167
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8NAEB7anjzVR8X6YgXBS5PmsXl58lFLKVZELfQWdvYhKrbSJh789e4maUUR9JaE3bDMzDIzu998A3Dsq1hQ5NRisblmjFFZCeOOxVxEXwVKR73maGB0Ew7GdDgJJjXorGphpJQF-Eza5rG4yxcznpujsq7h4nLDqA51bWZlrdbqPMWwlixrJs27rzObyKUVm49Lg25hbElsFxxUpk96nfHXb01VCp_Sb8JouZoSSvJi5xna_OMHUeN_l7sOra_qPXK78ksbUJPTTWgu2zeQajdvwdlF_riwn9n8lJyTa4MIt-61xmSH9AqwhiQ9lmknl5GZInc6oLQK5A0ZsndGzNwWjPtXD5cDq2qoYDGPRpkpw9E7Gn0PhcN4pHNDL0bhKl9FvnA9pF4kkAUy0FGjCGOOUmCCIpBxohXK0N-GxnQ2lTtAhDL_kSyJHEGpnoFBKB2XS6Vzbe46bTjS0k1NprBIy-LnIK00kFYaaMPJn2NSnD9J1YYtI970rWTgSCvJ7v7-eQ_WdFgTl7DEfWhk81we6NAhw8PCZj4B_eK7vQ
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8NAEB5qPeipahXrcwXBi4l57DaJJ9_U2hbxAd7Czj5ExVba1IO_3t0krSiC3pKwG5aZXeax830DsBvqWFIU1OGxvWaMUTsJF57DfcRQM228Xpsa6PaarXvafmAPFdifYmGUUnnxmXLtY36XLwdibFNlB5aLy29GMzBr7D5lBVprmlGxvCUT1KR9D01sE_m05PPxKTvIt1sSuzkLle2UPsPF67e2KrlVuahBd7KeopjkxR1n6IqPH1SN_13wAix_4ffI9dQyLUJF9ZegNmngQMrzXIejk_HjyH3mw0NyTDq2Jty5NTpT--QsL9dQ5IxnxsxlZKDJjXEpnbz2hrT5Oyd27jLcX5zfnbacsqWCwwMaZRaIY840hgFKj4vIRIdBjNLXoY5C6QdIg0giZ4oZv1E2Y4FKYoKSqTgxKuUYrkC1P-irVSBS2_8onkSepNTMQNZUni-UNtG28L0G7BjppjZWGKUF_JmlpQbSUgMN2PtzTIrDJ6UbULfiTd8KDo60lOza75-3Ya511-2kncve1TrMGycnLooUN6CaDcdq0zgSGW7l--cTB8S_Cg
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Proceedings+of+the+15th+International+Conference+on+Mining+Software+Repositories&rft.atitle=Bugs.jar&rft.au=Saha%2C+Ripon+K.&rft.au=Lyu%2C+Yingjun&rft.au=Lam%2C+Wing&rft.au=Yoshida%2C+Hiroaki&rft.series=ACM+Conferences&rft.date=2018-05-28&rft.pub=ACM&rft.isbn=9781450357166&rft.spage=10&rft.epage=13&rft_id=info:doi/10.1145%2F3196398.3196473
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781450357166/lc.gif&client=summon&freeimage=true
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781450357166/mc.gif&client=summon&freeimage=true
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781450357166/sc.gif&client=summon&freeimage=true