Bugs.jar a large-scale, diverse dataset of real-world Java bugs
We present Bugs.jar, a large-scale dataset for research in automated debugging, patching, and testing of Java programs. Bugs.jar is comprised of 1,158 bugs and patches, drawn from 8 large, popular open-source Java projects, spanning 8 diverse and prominent application categories. It is an order of m...
Saved in:
Published in | 2018 IEEE/ACM 15th International Conference on Mining Software Repositories (MSR) pp. 10 - 13 |
---|---|
Main Authors | , , , , |
Format | Conference Proceeding |
Language | English |
Published |
New York, NY, USA
ACM
28.05.2018
|
Series | ACM Conferences |
Subjects | |
Online Access | Get full text |
ISBN | 9781450357166 1450357164 |
ISSN | 2574-3864 |
DOI | 10.1145/3196398.3196473 |
Cover
Loading…
Abstract | We present Bugs.jar, a large-scale dataset for research in automated debugging, patching, and testing of Java programs. Bugs.jar is comprised of 1,158 bugs and patches, drawn from 8 large, popular open-source Java projects, spanning 8 diverse and prominent application categories. It is an order of magnitude larger than Defects4J, the only other dataset in its class. We discuss the methodology used for constructing Bugs.jar, the representation of the dataset, several use-cases, and an illustration of three of the use-cases through the application of 3 specific tools on Bugs.jar, namely our own tool, Elixir, and two third-party tools, Ekstazi and JaCoCo. |
---|---|
AbstractList | We present Bugs.jar, a large-scale dataset for research in automated debugging, patching, and testing of Java programs. Bugs.jar is comprised of 1,158 bugs and patches, drawn from 8 large, popular open-source Java projects, spanning 8 diverse and prominent application categories. It is an order of magnitude larger than Defects4J, the only other dataset in its class. We discuss the methodology used for constructing Bugs.jar, the representation of the dataset, several use-cases, and an illustration of three of the use-cases through the application of 3 specific tools on Bugs.jar, namely our own tool, Elixir, and two third-party tools, Ekstazi and JaCoCo. We present Bugs.jar, a large-scale dataset for research in automated debugging, patching, and testing of Java programs. Bugs.jar is comprised of 1,158 bugs and patches, drawn from 8 large, popular opensource Java projects, spanning 8 diverse and prominent application categories. It is an order of magnitude larger than Defects4J, the only other dataset in its class. We discuss the methodology used for constructing Bugs.jar, the representation of the dataset, several use-cases, and an illustration of three of the use-cases through the application of 3 specific tools on Bugs.jar, namely our own tool, Elixir, and two third-party tools, Ekstazi and JaCoCo. |
Author | Prasad, Mukul R. Lyu, Yingjun Saha, Ripon K. Lam, Wing Yoshida, Hiroaki |
Author_xml | – sequence: 1 givenname: Ripon K. surname: Saha fullname: Saha, Ripon K. organization: Fujitsu Laboratories of America, Inc – sequence: 2 givenname: Yingjun surname: Lyu fullname: Lyu, Yingjun organization: University of Southern California – sequence: 3 givenname: Wing surname: Lam fullname: Lam, Wing organization: University of Illinois at Urbana-Champaign – sequence: 4 givenname: Hiroaki surname: Yoshida fullname: Yoshida, Hiroaki organization: Fujitsu Laboratories of America, Inc – sequence: 5 givenname: Mukul R. surname: Prasad fullname: Prasad, Mukul R. organization: Fujitsu Laboratories of America, Inc |
BookMark | eNqNjztPwzAURs1LaimZGfgDLAm-vravPUJVHlIlFpgt39hBKbRBCQz8e1I1ExPTGY6-Tzrn4nTX7bIQlyArAG1uELxF76o9NeGRKDy5UUg0BNYei7kypEt0Vp_8cTNRDMNGSqms0wA0F7O777eh2sT-Qpw18WPIxcSFeL1fvSwfy_Xzw9Pydl1GpemrVISSkFFxkrEmZbRynKDBhjCBYq0ocTTZeO-TdTXnxJ6Tyc4nzZFxIa4Ov23OOXz27Tb2P8EZb8DSaKuDjfU2cNe9DwFk2GeHKTtM2YH7Njfj4PqfA_wFH5lSUA |
CODEN | IEEPAD |
ContentType | Conference Proceeding |
Copyright | 2018 ACM |
Copyright_xml | – notice: 2018 ACM |
DBID | 6IE 6IL CBEJK RIE RIL |
DOI | 10.1145/3196398.3196473 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE/IET Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Computer Science |
EISBN | 9781450357166 1450357164 |
EISSN | 2574-3864 |
EndPage | 13 |
ExternalDocumentID | 8595167 |
Genre | orig-research |
GroupedDBID | 6IE 6IF 6IL 6IN AAJGR ABLEC ACM ADPZR ALMA_UNASSIGNED_HOLDINGS APO BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK GUFHI IEGSK LHSKQ OCL RIB RIC RIE RIL AAWTH ADZIZ CHZPO |
ID | FETCH-LOGICAL-a247t-273073b32bd0ac725428bd1f3f73d12b427dba5e5999d68cbedb9bd5e89d4bab3 |
IEDL.DBID | RIE |
ISBN | 9781450357166 1450357164 |
IngestDate | Wed Aug 27 02:59:18 EDT 2025 Fri Sep 13 11:04:49 EDT 2024 |
IsPeerReviewed | false |
IsScholarly | true |
Keywords | reproducible bugs Java programs large-scale dataset |
Language | English |
License | Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Permissions@acm.org |
LinkModel | DirectLink |
MeetingName | ICSE '18: 40th International Conference on Software Engineering |
MergedId | FETCHMERGED-LOGICAL-a247t-273073b32bd0ac725428bd1f3f73d12b427dba5e5999d68cbedb9bd5e89d4bab3 |
PageCount | 4 |
ParticipantIDs | acm_books_10_1145_3196398_3196473 ieee_primary_8595167 acm_books_10_1145_3196398_3196473_brief |
PublicationCentury | 2000 |
PublicationDate | 20180528 2018-May |
PublicationDateYYYYMMDD | 2018-05-28 2018-05-01 |
PublicationDate_xml | – month: 05 year: 2018 text: 20180528 day: 28 |
PublicationDecade | 2010 |
PublicationPlace | New York, NY, USA |
PublicationPlace_xml | – name: New York, NY, USA |
PublicationSeriesTitle | ACM Conferences |
PublicationTitle | 2018 IEEE/ACM 15th International Conference on Mining Software Repositories (MSR) |
PublicationTitleAbbrev | MSR |
PublicationYear | 2018 |
Publisher | ACM |
Publisher_xml | – name: ACM |
SSID | ssj0002684117 ssj0003211714 |
Score | 2.4990492 |
Snippet | We present Bugs.jar, a large-scale dataset for research in automated debugging, patching, and testing of Java programs. Bugs.jar is comprised of 1,158 bugs and... |
SourceID | ieee acm |
SourceType | Publisher |
StartPage | 10 |
SubjectTerms | Computer bugs Debugging Java Java Programs Large Scale Dataset Libraries Reproducible Bugs Software Testing |
Subtitle | a large-scale, diverse dataset of real-world Java bugs |
Title | Bugs.jar |
URI | https://ieeexplore.ieee.org/document/8595167 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8NAEB7anjzVR8X6YgXBS5PmsXl58lFLKVZELfQWdvYhKrbSJh789e4maUUR9JaE3bDMzDIzu998A3Dsq1hQ5NRisblmjFFZCeOOxVxEXwVKR73maGB0Ew7GdDgJJjXorGphpJQF-Eza5rG4yxcznpujsq7h4nLDqA51bWZlrdbqPMWwlixrJs27rzObyKUVm49Lg25hbElsFxxUpk96nfHXb01VCp_Sb8JouZoSSvJi5xna_OMHUeN_l7sOra_qPXK78ksbUJPTTWgu2zeQajdvwdlF_riwn9n8lJyTa4MIt-61xmSH9AqwhiQ9lmknl5GZInc6oLQK5A0ZsndGzNwWjPtXD5cDq2qoYDGPRpkpw9E7Gn0PhcN4pHNDL0bhKl9FvnA9pF4kkAUy0FGjCGOOUmCCIpBxohXK0N-GxnQ2lTtAhDL_kSyJHEGpnoFBKB2XS6Vzbe46bTjS0k1NprBIy-LnIK00kFYaaMPJn2NSnD9J1YYtI970rWTgSCvJ7v7-eQ_WdFgTl7DEfWhk81we6NAhw8PCZj4B_eK7vQ |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8NAEB5qPeipahXrcwXBi4l57DaJJ9_U2hbxAd7Czj5ExVba1IO_3t0krSiC3pKwG5aZXeax830DsBvqWFIU1OGxvWaMUTsJF57DfcRQM228Xpsa6PaarXvafmAPFdifYmGUUnnxmXLtY36XLwdibFNlB5aLy29GMzBr7D5lBVprmlGxvCUT1KR9D01sE_m05PPxKTvIt1sSuzkLle2UPsPF67e2KrlVuahBd7KeopjkxR1n6IqPH1SN_13wAix_4ffI9dQyLUJF9ZegNmngQMrzXIejk_HjyH3mw0NyTDq2Jty5NTpT--QsL9dQ5IxnxsxlZKDJjXEpnbz2hrT5Oyd27jLcX5zfnbacsqWCwwMaZRaIY840hgFKj4vIRIdBjNLXoY5C6QdIg0giZ4oZv1E2Y4FKYoKSqTgxKuUYrkC1P-irVSBS2_8onkSepNTMQNZUni-UNtG28L0G7BjppjZWGKUF_JmlpQbSUgMN2PtzTIrDJ6UbULfiTd8KDo60lOza75-3Ya511-2kncve1TrMGycnLooUN6CaDcdq0zgSGW7l--cTB8S_Cg |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Proceedings+of+the+15th+International+Conference+on+Mining+Software+Repositories&rft.atitle=Bugs.jar&rft.au=Saha%2C+Ripon+K.&rft.au=Lyu%2C+Yingjun&rft.au=Lam%2C+Wing&rft.au=Yoshida%2C+Hiroaki&rft.series=ACM+Conferences&rft.date=2018-05-28&rft.pub=ACM&rft.isbn=9781450357166&rft.spage=10&rft.epage=13&rft_id=info:doi/10.1145%2F3196398.3196473 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781450357166/lc.gif&client=summon&freeimage=true |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781450357166/mc.gif&client=summon&freeimage=true |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781450357166/sc.gif&client=summon&freeimage=true |