50K-C: a dataset of compilable, and compiled, Java projects

Bibliographic Details
Published in 2018 IEEE/ACM 15th International Conference on Mining Software Repositories (MSR), pp. 1-5
Main Authors Martins, Pedro; Achar, Rohan; Lopes, Cristina V.
Format Conference Proceeding
Language English
Published New York, NY, USA: ACM, 28.05.2018
Series ACM Conferences
ISBN 9781450357166, 1450357164
ISSN 2574-3864
DOI 10.1145/3196398.3196450

Abstract We provide a repository of 50,000 compilable Java projects. Each project in this dataset comes with references to all the dependencies required to compile it, the resulting bytecode, and the scripts with which the projects were built. The dependencies and the build scripts provide a mechanism to re-create compilation of the projects, if needed (to instruct source code for bytecode analysis, for example). The bytecode is ready for testing, execution, and dynamic analysis tools.
Author Lopes, Cristina V.
Martins, Pedro
Achar, Rohan
Author_xml – sequence: 1
  givenname: Pedro
  surname: Martins
  fullname: Martins, Pedro
  email: pribeiro@uci.edu
  organization: University of California
– sequence: 2
  givenname: Rohan
  surname: Achar
  fullname: Achar, Rohan
  email: rachar@uci.edu
  organization: University of California
– sequence: 3
  givenname: Cristina V.
  surname: Lopes
  fullname: Lopes, Cristina V.
  email: lopes@uci.edu
  organization: University of California
CODEN IEEPAD
ContentType Conference Proceeding
Copyright 2018 ACM
Copyright_xml – notice: 2018 ACM
DOI 10.1145/3196398.3196450
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
Discipline Computer Science
EISBN 9781450357166
1450357164
EISSN 2574-3864
EndPage 5
ExternalDocumentID 8595165
Genre orig-research
IsPeerReviewed false
IsScholarly true
Keywords large scale compilation
runnable software repositories
software mining
Language English
License Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Permissions@acm.org
LinkModel DirectLink
MeetingName ICSE '18: 40th International Conference on Software Engineering
PublicationCentury 2000
PublicationDate 20180528
2018-May
PublicationDateYYYYMMDD 2018-05-28
2018-05-01
PublicationDate_xml – month: 05
  year: 2018
  text: 20180528
  day: 28
PublicationDecade 2010
PublicationPlace New York, NY, USA
PublicationPlace_xml – name: New York, NY, USA
PublicationSeriesTitle ACM Conferences
PublicationTitle 2018 IEEE/ACM 15th International Conference on Mining Software Repositories (MSR)
PublicationTitleAbbrev MSR
PublicationYear 2018
Publisher ACM
Publisher_xml – name: ACM
SourceID ieee
acm
SourceType Publisher
StartPage 1
SubjectTerms Buildings
Cloning
Data mining
Information systems -- Information retrieval -- Retrieval tasks and goals -- Information extraction
Java
Large Scale Compilation
Runnable Software Repositories
Software
Software and its engineering -- Software notations and tools -- Software libraries and repositories
Software Mining
Uniform resource locators
Subtitle a dataset of compilable, and compiled, Java projects
Title 50K-C
URI https://ieeexplore.ieee.org/document/8595165