CLACER: A Deep Learning-based Compilation Error Classification Method for Novice Students' Programs

Bibliographic Details
Published in 2021 IEEE 45th Annual Computers, Software, and Applications Conference (COMPSAC), pp. 74-83
Main Authors Li, Zheng; Sun, Fuxiang; Wang, Haifeng; Ding, Yifan; Liu, Yong; Chen, Xiang
Format Conference Proceeding
Language English
Published IEEE 01.07.2021
DOI 10.1109/COMPSAC51774.2021.00022

Abstract Novice students frequently encounter compilation errors while debugging their programs. Compiler error messages help novices localize and remove errors, but students find these messages difficult to understand. Some computing education researchers have analyzed the compiler error messages generated by novices' attempts to compile their programs. However, some important questions remain open. For example, the existing compilation error category cannot cover all programs because it relies on simple static analysis and program repair patterns, and existing prediction models for classifying compilation errors are unsatisfactory because they use inappropriate neural networks. In this paper, we first propose a new category of compilation errors based on program tokens, the smallest units of a program. We then develop a neural network model, CLACER (ClAssification of Compilation ERrors), based on TextCNN. CLACER performs better at extracting semantic and statistical features from compiler error messages. To verify the effectiveness of the proposed category and the corresponding method CLACER, we choose 16,926 student programs as our experimental subjects. The experimental results indicate that the proposed classification category covers 16.5% more programs than the state-of-the-art category TEGCER. Moreover, CLACER improves the compiler's localization effectiveness, with a 4.25% improvement on the TEGCER category. Further analysis shows that CLACER has promising prediction performance across different error classes and that TextCNN is more suitable for constructing the compilation error classification model.
Author Li, Zheng
Liu, Yong
Ding, Yifan
Wang, Haifeng
Sun, Fuxiang
Chen, Xiang
Author_xml – sequence: 1
  givenname: Zheng
  surname: Li
  fullname: Li, Zheng
  organization: Beijing University of Chemical Technology, College of Information Science and Technology, Beijing, China, 100029
– sequence: 2
  givenname: Fuxiang
  surname: Sun
  fullname: Sun, Fuxiang
  organization: Beijing University of Chemical Technology, College of Information Science and Technology, Beijing, China, 100029
– sequence: 3
  givenname: Haifeng
  surname: Wang
  fullname: Wang, Haifeng
  email: h.f.wang@hotmail.com
  organization: Beijing University of Chemical Technology, College of Information Science and Technology, Beijing, China, 100029
– sequence: 4
  givenname: Yifan
  surname: Ding
  fullname: Ding, Yifan
  organization: Tsinghua University High School International, Beijing, China, 100027
– sequence: 5
  givenname: Yong
  surname: Liu
  fullname: Liu, Yong
  email: lyong@mail.buct.edu.cn
  organization: Beijing University of Chemical Technology, College of Information Science and Technology, Beijing, China, 100029
– sequence: 6
  givenname: Xiang
  surname: Chen
  fullname: Chen, Xiang
  organization: Nantong University, School of Information Science and Technology, China
CODEN IEEPAD
ContentType Conference Proceeding
DBID 6IE
6IH
CBEJK
RIE
RIO
DOI 10.1109/COMPSAC51774.2021.00022
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan (POP) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP) 1998-present
EISBN 166542463X
9781665424639
EndPage 83
ExternalDocumentID 9529845
Genre orig-research
GrantInformation_xml – fundername: National Natural Science Foundation of China
  funderid: 10.13039/501100001809
– fundername: Chinese Academy of Sciences
  funderid: 10.13039/501100002367
GroupedDBID 6IE
6IH
CBEJK
RIE
RIO
IEDL.DBID RIE
IngestDate Thu Jun 29 18:38:50 EDT 2023
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
PageCount 10
ParticipantIDs ieee_primary_9529845
PublicationCentury 2000
PublicationDate 2021-July
PublicationDateYYYYMMDD 2021-07-01
PublicationDate_xml – month: 07
  year: 2021
  text: 2021-July
PublicationDecade 2020
PublicationTitle 2021 IEEE 45th Annual Computers, Software, and Applications Conference (COMPSAC)
PublicationTitleAbbrev COMPSAC
PublicationYear 2021
Publisher IEEE
Publisher_xml – name: IEEE
SourceID ieee
SourceType Publisher
StartPage 74
SubjectTerms Compilation error
Compilation error classification
Computational modeling
Deep learning
Location awareness
Neural networks
Novice students' programs
Predictive models
Program processors
Semantics
Static analysis
Title CLACER: A Deep Learning-based Compilation Error Classification Method for Novice Students' Programs
URI https://ieeexplore.ieee.org/document/9529845