Inherent risks identification in a contract document through automated rule generation

Due to the limited time available during the bidding process, construction companies may fail to identify risky terms in the contract document before submission. This paper proposes a method that uses natural language processing (NLP) models, such as dependency parser and bidirectional encoder repre...

Full description

Saved in:
Bibliographic Details
Published inAutomation in construction Vol. 172; p. 106044
Main Authors Kim, Junho, Kwon, Baekgyu, Lee, JeeHee, Mun, Duhwan
Format Journal Article
LanguageEnglish
Published Elsevier B.V 01.04.2025
Subjects
Online AccessGet full text
ISSN0926-5805
DOI10.1016/j.autcon.2025.106044

Cover

Loading…
More Information
Summary:Due to the limited time available during the bidding process, construction companies may fail to identify risky terms in the contract document before submission. This paper proposes a method that uses natural language processing (NLP) models, such as dependency parser and bidirectional encoder representations from transformers (BERT), to disassemble and simplify sentences in a contract document and automatically generate rules for identifying risk sentences. The sentence disassembly process is conducted in the following order: adjunct separation, parallel structure separation, and subject-verb-object separation. Subsequently, risk sentence identification rules are automatically generated through the input of risk terms. The performance of the proposed method is verified using the generated rules. In the experiments, the accuracies of sentence disassembly and risk sentence identification were 95.5 % and 91.3 %, respectively. The proposed method can assist experts in reviewing contracts, significantly reducing the time required to generate new identification rules. •Method for generating rules to identify risk sentences.•Sentences in a contract were disassembled using NLP-based separation methods.•The contract lexicon can be automatically updated and expanded.•Identification rules were automatically generated using the lexicon and applied to contracts.•Achieved 91.3 % accuracy in identifying risk sentences.
ISSN:0926-5805
DOI:10.1016/j.autcon.2025.106044