0
  • DE
  • EN
  • FR
  • International Database and Gallery of Structures

Advertisement

Machine learning-based automated classification of worker-reported safety reports in construction

Author(s):



Medium: journal article
Language(s): English
Published in: Journal of Information Technology in Construction, , v. 27
Page(s): 926-950
DOI: 10.36680/j.itcon.2022.045
Abstract:

Limited academic attention has been paid to the applicability of Machine Learning (ML) approaches for analyzing worker-reported near-miss safety reports, as opposed to injury reports, at construction sites. Although resource-efficient analysis through ML of large volumes of such data at construction sites can help guide practitioners in decision-making to prevent injuries. The current study addresses this research gap by evaluating the relevance of ML approaches through quantitative and qualitative methods for scaling efficient near-miss reporting programs at construction sites. The study uses an extensive experimentation strategy consisting of input data processing, n-gram modeling, and sensitivity analysis. It first tests the proposition that, despite the data-quality challenges, the high performance of different ML algorithms can be achieved in automatically classifying the textual near-miss observations. The study relies on worker-reported near-miss data collected from a real construction site in Kuwait. The classification performance of various ML approaches is evaluated using F1 scores for three academically novel but commonly used category labels at the sites - "Unsafe Act (UA)," "Unsafe Condition (UC)," and "Good Observation (GO)." In addition, the practitioner's input was utilized to assess the practical applicability of ML classifiers for construction sites. The conventional Logistic Regression (LR) classifiers have a comparatively high F1 score of 0.79. However, ML classifiers faced challenges in distinguishing between UA and UC. Further, the analysis reveals that optimal ML classifiers may lose on being acceptable to human decision-makers. Overall, despite the promising performance of ML tools for the near-miss data, the sites with low maturity of reporting systems may find themselves unable to leverage ML to scale their reporting systems. A simplified experimentation strategy like the current study could help practitioners identify the data-specific optimal ML approaches in future applications.

Structurae cannot make the full text of this publication available at this time. The full text can be accessed through the publisher via the DOI: 10.36680/j.itcon.2022.045.
  • About this
    data sheet
  • Reference-ID
    10702802
  • Published on:
    11/12/2022
  • Last updated on:
    16/12/2022
 
Structurae cooperates with
International Association for Bridge and Structural Engineering (IABSE)
e-mosty Magazine
e-BrIM Magazine