Skip to content
Littera Scripta
Menu
  • About journal
  • Ethics
  • Editorial board
  • Instructions for Authors
  • Publication fees
  • Contact
  • Waiver policy
  • Open Acces Policy
  • Peer Review Policy
Menu

Mitigating challenges: Handling mis sing values and imbalanced data in bankruptcy prediction using machine learning

Abstract: The research on financial distress has become essential because the predicted results can serve as an early warning for managers, investors, and banks. Financial ratios calculated in financial reports can serve as indicators to assess the company’s condition. One of the approaches used for bankruptcy prediction is employing machine learning methods. Data requirements with balanced classes and the need to process data with complete parameters/features are prerequisites for building an accurate bankruptcy prediction model. In this study, we employed data balancing techniques such as downsampling and filling missing feature values using the average of nearest neighbors in data preprocessing before training the prediction model. From our experiments, we found that by addressing missing values and balancing the data, the F1 score of the prediction model using Random Forest (RF) improved by 30% compared to not addressing missing data and data imbalance. Although our testing used the Polish company dataset, which may have different characteristics from companies in other countries, the proposed strategies can serve as an initial approach for training datasets of other companies using machine learning methods.

Authors: Ednawati Rainarli, Amine Sabek
Keywords: Bankruptcy prediction, financial distress, imbalanced data, machine learning, missing value
Volume: 16
Issue: 2

Full article

  • Current Issue

Archive

  • Littera Scripta 2024 (Volume 17)
    • 2/2024
    • 1/2024
  • Littera Scripta 2023 (Volume 16)
    • 2/2023
    • 1/2023
  • Littera Scripta 2022 (Volume 15)
    • 2/2022
    • 1/2022
  • Littera Scripta 2021 (Volume 14)
    • 1/2021
  • Littera Scripta 2020 (Volume 13)
    • 2/2020
    • 1/2020
  • Littera Scripta 2019 (Volume 12)
    • 2/2019
    • 1/2019
  • Littera Scripta 2018 (Volume 11)
    • 2/2018
    • 1/2018
  • Littera Scripta 2017 (Volume 10)
    • 2/2017
    • 1/2017
  • Littera Scripta 2016 (Volume 9)
    • 3/2016
    • 2/2016
    • 1/2016
  • Littera Scripta 2015 (Volume 8)
    • 2/2015
    • 1/2015
  • Littera Scripta 2014 (Volume 7)
    • 2/2014
    • 1/2014
  • Littera Scripta 2013 (Volume 6)
    • 2/2013
    • 1/2013
  • Littera Scripta 2012 (Volume 5)
    • 2/2012
    • 1/2012
  • Littera Scripta 2011 (Volume 4)
    • 2/2011
    • 1/2011
  • Littera Scripta 2010 (Volume 3)
    • 1-2/2010
© 2025 Littera Scripta | Powered by Minimalist Blog WordPress Theme