The University of Melbourne
Browse

Dataset for Bootstrap Generalization Estimations and Ground Truth Comparison for Discovered Process Models in Process Mining

Version 2 2024-12-11, 22:39
Version 1 2024-08-11, 10:58
dataset
posted on 2024-12-11, 22:39 authored by Anandi KarunaratneAnandi Karunaratne, Artem PolyvyanyyArtem Polyvyanyy

This dataset contains estimated generalization values evaluated using the bootstrap generalization framework and corresponding ground truth values for models, with characteristics of the event logs used for the estimations. The estimated generalization values are compared with the ground truth values to assess the effectiveness of using system estimation techniques such as bootstrapping. The analysis also explores the coverage of various experimental parameters. Detailed information about the experiment is available in ieeexplore.ieee.org/document/10680679.

The dataset used in the experiment, as well as the experimental code, can be accessed at our GitHub repository.

We kindly request that you cite our work if you use this dataset in your research:
A. Karunaratne, A. Polyvyanyy, and A. Moffat, “The role of log representativeness in estimating generalization in process mining,” in Int. Conf. Process Mining. IEEE, 2024, pp. 33-40.


History

Usage metrics

    University of Melbourne

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC