Process Mining: Applied Discipline and Software Implementations
Keywords:Process mining, Information technology, Business process management, BPM, Event data, Event logs, XES, ProM, Disco, Celonis
Background. A precise picture of how business processes (in the interpretation by Andrea Burattin) are performed in real-life is vitally important for an organization because it shows actual situation revealing gaps and bottlenecks. Process mining is a discipline with the purpose to research processes using as the input so-called event data (or event logs) which in essence is a digital footprint left in IT systems as the result of business processes execution.
Objective. The goal of the study is to overview current state of process mining and find actual scientific and practice tasks in this field as well as justify and formalize requirements to the information technologies with the purpose to implement the found set of process mining applied tasks.
Methods. The method used by the authors to prepare current overview consisted of the following steps: (a) analysis of information sources; (b) finding and formalization of actual scientific as well as practice tasks; (c) description of the requirements to the information technologies with the purpose to implement the found set of actual tasks.
Results. It has been found out that process mining as an applied discipline has been actively developed for 20 years; significant contribution to creating the scientific basis of process mining has been done in Eindhoven University of Technology (The Netherlands) under direction of professor Wil M.P. van der Aalst. It also has been found actual scientific and practice tasks of process mining: event data preparation, dealing with concept drift, operational support, event data streams processing, handling big event data, improving process mining tools usability for the end users. It has been formalized requirements and specified quality attributes for the information technologies with the purpose to implement the found actual tasks. Architecture of the information technologies has been proposed by the authors.Conclusions. Currently the theoretical core of process mining has mainly been developed and quite structured. However, despite of the fact that mathematical methods and software tools have been successfully used in practice for a few years, the request for the intellectual business process analysis has not been fulfilled yet. The authors have found out that relevant information technologies should supply such functions as handling big event logs, dealing with event data streams as well as operational support of business processes which are at the execution stage.
A. Burattin, Process Mining Techniques in Business Environments. Cham, Switzerland: Springer, 2015. doi: 10.1007/978-3-319-17482-2
W.M.P. van der Aalst, Process Mining: Data Science in Action, 2nd ed. Berlin, Germany: Springer, 2016. doi: 10.1007/978-3-662-49851-4
A. Nerode, “Linear automaton transformations”, Proc. Am. Math. Soc., vol. 9, no. 4, pp. 541–544, Aug. 1958. doi: 10.2307/2033204
E.M. Gold, “Language identification in the limit”, Information and Control, vol. 10, no. 5, pp. 447–474, May 1967. doi: 10.1016/S0019-9958(67)91165-5
A. Viterbi, “Error bounds for convolutional codes and an asymptotically optimum decoding algorithm”, IEEE Trans. Inform. Theory, vol. 13, no. 2, pp. 260–269, April 1967. doi: 10.1109/TIT.1967.1054010
L.E. Baum et al., “A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains”, Ann. Math. Statist., vol. 41, no. 1, pp. 164–171, Feb. 1970. doi: 10.1214/aoms/1177697196
A.W. Biermann and J.A. Feldman, “On the synthesis of finite-state machines from samples of their behavior”, IEEE Trans. Comp., vol. C-21, no. 6, pp. 592–597, June 1972. doi: 10.1109/TC.1972.5009015
J.E. Cook and A.L. Wolf, “Discovering models of software processes from event-based data; CU-CS-819-96”, University of Colorado, Department of Computer Science, Boulder, CO, USA, Nov 1996.
R. Agrawal et al., “Mining Process Models from Workflow Logs”, in Advances in Database Technology – EDBT'98 (EDBT 1998 ). Lecture Notes in Computer Science, vol. 1377, H.J. Schek et al., eds. Berlin, Heidelberg, Germany: Springer, 1998, pp. 467–483. doi: 10.1007/BFb0101003
A. Datta, “Automating the discovery of AS-IS business process models: Probabilistic and algorithmic approaches”, Inform. Syst. Res., vol. 9, no. 3, pp. 275–301, Sep. 1998. doi: 10.1287/isre.9.3.275
W.M.P. van der Aalst and A.J.M.M. Weijters, “Process mining: A research agenda”, Computers in Industry, vol. 53, no. 3, pp. 231–244, June 2004. doi: 10.1016/j.compind.2003.10.001
W.M.P. van der Aalst et al., “Process mining manifesto”, in Business Process Management Workshops. BPM 2011 International Workshops. Lecture Notes in Business Information Processing, vol. 99, F. Daniel et al., eds. Berlin, Heidelberg, Germany: Springer, 2012, pp. 169–194. doi: 10.1007/978-3-642-28108-2_19
Gartner IT Glossary: Automated Business Process Discovery (ABPD) [Online]. Available: https://www.gartner.com/it-glossary/automated-business-process-discovery-abpd
I. Ailenei et al., “Definition and validation of process mining use cases”, in Business Process Management Workshops. BPM 2011 International Workshops. Lecture Notes in Business Information Processing, vol. 99, F. Daniel et al., eds. Berlin, Heidelberg, Germany: Springer, 2012, pp. 75–86. doi: 10.1007/978-3-642-28108-2_7
M. Kerremans. (2018). Market Guide for Process Mining [Online]. Available: https://www.gartner.com/doc/3870291/market-guide-process-mining
A. Batyuk and V. Voityshyn, “Business processes monitoring by means of real-time visual dashboards”, in Proc. 6th Int. Academic Conf. Information, Communication, Society 2017 (ICS 2017), Lviv, Ukraine, 2017, pp. 204–205.
B.F. van Dongen and W.M.P. van der Aalst, “A meta model for process mining data”, in Proc. Open Interop Workshop on Enterprise Modelling and Ontologies for Interoperability, Co-located with CAiSE'05 Conference, Porto, Portugal, 13–14 June 2005. Available: http://ceur-ws.org/Vol-160/paper11.pdf
IEEE Standard for eXtensible Event Stream (XES) for Achieving Interoperability in Event Logs and Event Streams, IEEE Standard 1849-2016, 2016.
OpenXES [Online]. Available: http://www.xes-standard.org/openxes/start
W.M.P. van der Aalst et al., “Workflow mining: discovering process models from event logs”, IEEE Trans. Knowl. Data Eng., vol. 16, no. 9, pp. 1128–1142, 2004. doi: 10.1109/TKDE.2004.47
Ch.W. Günther and W.M.P. van der Aalst, “Fuzzy mining – Adaptive process simplification based on multi-perspective metrics”, in Proceedings of the 5th International Conference on Business Process Management. BPM 2007. Lecture Notes in Computer Science, vol. 4714, G. Alonso et al., eds. Berlin, Heidelberg, Germany: Springer, 2007, pp. 328–343. doi: 10.1007/978-3-540-75183-0_24
Ch.W. Günther and A. Rozinat, “Disco: discover your processes”, in Proc. Demonstration Track of the 10th International Conference on Business Process Management (BPM 2012), Tallinn, Estonia, 2012, vol. 940, pp. 40–44.
A. Burattin et al. (2012). Heuristics Miners for Streaming Event Data [Online]. Available: https://arxiv.org/abs/1212.6383
A. Batyuk et al., “Software architecture design of the real-time processes monitoring platform”, in Proc. 2018 IEEE 2nd Int. Conf. Data Stream Mining & Processing (DSMP'2018), Lviv, Ukraine, 2018, pp. 98–101. doi: 10.1109/DSMP.2018.8478589
A. Batyuk and V. Voityshyn, “Streaming process discovery for lambda architecture-based process monitoring platform”, in 2018 IEEE 13th Int. Sci. Tech. Conf. Computer Science and Information Technologies (CSIT'2018), Lviv, Ukraine, 2018, pp. 298–301.
S.J.J. Leemans et al., “Scalable process discovery and conformance checking”, Software & Systems Modeling, vol. 17, no. 2, pp. 599–631, 2018. doi: 10.1007/s10270-016-0545-x
M.O. Kherbouche et al., “Towards a better assessment of event logs quality”, in Proc. 2016 IEEE Symposium Series on Computational Intelligence (SSCI), Athens, 2016, pp. 1–8. doi: 10.1109/SSCI.2016.7849946
R.P. Jagadeesh Chandra Bose et al., “Handling concept drift in process mining”, in Advanced Information Systems Engineering. CAiSE 2011. Lecture Notes in Computer Science, vol. 6741. London, UK, 2011, pp. 391–405. doi: 10.1007/978-3-642-21640-4_30
M.V.M. Kuma et al., “Capturing the sudden concept drift in process mining”, in BPM Workshops, vol. 1371, pp. 132–143, 2015.
J. Carmona and R. Gavaldà, “Online techniques for dealing with concept drift in process mining”, in Advances in Intelligent Data Analysis XI. IDA 2012. Lecture Notes in Computer Science, vol. 7619, J. Hollmén et al., eds. Berlin, Heidelberg, Germany: Springer, 2012, pp. 90–102. doi: 10.1007/978-3-642-34156-4_10
W.M.P. van der Aalst et al., “Time prediction based on process mining”, Inform. Systems, vol. 36, no. 2, pp. 450–475, 2011. doi: 10.1016/j.is.2010.09.001
S.J. van Zelst et al., “Event stream-based process discovery using abstract representations”, Knowl. Inform. Syst., vol. 54, no. 2, pp. 407–535, 2018. doi: 10.1007/s10115-017-1060-2
J. Evermann, “Scalable process discovery using map-reduce”, IEEE Trans. Services Comp., vol. 9, no. 3, pp. 469–481, 2016. doi: 10.1109/TSC.2014.2367525
Lambda Architecture [Online]. Available: http://lambda-architecture.net
Kappa Architecture [Online]. Available: http://milinda.pathirage.org/kappa-architecture.com
M. Du et al. (2018). Techniques for Interpretable Machine Learning [Online]. Available: https://arxiv.org/abs/1808.00033
I.M. Ailenei, “Process mining tools: A comparative analysis”, M.S. thesis, Department of Mathematics and Computer Science, Eindhoven University of Technology, Eindhoven, The Netherlands. Available: http://alexandria.tue.nl/extra1/afstversl/ wsk-i/ailenei2011.pdf
I.M. Ailenei et al., “Towards an evaluation framework for process mining systems”, BPM Center Report BPM-11-13, BPMcenter.org, 2011. Available: http://bpmcenter.org/wp-content/uploads/reports/2011/BPM-11-13.pdf
C.J. Turner et al., “Business process mining: From theory to practice”, Business Process Management J., vol. 18, no. 3, pp. 493–512, June 2012. doi: 10.1108/14637151211232669
D. Verstraete, “Process mining in practice: Comparative study of process mining software”, M.S. thesis, Faculty of Economics and Business Administration, Ghent University, Ghent, Belgium. Available: https://lib.ugent.be/fulltxt/RUG01/002/ 165/042/RUG01-002165042_2014_0001_AC.pdf
M. Kebede, “Comparative evaluation of process mining tools”, M.S. thesis, Faculty of Mathematics and Computer Science, Institute of Computer Science, University of Tartu, Tartu, Estonia. Available: https://comserv.cs.ut.ee/home/files/gizaw_MSc.+in+Software+Engineering_2015.pdf?study=ATILoputoo&reference=BB4063305540E49644F08DD06F6C50F5D0266630
W.M.P. van der Aalst et al. (2017). RapidProM: mine your processes and not just your data [Online]. Available: https://arxiv.org/abs/1703.03740
B.F. van Dongen et al., “The ProM framework: A new era in process mining tool support”, in Applications and Theory of Petri Nets 2005. ICATPN 2005. Lecture Notes in Computer Science, vol. 3536, G. Ciardo and P. Darondeau, eds. Berlin, Heidelberg, Germany: Springer, 2005, pp. 444–454. doi: 10.1007/11494744_25
H.M.W. Verbeek et al., “ProM 6: the process mining toolkit”, in Proc. Business Process Management 2010 Demonstration Track, vol. 615, M. La Rosa, Ed. CEUR-WS.org, 2010, pp. 34–39.
A.K.A. de Medeiros et al., “Process mining for ubiquitous mobile systems: an overview and a concrete algorithm”, in Ubiquitous Mobile Information and Collaboration Systems. UMICS 2004. Lecture Notes in Computer Science, vol. 3272, L. Baresi et al., eds. Berlin, Heidelberg, Germany: Springer, 2004, pp. 151–165. doi: 10.1007/978-3-540-30188-2_12
L. Wen et al., “Mining process models with non-free-choice”, Data Mining & Knowledge Discovery, vol. 15, no. 2, pp. 145–180, 2007. doi: 10.1007/s10618-007-0065-y
L. Wen et al., “Mining process models with prime invisible”, Data & Knowledge Eng., vol. 69, no. 10, pp. 999–1021, 2010. doi: 10.1016/j.datak.2010.06.001
Showcase: SAP Process Mining by Celonis [Online]. Available: https://www.sap.com/developer/showcases/process-mining-by-celonis.html
F. Veit et al., “The proactive insights engine: process mining meets machine learning and artificial intelligence”, in 15th Int. Conf. Business Process Management (BPM'2017). BPM Demo Track and BPM Dissertation Award, vol. 1920, Barcelona, Spain, 2017.
System and Software Quality Models, ISO/IEC 25010, 2011.
W. Fan and A. Bifet, “Mining big data: Current status, and forecast to the future”, SIGKDD Explorations, vol. 14, no. 2, pp. 1–5, 2012. doi: 10.1145/2481244.2481246
P. McGee. (2017). New Big Data Trend Tracks ‘Digital Footprints’ [Online]. Available: https://www.ft.com/content/402553f4-c4a4-11e7-b30e-a7c1c7c13aab
Copyright (c) 2018 Igor Sikorsky Kyiv Polytechnic Institute
This work is licensed under a Creative Commons Attribution 4.0 International License.Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under CC BY 4.0 that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work