• Journal of Internet Computing and Services
    ISSN 2287 - 1136 (Online) / ISSN 1598 - 0170 (Print)
    https://jics.or.kr/

A MapReduce-Based Workflow BIG-Log Clustering Technique


Min-Hyuck Jin, Kwanghoon Pio Kim, Journal of Internet Computing and Services, Vol. 20, No. 1, pp. 87-96, Feb. 2019
10.7472/jksii.2019.20.1.87, Full Text:
Keywords: workflow process mining, structured information control nets, workflow process enactment event logs, temporal workcase, temporal worktransference, XES event stream data format, Hadoop MapReduce Framework

Abstract

In this paper, we propose a MapReduce-supported clustering technique for collecting and classifying distributed workflow enactment event logs as a preprocessing tool. Especially, we would call the distributed workflow enactment event logs as Workflow BIG-Logs, because they are satisfied with as well as well-fitted to the 5V properties of BIG-Data like Volume, Velocity, Variety, Veracity and Value. The clustering technique we develop in this paper is intentionally devised for the preprocessing phase of a specific workflow process mining and analysis algorithm based upon the workflow BIG-Logs. In other words, It uses the Map-Reduce framework as a Workflow BIG-Logs processing platform, it supports the IEEE XES standard data format, and it is eventually dedicated for the preprocessing phase of the ρ-Algorithm that is a typical workflow process mining algorithm based on the structured information control nets. More precisely, The Workflow BIG-Logs can be classified into two types: of activity-based clustering patterns and performer-based clustering patterns, and we try to implement an activity-based clustering pattern algorithm based upon the Map-Reduce framework. Finally, we try to verify the proposed clustering technique by carrying out an experimental study on the workflow enactment event log dataset released by the BPI Challenges.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from November 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[APA Style]
Jin, M. & Kim, K. (2019). A MapReduce-Based Workflow BIG-Log Clustering Technique. Journal of Internet Computing and Services, 20(1), 87-96. DOI: 10.7472/jksii.2019.20.1.87.

[IEEE Style]
M. Jin and K. P. Kim, "A MapReduce-Based Workflow BIG-Log Clustering Technique," Journal of Internet Computing and Services, vol. 20, no. 1, pp. 87-96, 2019. DOI: 10.7472/jksii.2019.20.1.87.

[ACM Style]
Min-Hyuck Jin and Kwanghoon Pio Kim. 2019. A MapReduce-Based Workflow BIG-Log Clustering Technique. Journal of Internet Computing and Services, 20, 1, (2019), 87-96. DOI: 10.7472/jksii.2019.20.1.87.