Assignment Task
Part A - Disco Analysis
For this part, you will use a real-life event log that is available for process mining analysis. Using the Disco software, analyze the complete, unfiltered, original log to address each of the three questions, unless specified otherwise.
a. Compare and contrast two process models (maps) generated using different settings: one with 100% activities and 100% paths, and the other with 50% activities and 25% paths. Explain why a model with 0% paths should not be used to understand process behavior.
b. Investigate the case variants identified from the log. How many case variants are present overall? List the five most frequent case variants and their respective frequencies. Determine the coverage of these five variants, that is, the share of all cases in the log that they account for. Explore the low-frequency variants, particularly those with fewer than 10 cases. Discuss the implications of a large number of case variants for generating a representative process model.
c. Generate a process model for each of the following two groups of cases:
   a. Group A: Cases for Car Loans.
   b. Group B: Cases for Home Improvements.
   Describe any observed differences or similarities between the two process models and their respective groups of cases.
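The variant figures asked for in question b (number of variants, most frequent variants, coverage, and low-frequency variants) can also be cross-checked outside Disco. The following is a minimal sketch only: the toy `events` list, its `(case_id, activity, timestamp)` layout, and the helper names `variant_counts` and `coverage` are all hypothetical and are not taken from the assignment log or from Disco. A variant here is simply the ordered sequence of activities of one case.

```python
from collections import Counter, defaultdict

# Hypothetical toy event log: (case_id, activity, timestamp) rows.
# The real assignment log will have different activities and columns.
events = [
    ("c1", "Submit", 1), ("c1", "Review", 2), ("c1", "Approve", 3),
    ("c2", "Submit", 1), ("c2", "Review", 2), ("c2", "Approve", 3),
    ("c3", "Submit", 1), ("c3", "Review", 2), ("c3", "Reject", 3),
    ("c4", "Submit", 1), ("c4", "Approve", 2),
]

def variant_counts(events):
    """Count how many cases follow each distinct activity sequence."""
    traces = defaultdict(list)
    # Order events by case and then by timestamp before building traces.
    for case_id, activity, _ts in sorted(events, key=lambda e: (e[0], e[2])):
        traces[case_id].append(activity)
    return Counter(tuple(trace) for trace in traces.values())

def coverage(counts, top_n):
    """Share of all cases covered by the top_n most frequent variants."""
    total_cases = sum(counts.values())
    top_cases = sum(n for _variant, n in counts.most_common(top_n))
    return top_cases / total_cases

counts = variant_counts(events)
n_variants = len(counts)                 # number of distinct variants
top_five = counts.most_common(5)         # (variant, frequency) pairs
top_five_coverage = coverage(counts, 5)  # fraction of cases covered
rare = {v: n for v, n in counts.items() if n < 10}  # fewer than 10 cases

print(n_variants, top_five_coverage, len(rare))
```

On the toy data, four cases fall into three variants, so the single most frequent variant covers half of the cases. With the real log, the same counts should match the figures shown in Disco's case variant view, provided the events are ordered the same way.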