PRETSA: Event log sanitization for privacy-aware process discovery
Fahrenkrog-Petersen, Stephan A.
;
Aa, Han van der
;
Weidlich, Matthias

DOI:
|
https://doi.org/10.1109/ICPM.2019.00012
|
URL:
|
https://ieeexplore.ieee.org/document/8786060
|
Document Type:
|
Conference or workshop publication
|
Year of publication:
|
2019
|
Book title:
|
2019 International Conference on Process Mining : Aachen, Germany, 24-26 June 2019, proceedings
|
Page range:
|
1-8
|
Conference title:
|
ICPM 2019
|
Location of the conference venue:
|
Aachen, Germany
|
Date of the conference:
|
24.-26.06.2019
|
Place of publication:
|
Piscataway, NJ
|
Publishing house:
|
IEEE
|
ISBN:
|
978-1-72810-920-6 , 978-1-72810-919-0
|
Publication language:
|
English
|
Institution:
|
School of Business Informatics and Mathematics > Methoden der künstlichen Intelligenz (Juniorprofessur) (van der Aa 2020-)
|
Subject:
|
004 Computer science, internet 330 Economics
|
Abstract:
|
Event logs that originate from information systems enable comprehensive analysis of business processes, e.g., by process model discovery. However, logs potentially contain sensitive information about individual employees involved in process execution that are only partially hidden by an obfuscation of the event data. In this paper, we therefore address the risk of privacy-disclosure attacks on event logs with pseudonymized employee information. To this end, we introduce PRETSA, a novel algorithm for event log sanitization that provides privacy guarantees in terms of k-anonymity and t-closeness. It thereby avoids disclosure of employee identities, their membership in the event log, and their characterization based on sensitive attributes, such as performance information. Through step-wise transformations of a prefix-tree representation of an event log, we maintain its high utility for discovery of a performance-annotated process model. Experiments with real-world data demonstrate that sanitization with PRETSA yields event logs of higher utility compared to methods that exploit frequency-based filtering, while providing the same privacy guarantees.
|
 | Dieser Datensatz wurde nicht während einer Tätigkeit an der Universität Mannheim veröffentlicht, dies ist eine Externe Publikation. |
Search Authors in
You have found an error? Please let us know about your desired correction here: E-Mail
Actions (login required)
 |
Show item |
|
|