This page presents the results and experiments used for the IJCAI 2016 article.
We introduce a framework for knowledge-based sequence mining based on Answer Set Programming (ASP). We begin by modeling the basic task and refine it in the sequel in several ways. First, we show how easily condensed patterns can be extracted by modular extensions of the basic approach. Second, we illustrate how ASP's preference handling capacities can be exploited for mining patterns of interest. In doing so, we want to demonstrate how easy it is to incorporate knowledge into an ASP-based mining process. Since this comes with a trade-off in effectiveness, we provide an empirical study comparing our approach with closely related sequence mining approaches.
Our encodings have been design for the solver from Potassco ASP tools suite. We use clingo 4.5 as a solver and the ASPRIN system to extract preferred patterns (including update for ASPRIN-3).
You'll find below the ASP facts of the datasets we used to benchmark our ASP encoding. Simulated datasets are also available in a format that can be read by CPSM (Constraint Based Sequence Mining)
Simulated datasets used to evaluate computing performances:
Real datasets used to compare mining tasks:
Computing time: comparison of ASP solving wrt CPSM-emb
Number of patterns: compare our different encodings on real datasets