Data Mining &
Information Retrieval Lab.

Research Topics

a non exhaustive list ...

Efficient and Effective Learning to Rank

Web search engines, and other ranking systems, use very complex models in order to estimate the relevance of a document w.r.t. a given user query. These models are made of thousands regression trees, and their evaluation is computationally expensive. We aim to develop new Machine Learning (ML) algorithms for the construction of high-quality models that are also efficient at exploitation time.

Major Publications:

Adversarial Machine Learning

Machine Learning (ML) is increasingly used in several applications and different contexts. When ML is leveraged to ensure system security, such as in spam filtering and intrusion detection, everybody acknowledges the need of training ML models resilient to adversarial manipulations. To date, research on adversarial ML has mostly focused on deep neural networks. Despite their effectiveness in so called non-perceptual scenarios,  decision  tree  ensembles have received only limited attention by the security and machine learning communities. We aim at filling this gap by investigating novel algorithms for robust ensemble learning as well as novel solutions for the evaluation of models' robustness.

Major Publications:

Explainable AI

EXplainable AI (XAI) research aims at answering the ineludible need for AI systems of being trustworthy, fair, and understandable. Among the models that are most effective, we limit our interest to forests of decision trees such as Gradient Boosted Decision Trees (GBDTs). These are very accurate in several application scenarios, but their large size (up to thousands of decision trees) makes them a black box that is impossible to be interpreted by a human. We aim at building models that explainable and fair in classification and ranking.

Major Publications: