Additional short research projects on computer vision and machine learning for robotics, mobile phones and other embedded systems can be seen in Supervised Student Thesis Projects.
We have developed efficient transformer-based architectures to process event camera data for activity recognition (EventTransformer and EventTransformer+)
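For illustration only, the sketch below shows one common way to feed event camera data to a transformer: events are accumulated into a two-channel (polarity) histogram, split into patch tokens, and classified by a small encoder. This is not the actual EventTransformer design; the TinyEventTransformer name, the binning scheme, and all layer sizes are illustrative assumptions.

```python
# Minimal sketch (NOT the actual EventTransformer architecture): events are
# binned into a 2-channel histogram, split into patch tokens, and classified
# by a small transformer encoder. All names and sizes are illustrative.
import torch
import torch.nn as nn

class TinyEventTransformer(nn.Module):
    def __init__(self, img_size=64, patch=8, dim=128, heads=4, layers=2, classes=10):
        super().__init__()
        n_patches = (img_size // patch) ** 2
        self.patch = patch
        self.proj = nn.Linear(patch * patch * 2, dim)        # 2 polarity channels
        self.pos = nn.Parameter(torch.zeros(1, n_patches, dim))
        enc = nn.TransformerEncoderLayer(dim, heads, dim * 4, batch_first=True)
        self.encoder = nn.TransformerEncoder(enc, layers)
        self.head = nn.Linear(dim, classes)

    def forward(self, frames):                               # (B, 2, H, W) histograms
        B, C, H, W = frames.shape
        p = self.patch
        tokens = frames.unfold(2, p, p).unfold(3, p, p)       # (B, 2, H/p, W/p, p, p)
        tokens = tokens.permute(0, 2, 3, 1, 4, 5).reshape(B, -1, C * p * p)
        x = self.encoder(self.proj(tokens) + self.pos)
        return self.head(x.mean(dim=1))                      # mean-pool, then classify

def events_to_histogram(events, img_size=64):
    """Accumulate (x, y, polarity) events into a 2-channel count image."""
    hist = torch.zeros(2, img_size, img_size)
    for x, y, pol in events:
        hist[int(pol), int(y), int(x)] += 1.0
    return hist

# Example: 500 random events -> one activity logit vector
xy = torch.randint(0, 64, (500, 2)).float()
pols = torch.randint(0, 2, (500, 1)).float()
frame = events_to_histogram(torch.cat([xy, pols], dim=1)).unsqueeze(0)
print(TinyEventTransformer()(frame).shape)                   # torch.Size([1, 10])
```

In practice the published models exploit the sparsity and timing of the event stream rather than dense histograms; this toy version only conveys the tokenise-then-encode structure.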
In addition, as part of a broader study centered on sleep, the EVENTSLEEP project investigates the suitability of event cameras for analyzing, in a non-invasive manner, specific behaviors that occur during sleep and lead to sleep disorders.
We are interested in increasing the autonomy of drones and swarms so they can perform more complex tasks, in particular for cinematography (around the work in CineMPC and CineTransfer, we have proposed different approaches to make drones more autonomous when filming) and for drone show generation (Gen-Swarms, Adapting Deep Generative Models to Swarms of Drones).
Deep Learning models to learn relevant concepts or features from endoscopic images that can facilitate 3D mapping. A large part of this work has been developed within the ENDOMAPPER project, focused on colonoscopy data. We are also developing new methods to improve automated bronchoscopy assistance tools.
We have developed novel deep learning models for different recognition tasks to analyze people's behaviours and activities in the former FILOVI project. Currently, we are exploring novel efficient VLM-based strategies to improve long video understanding (FALCONeye).
Semantic segmentation on a new hyperspectral dataset captured in a realistic waste-sorting facility scenario.
Deep Learning models for recognition tasks in different domains using multi-camera systems and heterogeneous sensors, with a particular focus on efficiency in data and computational requirements.
Deep Learning models for semantic segmentation in different domains, focusing on the lack of dense training data and on the use of multi-modal data.
Learning visual models from audio-visual human-robot interaction in assistive settings. Part of the CHIST-ERA project "Interactive Grounded Language Understanding" (IGLU).
A novel efficient interaction paradigm that approximates any per-pixel magnitude from a few user strokes by propagating the sparse user input to each pixel of the image. This can be used in many image filters.
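To convey the propagation idea (a rough illustration, not the exact published formulation), the sketch below spreads sparse stroke values to every pixel by solving an edge-aware graph Laplacian system, so values diffuse within homogeneous regions but stop at strong image edges. The propagate_strokes helper, the affinity weighting, and the parameters beta and lam are hypothetical choices.

```python
# Hedged sketch of sparse-to-dense stroke propagation: unknown per-pixel values
# are solved from a weighted graph Laplacian whose edge weights decay across
# strong image gradients, so user strokes spread inside homogeneous regions.
import numpy as np
import scipy.sparse as sp
import scipy.sparse.linalg as spla

def propagate_strokes(image, stroke_mask, stroke_values, beta=50.0, lam=100.0):
    """image: (H, W) grayscale in [0, 1]; stroke_mask: bool (H, W);
    stroke_values: (H, W) target magnitude where stroke_mask is True."""
    H, W = image.shape
    n = H * W
    idx = np.arange(n).reshape(H, W)
    rows, cols, vals = [], [], []
    # 4-neighbour affinities: w = exp(-beta * |I_p - I_q|^2)
    for di, dj in [(0, 1), (1, 0)]:
        a = idx[: H - di, : W - dj].ravel()
        b = idx[di:, dj:].ravel()
        d = (image[: H - di, : W - dj] - image[di:, dj:]).ravel()
        w = np.exp(-beta * d ** 2)
        rows += [a, b]; cols += [b, a]; vals += [w, w]
    A_w = sp.csr_matrix((np.concatenate(vals),
                         (np.concatenate(rows), np.concatenate(cols))), shape=(n, n))
    L = sp.diags(np.asarray(A_w.sum(axis=1)).ravel()) - A_w   # graph Laplacian
    c = stroke_mask.ravel().astype(float)                     # stroke indicator
    A = L + lam * sp.diags(c)                                 # soft data term
    b_rhs = lam * c * stroke_values.ravel()
    return spla.spsolve(A.tocsc(), b_rhs).reshape(H, W)

# Toy example: strokes with values 0 and 1 on a two-region image
img = np.zeros((32, 32)); img[:, 16:] = 1.0
mask = np.zeros((32, 32), bool); mask[16, 4] = mask[16, 28] = True
vals = np.zeros((32, 32)); vals[16, 28] = 1.0
dense = propagate_strokes(img, mask, vals)
print(dense[16, 2], dense[16, 30])   # ~0 on the left region, ~1 on the right
```

The same solve works for any per-pixel magnitude (exposure, depth, filter strength), which is what makes a single interaction paradigm reusable across many image filters.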
Social media provides large amounts of visual data, generating new challenges for computer vision, as well as new opportunities and applications.
We have built a wearable catadioptric vision system and designed algorithms for its use in semantic mapping and navigation assistance systems.
We have built a dataset with several wearable cameras simultaneously recording daily office activities, and evaluated several alternatives for activity recognition on it.
Efficient Place Recognition. Gist-based description for panoramas. Hierarchical Visual Localization.
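As a simplified illustration of gist-style place recognition (not the descriptors from our papers), the snippet below pools gradient-orientation energy over a coarse spatial grid and ranks database panoramas by cosine similarity; the grid size, bin count, and function names are assumptions. In a hierarchical pipeline, a finer feature-based step would then verify only the top-ranked candidates.

```python
# Illustrative gist-style global descriptor: gradient-orientation energy pooled
# over a coarse grid, compared by cosine similarity for place retrieval.
import numpy as np

def gist_descriptor(image, grid=4, bins=8):
    """image: (H, W) grayscale array -> L2-normalised (grid*grid*bins,) vector."""
    gy, gx = np.gradient(image.astype(float))
    mag = np.hypot(gx, gy)
    ang = np.mod(np.arctan2(gy, gx), np.pi)       # orientation in [0, pi)
    H, W = image.shape
    desc = np.zeros((grid, grid, bins))
    for i in range(grid):
        for j in range(grid):
            ys = slice(i * H // grid, (i + 1) * H // grid)
            xs = slice(j * W // grid, (j + 1) * W // grid)
            hist, _ = np.histogram(ang[ys, xs], bins=bins, range=(0, np.pi),
                                   weights=mag[ys, xs])
            desc[i, j] = hist
    v = desc.ravel()
    return v / (np.linalg.norm(v) + 1e-8)

# Coarse retrieval: rank database panoramas by descriptor similarity.
db = [np.random.rand(128, 512) for _ in range(5)]    # fake panorama database
query = db[3] + 0.05 * np.random.rand(128, 512)      # noisy revisit of place 3
sims = [gist_descriptor(query) @ gist_descriptor(p) for p in db]
print(int(np.argmax(sims)))                          # expected: 3
```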
Object Recognition and semantic analysis for robotics and semantic mapping tasks. Scene Understanding.
Structure from Motion and Localization, Robust Matching, Dominant Plane Detection and Segmentation.