The 5th International Workshop on Big Data & AI Tools, Models, and Use Cases for Innovative Scientific Discovery (BTSD) 2024

Workshop Date/Time:  December 15, 2024, Washington DC, USA (Full Day Workshop)
Conference Date: December 15-18, 2024, Washington DC, USA

Call for Papers

Program Chairs

Introduction to Workshop

Big data, machine learning, artificial intelligence, and data science technologies have paved the way for numerous success stories across various fields. They offer innovative methods to integrate, reuse, and analyze extensive data volumes. These achievements have encouraged scientists in disciplines such as physics, chemistry, materials science, and medicine to investigate how these tools can enhance scientific research.

However, realizing the potential benefits comes with its set of challenges. Many of the existing software tools and systems were not designed with scientific research or the unique needs of scientists in mind. Moreover, scientists who lack programming or computer science expertise may find these tools difficult to use. Conversely, computer scientists might face hurdles in grasping domain-specific issues without adequate background knowledge.

This workshop is designed to bridge the gap between domain scientists and computer scientists. It aims to explore avenues for creating and utilizing tools, systems, and methodologies to advance scientific discovery. Participants will exchange success stories, share insights from lessons learned, and tackle the challenges that need to be addressed to foster fruitful collaborations across different fields.


The workshop will focus on the following questions:


Research Topics Included in the Workshop:

Big data tools, systems, and methods are related to, but not limited to

which facilitate innovation and discovery in scientific domains such as:

Biomedical science, and more.


Tutorials


We are organizing a demo and tutorial session for this year's workshop to broaden its reach and attract more researchers. This session will offer attendees the opportunity to showcase their work, exchange ideas, and receive hands-on training from field experts. We welcome tutorial submissions in a short-paper format. Tutorials can be either 30 minutes or 1 hour long. Topics will include, but are not limited to, scientific tools and software, AI/ML applications in science, and data processing and management for large-scale scientific data.

Program Committee Members

Paper Submission

Please submit a short  paper (minimum 4 page, up to 6 page IEEE 2-column format) or full paper (minimum 8 page, up to 10 page IEEE 2-column format) through the online submission system. Submission is single-blind review system.

https://wi-lab.com/cyberchair/2024/bigdata24/scripts/submit.php?subarea=S23&undisplay_detail=1&wh=/cyberchair/2024/bigdata24/scripts/ws_submit.php


Papers should be formatted to IEEE Computer Society Proceedings Manuscript Formatting Guidelines (see link to "formatting instructions" below). 

Formatting Instructions

8.5" x 11" (DOC, PDF

LaTex Formatting Macros

Important Dates 

Nov 8, 2024: Notification of paper acceptance to authors
(delayed from Nov 4 due to submission deadline extension & review delay, we apologize for any inconvenience;)

Accepted Papers

A Generalized Outage Prediction Model for Various Types of Extreme Climate Events in Texas

A Decision Support System to Compile Environmental Mitigations from Hydropower Licensing Documents

A Deep Learning Approach to Maximizing Electrostatic Sieve Efficiency in Regolith Beneficiation

A framework for compressing unstructured scientific data via serialization

AAD-LLM: Adaptive Anomaly Detection Using Large Language Models

Discovering Propagating Signals in High-Content Multivariate Time Series via Spatio-Temporal Subsequence Clustering

DISTRI: Development and Integration of Simulation Tools for Resilient Infrastructure

Exploration of TPU Architectures for the Optimized Transformer in Drainage Crossing Detection

LOCOS: A cosine based local gene expression pattern finding algorithm on time-series data

Model and Data Management for Machine Learning (M2ML): Integrating Instruments, Edge and HPC for Accelerated Machine Learning

Modeling Lunar Surface Charging Using Physics-Informed Neural Networks

Multivariate Data Augmentation for Predictive Maintenance using Diffusion

Multivariate Time Series Clustering for Environmental State Characterization of Ground-Based Gravitational-Wave Detectors

Privacy Preserving Federated Learning for Advanced Scientific Ecosystems

SLWM: A Library for Implementing Complex Training Workflows for surrogates of MPC’s

Toward Smart Scheduling in Tapis

Tuning the interpolation basis in a multigrid decomposition for local error control

Presentation Preparation

Registration

Workshop Primary Contact 

Organizers’ Background