Research

海大資工系AI相關研究計畫 [Link]
臺灣資安科技研究中心 [Link]

Sparse Grid Imputation Using Unpaired Imprecise Auxiliary Data

Sparse grid imputation (SGI) is a challenging problem, as its goal is to infer the values of the entire grid from a limited number of cells with values. Traditionally, the problem is solved using regression methods such as KNN and kriging, whereas in the real world, there is often extra information—usually imprecise—that can aid inference and yield better performance. In the SGI problem, in addition to the limited number of fixed grid cells with precise target domain values, there are contextual data and imprecise observations over the whole grid. To solve this problem, we propose a distribution estimation theory for the whole grid and realize the theory via the composition architecture of the Target-Embedding and the Contextual CycleGAN trained with contextual information and imprecise observations. Contextual CycleGAN is structured as two generator - discriminator pairs and uses different types of contextual loss to guide the training. We consider the real-world problem of fine-grained PM2.5 inference with realistic settings: a few (less than 1%) grid cells with precise PM2.5 data and all grid cells with contextual information concerning weather and imprecise observations from satellites and microsensors. The task is to infer reasonable values for all grid cells. As there is no ground truth for empty cells, out-of-sample mean squared error and Jensen–Shannon divergence measurements are used in the empirical study. The results show that Contextual CycleGAN supports the proposed theory and outperforms the methods used for comparison.

Extreme Event Discovery With Self-Attention for PM2.5 Anomaly Prediction

Fine particulate matter (PM2.5) values of a particular location form a time series, whose prediction is challenging due to the complicated interactions between numerous factors from meteorological measurements, terrain conditions, and industry and human habitation activities, and their predictions have attracted considerable attention from the deep learning community. Although the deep learning approach for PM2.5 prediction generally has an acceptable accuracy, it has difficulty in PM2.5 anomaly prediction, while mis-predictions prevent the authority from issuing proper instructions to reduce the impact on general health. We use extreme value theory (EVT) to formulate the PM2.5 prediction problem with a self-attention-based neural network implementation. EVT-based loss accounts for the rarity of anomalous data, and self-attention captures global information. Experiments demonstrate that the proposed model obtains an improved performance of 478% in F1 score and 286% in Matthews correlation coefficient (MCC) over the fully connected network, and 229% in F1 and 148% in MCC over the typical transformer trained with the traditional loss function.

Using satellite data on remote transportation of air pollutants for PM2.5 prediction in northern Taiwan

Accurate PM2.5 prediction is part of the fight against air pollution that helps governments to manage environmental policy. Satellite Remote sensing aerosol optical depth (AOD) pro- cessed by The Multi-Angle Implementation of Atmospheric Correlation (MAIAC) algorithm allows us to observe the transportation of remote pollutants between regions. The paper proposes a composite neural network model, the Remote Transported Pollutants (RTP) model, for such long-range pollutant transportation that predicts more accurate local PM2.5 concentrations given such satellite data. The proposed RTP model integrates several deep learning components and learns from the heterogeneous features of various domains. We also detected remote transportation pollution events (RTPEs) at two reference sites from the AOD data. Extensive experiments using real-world data show that the proposed RTP model outperforms the base model that does not account for RTPEs by 17%-30%, 23%- 26% and 18%-22% and state-of-the-art models that account for RTPEs by 12%-22%, 12%-14%, and 10%-11% at +4h to +24h, +28h to +48 hours, and +52h to +72h hours respectively.

Composite Neural Network: Theory and Application to PM2.5 Prediction

This work investigates the framework and statistical performance guarantee of the composite neural network, which is composed of a collection of pre-trained and non-instantiated neural network models connected as a rooted directed acyclic graph, for solving complicated applications. A pre-trained neural network model is generally well trained, targeted to approximate a specific function. The advantages of adopting a pre-trained model as a component in composing a complicated neural network are two-fold. One is benefiting from the intelligence and diligence of domain experts, and the other is saving effort in data acquisition as well as computing resources and time for model training. Despite a general belief that a composite neural network may perform better than any a single component, the overall performance characteristics are not clear. In this work, we propose the framework of a composite network, and prove that a composite neural network performs better than any of its pre-trained components with a high probability. In the study, we explore a complicated application—PM2.5 prediction—to support the correctness of the proposed composite network theory. In the empirical evaluations of PM2.5 prediction, the constructed composite neural network models perform better than other machine learning models.

Forwarding Path Discovery with Software Defined Networking

Without STP and LLDP, we propose a new ap- proach to find faster paths between the source and the destination and fully utilize the links between switches under the Software Defined Network paradigm. To find round-trip path with lighter load, we design a mechanism to compose special ARP packets for exploring the routing path and avoid broadcast storm by dropping late or duplicate packets. Meanwhile, the controller can carefully regulate other packets without affecting the behavior of the special packets. The evaluation results with the Clos- like network topology show that our approach can be used to improve the performance of transmission throughput. We believe our method is very suitable, especially, for a dynamic network environment, where links are up and down frequently. Our implementation can be seen as an instance showing the flexibility of SDN.

Detecting Amplification Attacks with Software Defined Networking

Distributed denial of service (DDoS) is an attack that attempts to disrupt network service for various malicious purposes. It makes use of public services as reflectors to amplify the traffic, and thus called distributed reflection denial of service attacks. This type of attack forges source IP address, and makes it hard to filter the problematic packets. With Software Defined Networking (SDN) and machine learning techniques, we implement a system to detect DRDoS packets and block the amplification attacks automatically. DNS and NTP amplifications are two typical attacks of DDoS. By analyzing the traffic features, although our classifier is trained only for the DNS amplification attack, our system can identify and then block both DNS and NTP amplification attacks with great accuracy.

Degree Lower Bounds of Polynomials for Boolean Functions

Restrictions of Nondegenerate Boolean Functions and Degree Lower Bounds over Different Rings

Page updated

Google Sites

Report abuse