RQ1
Experiment Setup
We choose the classical ResNet, as the subject model architecture. In addition, we train the models on the CIFAR-10 dataset until the model
performance converges. we design an experiment to observe the semantic architecture of a deep classifier by controlling three variables.
Model Size: ResNet-14 ResNet-44 ResNet-86 ResNet-110 ResNet-152
Dataset Size: 6000 14000 30000 46000 60000
Task Size: 2-classification 4-classification 6-classification 8-classification 10-classification
With a total of 125 (5*5*5) configurations, we train each configuration(model) (i.e., architecture-dataset-task combination) 10 times. There will be 1250(125*10) models. The hyperparameters' details are as follows:
batch size: 128 learning rate: 0.01 weight_decay: 0.005 epochs: 300 initializer: he_normal
Results
Univariate linear regression analysis:
The x-axis respectively represents the value of three variables and y-axis represents the modularity metrics.
Module distribution:
The x-axis represents the location of modules in the models and the y-axis represents the module size (the number of layers in a module). Vertical red and blue lines reflect boundaries between ResNet stages, where the feature dimension changes.
Code Demo
https://github.com/hnurxn/Deep-Arc/RQ1