While many different types of distributions exist (e.g., normal, binomial, Poisson), working with SEM generally only needs to distinguish normal from non-normal distributions.
Analysis using AMOS is a parametric statistical method. Different from PLS-SEM, the maximum likelihood (ML)-based CB-SEM requires the data to be normally distributed.
Refers to the shape of data distribution for individual variable.
If variation from normal distribution is large, all resulting statistical tests are invalid, because F & t-statistics assume normality (Hair et al., 2010).
Normality can have serious effects in small samples (<50), but the impact effectively diminishes when sample sizes > 200.
Histogram: Compare the observed data values with a distribution approximating normal distribution.
Normal Probability Plot: Compare the cumulative distribution of actual data values with the cumulative distribution of a normal distribution.
Skewness and Kurtosis Statistics.
Shapiro-Wilks (sample < 2000).
Kolmogorov-Smirnov (sample > 2000).
The degree of symmetry in the variable distribution.
Threshold:
-2 ≤ skewness ≤ 2 (Curran et al., 1996; West et al., 1995; Gliselli et al., 1981).
-1 ≤ skewness ≤ 1 (Byrne, 2016; Awang, 2015)
Critical Ratio (CR) of skewness should be > 3 (Kline, 2015, Latan et al., 2020).
Perfectly symmetrical distribution
The degree peakedness/flatness in the variable distribution.
Threshold:
-7 ≤ Kurtosis ≤ 7 (Curran et al., 1996; West et al., 1995).
-3 ≤ Kurtosis ≤ 3 (Awang, 2015)
CR of kurtosis should be <10 (Kline, 2015, Latan et al., 2020).
Normal distribution
Mesokurtic Distribution
High degree of peakness
Leptokurtic Distribution
Low degree of peakness
Platykurtic distribution
The multivariate normal distribution is a generalization of the one-dimensional (univariate) normal distribution to higher dimensions.
To test for multivariate normality, please click here.
The expected Mardia’s skewness is 0 for a multivariate normal distribution and higher values indicate a more severe departure from normality.
According to Bentler (2005) and Byrne (2016), the critical ratio value of multivariate kurtosis should be less than 5.0 to indicate a multivariate normal distribution.
The normality can be assessed by assessing the measure of skewness for every item. The absolute value of skewness 1.0 or lower indicates the data is normally distributed. However, SEM using the Maximum Likelihood Estimator (MLE) like AMOS is fairly robust to skewness greater than 1.0 in absolute value if the sample size is large. Meaning, the researcher could proceed into further analysis (SEM) since the estimator used is MLE. Normally, the sample size greater than 200 is considered large enough in MLE even though the data distribution is slightly non-normal.
Another method for normality assessment is by looking at the multivariate kurtosis statistic. However, SEM using MLE is also robust to kurtotic violations of multivariate normality as long the sample size is large.
Check and remove outlier cases.
Remove non-normal item from the model.
Bootstrapping (i.e., re-sampling process in the existing data-set with replacement).
Distinctly different observation from the others.
Examines distribution of observations for each variable and selects as outliers those cases falling at the outer ranges (high or low) of the distribution.
Relates individual independent variable with individual dependent variable.
Evaluates the position of each observation compared with the center of all observations on a set of variable.
To test for multivariate outliers, Hair et al. (2010) and Byrne (2010) suggested to identify the extreme score on two or more constructs by using Mahalanobis distance (Mahalanobis D2). It evaluates the position of a particular case from the centroid of the remaining cases. Centroid is defined as the point created by the means of all the variables (Tabachnick & Fidell, 2007).
Based on a rule of thumb, the maximum Mahalanobis distance should not exceed the critical chi-square value, given the number of predictors as degree of freedom. Otherwise, the data may contain multivariate outliers (Hair, Tatham, Anderson, & Black, 1998)
Awang, Z. (2015). SEM Made Simple: A Gentle Approach to Learning Structural Equation Modeling. MPWS Rich Publication.
Curran, P. J., West, S. G., & Finch, J. F. (1996). The robustness of test statistics to nonnormality and specification error in confirmatory factor analysis. Psychological Methods, 1(1), 16-29. https://doi.org/10.1037/1082-989X.1.1.16
Kline, R. B. (2015). Principles and practice of structural equation modeling. Guilford publications.
Latan, H., Jabbour, C. J. C., de Sousa Jabbour, A. B. L., de Camargo Fiorini, P., & Foropon, C. (2020). Innovative efforts of ISO 9001-certified manufacturing firms: Evidence of links between determinants of innovation, continuous innovation and firm performance. International Journal of Production Economics, 223, 107526.
West, S. G., Finch, J. F., & Curran, P. J. (1995). Structural equation models with nonnormal variables: Problems and remedies. In R. H. Hoyle (Ed.), Structural equation modeling: Concepts, issues, and applications (pp. 56-75). Sage Publications, Inc.