SEMi-Complete by Design

A Monte Carlo Simulation to Assess Measurement Invariance in Moderated Nonlinear Factor Analysis and SEM Trees

Leonie Hagitte

Why Care About Measurement Invariance?

Measurement invariance (MI) is a prerequisite for meaningful comparisons of latent constructs across groups (Cheung and Rensvold 2000; Meredith 1964).

Violations of MI imply that observed group differences may reflect measurement artifacts rather than substantive differences in the underlying constructs (Putnick and Bornstein 2016).

Without MI, conclusions about group differences in means, variances, or relations are not statistically justified (Putnick and Bornstein 2016).

Why Care About Measurement Invariance?

In empirical practice, MI is often assumed rather than systematically evaluated.

Common practice

Reporting quality

Recent re-analyses indicate that only 26% of tested models achieved scalar invariance, while MI failed completely in 58% of cases; in roughly 50% of configural failures, the underlying factor structure differed between groups (Maassen et al. 2025).

The validity of many reported group comparisons therefore remains unclear.

MNLFA (Moderated Nonlinear Factor Analysis)

Moderated Nonlinear Factor Analysis (MNLFA) extends traditional factor models by allowing measurement and latent-variable parameters to vary continuously as functions of observed covariates (Kolbe et al. 2024; Curran et al. 2014).

In this framework, measurement invariance is conceptualized as the moderation of model parameters by an external variable.

Parameters that may vary

  • Item parameters: loadings, intercepts
  • Factor means
  • Factor variances

Moderators

  • Continuous covariates
  • Categorical covariates
  • Examples: age, gender, SES

MNLFA therefore models measurement noninvariance directly, rather than evaluating it only through sequential group-based tests (Kolbe, Jorgensen, and Molenaar 2021; Curran et al. 2014).

Advantages and Limitations of MNLFA

Advantages

  • Handles both categorical and continuous moderators
  • Avoids arbitrary group discretization
  • Provides fine-grained modeling of parameter variation
  • Is suited for nonlinear relationships

Limitations

  • High model complexity
  • Strong distributional and functional-form assumptions
  • Requires large sample sizes
  • Requires careful model specification

MNLFA is most appropriate when theory suggests systematic moderation of measurement parameters.

SEM Trees

Structural Equation Model (SEM) Trees combine confirmatory structural equation modeling with recursive partitioning to detect parameter heterogeneity across subgroups (Brandmaier et al. 2013, 2016; Arnold, Voelkle, and Brandmaier 2021).

SEM Trees identify subgroups in which model parameters differ by recursively splitting the sample based on covariates.

Core mechanism

  • Score-based tests for parameter instability
  • Recursive binary partitioning
  • Tree-structured subgroup identification

What is detected

  • Heterogeneity in measurement parameters
  • Heterogeneity in structural parameters
  • Data-driven detection of noninvariance

SEM Trees are particularly useful when relevant grouping variables are unknown, continuous, or high-dimensional.

Advantages and Limitations of SEM Trees

Advantages

  • Flexible, exploratory detection of noninvariance
  • Can capture complex interaction structures
  • Transparent and interpretable subgroup definitions
  • Does not require pre-specified grouping variables

Limitations

  • Risk of overfitting without pruning or cross-validation
  • Results depend on splitting criteria and stopping rules
  • Less suitable for strictly confirmatory hypothesis testing
  • Functional form of moderation is not explicitly parameterized

SEM Trees trade parametric efficiency for flexibility and are most informative in exploratory or misspecified settings.

From Methods to Comparison

MNLFA and SEM Trees represent fundamentally different approaches to detecting measurement noninvariance.

MNLFA

  • Parametric
  • Requires specification of functional form
  • Efficient under correct model specification

SEM Trees

  • Nonparametric
  • No functional-form assumptions
  • Flexible under misspecification

The relative performance of both approaches depends on the alignment between the data-generating process and the assumptions of the analysis model.

Simulation Framework

A Monte Carlo simulation is conducted to compare MNLFA and SEM Trees under varying forms of measurement noninvariance.

Design features

  • Parametric data generation
  • Multiple population models
  • Single-factor and two-factor CFA structures

Indicator structure

  • Single-factor model: four manifest indicators
  • Two-factor model: one factor with four indicators, one factor with three indicators

Single-factor model

\[ \begin{aligned} \mathbf{x} &= \boldsymbol{\nu}(Z) + \boldsymbol{\Lambda}_x(Z)\,\eta + \boldsymbol{\varepsilon}, \\ \boldsymbol{\varepsilon} &\sim \mathcal{N}(\mathbf{0}, \boldsymbol{\Theta}_\varepsilon) \end{aligned} \]

Two-factor model

\[ \mathbf{x} = \boldsymbol{\nu}_x(Z) + \boldsymbol{\Lambda}_x(Z)\,\eta_1 + \boldsymbol{\varepsilon}(Z) \]

\[ \mathbf{y} = \boldsymbol{\nu}_y(Z) + \boldsymbol{\Lambda}_y(Z)\,\eta_2 + \boldsymbol{\delta}(Z) \]

Simulation: Moderator Functions

Let \(M \sim \mathcal{U}(-1,1)\) denote a bounded continuous covariate.

Moderation is introduced via transformations of \(M\):

Linear

\[ h_1(M) = M \]

Quadratic

\[ h_2(M) = 2M^2 - 1 \]

Sigmoid

\[ h_3(M) = a + \frac{b-a}{1 + \exp\!\bigl(-k(M-c)\bigr)} \]

Noise

No systematic relationship with model parameters.

Each transformed variable enters the model as a separate moderator, allowing controlled variation in the functional form of parameter moderation.

Simulation

Analytical Model Study 1

Highlighted components indicate parameters subject to moderation. Analysis is conducted sequentially.

Study 1: Design Factors

Study 1 examines the impact of functional-form misspecification under controlled data-generating conditions without structural misspecification.

Data-generating process

  • Moderator form: linear, sigmoid, quadratic, or noise
  • Population models: 8 distinct model conditions
  • Loading moderation: \(\Delta_\lambda \in \{-0.3, -0.2, 0.2, 0.3\}\)
  • Intercept moderation: \(\Delta_\nu \in \{-1, -0.5, 0.5, 1\}\)
  • Indicator reliability: \(.60, .70, .80, .95\)

Analysis and sampling

  • MNLFA: linear moderation
  • SEMTREE: recursive partitioning
  • Sample size: \(N \in \{300, 500, 700, 1000\}\)

Functional-form misspecification arises when nonlinear data-generating mechanisms are analyzed under linear moderation assumptions in MNLFA, while SEM Trees impose no parametric form.

Trial Run: Data-Generating Model

The pilot isolates a simple single-factor setting in order to assess initial type I error and power patterns before scaling the full simulation design.

Measurement model

Single-factor CFA with one latent factor \(\eta\) and four indicators \(y_1\)\(y_4\)

Moderation structure

  • \(M_1\): primary moderator
  • \(M_2\): secondary moderator
  • \(M_0\): noise covariate

Functional forms

  • Linear: \(h(M)=M\)
  • Quadratic: \(h(M)=2M^2-1\)
  • Noise: null condition

Baseline parameters

  • Factor loadings: \(\lambda = 0.70\)
  • Indicator reliability: \(0.60,\;0.95\)

Population conditions

  • Null condition
  • Noninvariance condition:
    • moderated loadings: \(x_1, x_2\)
    • moderated intercepts: \(x_1, x_2\)

Trial-run settings

  • \(\Delta_\lambda \in \{-0.3, 0.3\}\)
  • \(\Delta_\nu \in \{-1, 1\}\)
  • Sample size: \(N \in \{300,\;1000\}\)
  • Replications: pilot run (e.g., 5–15 seeds per condition)

Trial Run: Analytical Models

SEM Trees (nonparametric)

Model - Single-factor CFA (RAM specification)

Predictors

  • \(m_1\), \(m_2\), \(m_0\)

Estimation procedure

  • Score-based recursive partitioning
  • Bonferroni-corrected split tests
  • Pre-pruning:
    • \(\alpha = .05\)
    • max depth = 3
    • minimum node size = 50

Evaluation

  • Global tree test
  • Root split detection
  • Type I error (splits under null)
  • Power (splits under true moderation)

MNLFA (parametric)

Model - Moderation via definition variables (OpenMx) - Loadings: \(\Lambda(Z)\)
- Intercepts: \(\nu(Z)\)

Estimation procedure

  • Sequential invariance testing:
    1. Configural model
    2. Metric model
    3. Scalar model

Evaluation

  • Likelihood-ratio tests (LRT)
  • \(\Delta\)CFI, \(\Delta\)RMSEA

Key assumption

  • Moderation is linear in the analysis model

Preliminary Results

Overall Results

Level Method Type Value
Metric MNLFA Power 0.619
Metric MNLFA Type I 0.138
Metric SEMTREE Power 0.611
Metric SEMTREE Type I 0.075
Scalar MNLFA Power 0.459
Scalar MNLFA Type I 0.150
Scalar SEMTREE Power 0.683
Scalar SEMTREE Type I 0.050

MNLFA shows elevated Type I error, whereas SEM Trees provide better error control and stronger scalar-level power under these pilot conditions.

Preliminary Results

Metric Level MI Testing

Scalar Level MI Testing

At the metric stage, MNLFA Type I error decreases, reaching 0.050 only when both sample size and reliability are high, whereas SEM Trees remain comparatively stable. At the scalar stage, SEM Trees retain high power under quadratic moderation, while MNLFA power drops markedly.

Questions & Concerns


  • How do I ensure that the comparison is fair, the best?
  • What Prepruning to use for enhancing power of the trees?
  • When do tree splits count as correct?
  • What alternative can i use other than significance testing for model fit evaluation?
  • Did I forget something in the simulation set-up?

Preliminary Results

Type I Error and Power Metric Stage

N Reliability Moderator MNLFA Type I MNLFA Power SEMTREE Type I SEMTREE Power
300 0.60 linear 0.200 1.000 0.050 0.988
300 0.60 noise 0.200 0.200 0.050 0.050
300 0.60 quadratic 0.200 0.740 0.050 0.575
300 0.95 linear 0.150 1.000 0.050 1.000
300 0.95 noise 0.150 0.150 0.050 0.050
300 0.95 quadratic 0.150 0.755 0.050 0.750
1000 0.60 linear 0.150 1.000 0.100 1.000
1000 0.60 noise 0.150 0.150 0.100 0.100
1000 0.60 quadratic 0.150 0.862 0.100 0.725
1000 0.95 linear 0.050 1.000 0.100 1.000
1000 0.95 noise 0.050 0.050 0.100 0.100
1000 0.95 quadratic 0.050 0.792 0.100 1.000

Preliminary Results

Type I Error and Power Scalar Stage

N Reliability Moderator MNLFA Type I MNLFA Power SEMTREE Type I SEMTREE Power
300 0.60 linear 0.100 1.000 0.100 1.000
300 0.60 noise 0.100 0.100 0.100 0.100
300 0.60 quadratic 0.100 0.273 0.100 1.000
300 0.95 linear 0.100 1.000 0.100 1.000
300 0.95 noise 0.100 0.100 0.100 0.100
300 0.95 quadratic 0.100 0.245 0.100 1.000
1000 0.60 linear 0.200 1.000 0.000 1.000
1000 0.60 noise 0.200 0.200 0.000 0.000
1000 0.60 quadratic 0.200 0.175 0.000 1.000
1000 0.95 linear 0.200 1.000 0.000 1.000
1000 0.95 noise 0.200 0.200 0.000 0.000
1000 0.95 quadratic 0.200 0.417 0.000 1.000

Preliminary Interpretation

MNLFA

  • High power for detecting metric noninvariance
  • Elevated Type I error at both levels
  • Reduced power at the scalar level
  • Sensitive to functional-form misspecification
    • Linear analysis vs. nonlinear data-generating processes
  • Sequential testing may propagate misspecification

SEM Trees

  • More stable Type I error control across conditions
  • Competitive power, particularly for scalar noninvariance
  • Performance depends on:
    • Sample size
    • Reliability
    • Pruning / stopping rules
  • May be conservative in low-signal settings

MNLFA is efficient under correct specification but sensitive to misspecification, whereas SEM Trees provide more robust error control at the cost of potential conservatism.

References

Arnold, M., M. C. Voelkle, and A. M. Brandmaier. 2021. “Score-Guided Structural Equation Model Trees.” Frontiers in Psychology 11: 3913. https://doi.org/10.3389/fpsyg.2020.564403.
Brandmaier, A. M., T. von Oertzen, J. J. McArdle, and U. Lindenberger. 2013. “Structural Equation Model Trees.” Psychological Methods 18 (1): 71–86. https://doi.org/10.1037/a0030001.
Brandmaier, A. M., J. J. Prindle, J. J. McArdle, and U. Lindenberger. 2016. “Theory-Guided Exploration with Structural Equation Model Forests.” Psychological Methods 21 (4): 566–82. https://doi.org/10.1037/met0000090.
Cheung, Gordon W., and Roger B. Rensvold. 2000. “Assessing Extreme and Acquiescence Response Sets in Cross-Cultural Research Using Structural Equations Modeling.” Journal of Cross-Cultural Psychology 31 (2): 187–212. https://doi.org/10.1177/0022022100031002003.
Curran, Patrick J., James S. McGinley, Daniel J. Bauer, Andrea M. Hussong, Alison Burns, Laurie Chassin, Kenneth Sher, and Robert Zucker. 2014. “A Moderated Nonlinear Factor Model for the Development of Commensurate Measures in Integrative Data Analysis.” Multivariate Behavioral Research 49 (3): 214–31. https://doi.org/10.1080/00273171.2014.889594.
Kolbe, Laura, Terrence D. Jorgensen, and Dylan Molenaar. 2021. “The Impact of Unmodeled Heteroskedasticity on Assessing Measurement Invariance in Single-Group Models.” Structural Equation Modeling: A Multidisciplinary Journal 28 (1): 82–98. https://doi.org/10.1080/10705511.2020.1766357.
Kolbe, Laura, Dylan Molenaar, Suzanne Jak, and Terrence D. Jorgensen. 2024. Assessing Measurement Invariance with Moderated Nonlinear Factor Analysis Using the R Package OpenMx.” Psychological Methods 29 (2): 388–406. https://doi.org/10.1037/met0000501.
Maassen, Esther, E. Damiano D’Urso, Marcel A. L. M. van Assen, Michèle B. Nuijten, Kim De Roover, and Jelte M. Wicherts. 2025. The Dire Disregard of Measurement Invariance Testing in Psychological Science.” Psychological Methods 30 (5): 966–79. https://doi.org/10.1037/met0000624.
Meredith, William. 1964. “Notes on Factorial Invariance.” Psychometrika 29 (2): 177–85. https://doi.org/10.1007/BF02289699.
Putnick, Diane L., and Marc H. Bornstein. 2016. “Measurement Invariance Conventions and Reporting: The State of the Art and Future Directions for Psychological Research.” Developmental Review 41: 71–90. https://doi.org/https://doi.org/10.1016/j.dr.2016.06.004.
Rohrer, Julia M., and Borysław Paulewicz. 2025. “Rethinking Measurement Invariance Causally.” Current Research in Ecological and Social Psychology 9: 100241. https://doi.org/10.1016/j.cresp.2025.100241.