What is PeopleAnalyst?

PeopleAnalyst is the front door for people-analytics research: 205+ works indexed and profiled, 40+ citation-grade findings extracted, and peer-reviewed behavioral science translated from academic to actionable — the missing manual for the people analytics you always meant to do.

What is people analytics?

People analytics is not a dashboard. It is behavioral science and statistical inference applied to workforce decisions — a discipline with its own methodology, spanning measurement, organizational design, talent, leadership, and analytics craft.

Why does AI in HR need measurement science?

AI is being deployed in high-stakes people decisions — hiring, performance, attrition — without the measurement science to evaluate whether it works or whom it harms. Construct validity, effect sizes, and criterion validity are the vocabulary for asking an AI vendor the right questions.

How is the research made accessible?

The evidence is indexed and searchable: 205+ works, 40+ citation-grade insight cards, and 8 research arcs, so the right finding reaches the right decision at the right time.

What separates good people measurement from assertion?

Good measurement has a method: construct validity, reliability, and effect-size interpretation are not optional — they are what separates evidence from assertion.

library / libee190f163e45fc1d

Sem Principles Practice Kline

In a sentence

A practical and accessible guide for researchers and students on the principles, assumptions, and application of Structural Equation Modeling (SEM) without requiring an extensive quantitative background.

This book serves as an accessible and comprehensive guide to the powerful statistical technique of Structural Equation Modeling (SEM). Written for researchers and students who may not have advanced quantitative training, it breaks down complex concepts into understandable principles using words and figures rather than dense matrix algebra. The book covers core SEM techniques like path analysis and confirmatory factor analysis, as well as more advanced topics such as latent growth models and multiple-sample analyses. With numerous real-world examples from various social sciences, practical advice on using popular SEM software, and a focus on avoiding common pitfalls, this book equips readers with the essential skills to confidently apply SEM in their own research, fostering a more disciplined and thoughtful approach to statistical modeling.

The four lenses

Science
Statistics
Systems
Strategy

The model

This model outlines the implicit framework presented in the book, where rigorous methodological and theoretical practices in model specification, data preparation, and analysis lead to better statistical outcomes (fit, admissible estimates), which in turn produce more valid, replicable, and theoretically meaningful scientific inferences about a phenomenon.

Theoretical Grounding of Specificationdesign lever

The extent to which the specified model is based on established theory, prior research, and substantive knowledge, rather than being arbitrary or purely data-driven. This includes a clear rationale for all paths and their directions.

Data Preparation and Screening Rigordesign lever

The thoroughness of examining data for accuracy, missing values, outliers, distributional assumptions (e.g., normality, linearity), multicollinearity, and other potential problems before conducting the primary analysis.

Psychometric Quality of Indicatorsdesign lever

The established reliability and validity of the observed variables (indicators) used to measure the latent constructs in the model. This includes using multiple indicators per construct to account for measurement error.

Model Identification Correctnesscontextual condition

The degree to which the model is structured such that a unique estimate for every parameter can be mathematically derived. This is a necessary logical condition that must be met prior to attempting statistical estimation.

Analytic Rigorbehavioral pattern

The appropriate application of estimation methods based on data properties, correct and theoretically-guided model respecification strategies, and the systematic consideration of plausible alternative and equivalent models.

Statistical Model Fitoutcome metric

A statistical summary of the discrepancy between the model-implied covariance matrix (and means, if applicable) and the observed sample covariance matrix. It reflects the overall consistency of the model with the data.

Admissibility and Plausibility of Estimatesoutcome metric

The extent to which the model yields parameter estimates that are logically possible (e.g., positive variances, correlations within +/- 1.0) and theoretically sensible (e.g., direction and magnitude of effects are plausible).

Validity of Inferenceoutcome metric

The degree of confidence in the substantive conclusions drawn from the model, including claims about causal relationships, the nature of constructs, and broader theoretical implications. This goes beyond statistical fit to include theoretical coherence.

Replicability of Findingsoutcome metric

The likelihood that the model structure, its fit to the data, and its estimated parameters will be successfully reproduced in an independent sample from the same population, indicating the stability and generalizability of the results.

Theoretical Advancementoutcome metric

The extent to which the research findings, derived from the SEM analysis, contribute to the cumulative knowledge and refinement of theory in a given domain by confirming, challenging, or extending existing theoretical frameworks.

How they connect

theoretical grounding of specification → influences analytic rigor
data preparation and screening rigor → influences admissibility and plausibility of estimates
psychometric quality of indicators → influences admissibility and plausibility of estimates
model identification correctness → influences analytic rigor
analytic rigor → influences statistical model fit
analytic rigor → influences validity of inference
statistical model fit → influences validity of inference
admissibility and plausibility of estimates → influences validity of inference
validity of inference → predicts replicability of findings
validity of inference → predicts theoretical advancement

A candidate measure

Sem Principles Practice Kline — derived measurement candidates

Theoretical Grounding of Specification

Qualitative rating (e.g., 1-5 scale) of the strength of the theoretical rationale presented in a research paper.; Count of hypothesized paths that are explicitly justified by theory or prior findings.; A binary check for the presence of a formal a priori model statement.

self-report suitability: none

Data Preparation and Screening Rigor

A checklist of key data screening steps (e.g., normality check, outlier check, missing data analysis) reported in the methods section.; Qualitative rating of the thoroughness of the reported data screening process.

self-report suitability: none

Psychometric Quality of Indicators

Average reported reliability coefficient (e.g., Cronbach's alpha) for all measures used.; Average number of indicators per latent variable in the model.; Binary check for whether measures are established instruments versus ad-hoc.

self-report suitability: none

Model Identification Correctness

A binary judgment (identified/not identified) based on an audit of the model structure.; Successful passage of empirical tests for identification (e.g., re-running with different start values).

self-report suitability: none

Analytic Rigor

Binary check for whether alternative or equivalent models were discussed.; Qualitative rating of the justification provided for model respecifications.; Binary check for whether the two-step modeling approach (CFA then SR model) was used.

self-report suitability: none

Statistical Model Fit

Value of the model chi-square and its p-value.; Value of the Comparative Fit Index (CFI).; Value of the Root Mean Square Error of Approximation (RMSEA).; Value of the Standardized Root Mean Square Residual (SRMR).

self-report suitability: none

Admissibility and Plausibility of Estimates

Binary check for the presence of Heywood cases (e.g., negative error variances).; Binary check for the presence of correlations > |1.0|.; Qualitative judgment of the theoretical plausibility of the final parameter estimates.

self-report suitability: none

Validity of Inference

Qualitative rating of the strength and appropriateness of the conclusions drawn in a research paper.; Count of causal claims versus associational claims.; Binary check for discussion of alternative theoretical interpretations.

self-report suitability: none

Replicability of Findings

Binary outcome (successful/unsuccessful) of a formal replication study.; Comparison of parameter estimates and fit indices between an original study and a replication.; Number of subsequent studies that find similar results.

self-report suitability: none

Theoretical Advancement

Citation count of a research paper in subsequent literature.; Qualitative assessment of a paper's influence based on review articles and textbooks.; Number of subsequent studies that build directly upon the paper's findings.

self-report suitability: none

Run the assessment

The story

The reader Researchers, graduate students, and applied statisticians in the social and behavioral sciences who want to answer complex research questions involving relationships among multiple variables. They want to move beyond traditional methods like regression and ANOVA to test comprehensive theoretical models that include latent constructs and measurement error.

External problem

Traditional statistical methods are limited; they can't easily test complex theoretical models, handle latent variables, or simultaneously model measurement error and structural relationships.

Internal problem

They feel frustrated and limited by their current statistical toolkit, intimidated by the perceived complexity and mathematical opacity of SEM, and uncertain about how to correctly apply these powerful techniques without making critical errors.

Philosophical problem

It's wrong that researchers should be held back from testing their sophisticated theories simply because the statistical methods seem inaccessible or are presented in overly mathematical ways. Good research requires tools that can match the complexity of the questions being asked.

The plan

Master fundamental statistical concepts such as correlation, regression, and data screening.
Learn the core principles and steps of SEM: specification, identification, estimation, and assessment.
Apply core techniques, starting with Path Analysis, then Confirmatory Factor Analysis, and finally integrated Structural Regression models.
Explore advanced techniques like mean structures, latent growth models, and multiple-sample analysis.
Internalize best practices and learn how to avoid common mistakes in application and interpretation.

Success

The reader can confidently specify, test, and interpret a wide range of structural equation models.
They can critically evaluate SEM research in their field, avoid common analytical and interpretative errors, and publish more sophisticated and robust research that truly tests their theoretical ideas.
Their research 'thinks' the way they do, allowing for a seamless transition from theoretical model to statistical test.

At stake

The reader remains limited by basic statistical techniques, unable to properly test their complex models.
They may misapply SEM, leading to flawed conclusions, rejected manuscripts, and a feeling of being left behind by modern quantitative methods.
They risk fooling themselves with statistically significant but meaningless or incorrect results.

Chapter by chapter

ch01Introduction
This chapter lays the groundwork for understanding Structural Equation Modeling (SEM), outlining its framework, terminologies, and significance in data analysis while mapping the structure of the book.
- SEM provides a nuanced framework to model complex relationships that traditional methods may overlook.
- Establishing clear notation is crucial for effective communication and understanding of SEM principles.
- Computer programs tailored for SEM can enhance your ability to navigate and analyze intricate datasets.
- Given the importance of social factors, incorporating family values into your models can offer deeper insights into behavioral patterns.
ch02Basic Statistical Concepts: I. Correlation and Regression
This chapter provides a comprehensive overview of correlation and regression, emphasizing their significance in understanding relationships between variables and informing statistical analyses.
ch03Basic Statistical Concepts: II. Data Preparation and Screening
Effective data preparation and screening are crucial for ensuring the reliability and validity of statistical analyses, which directly impacts the conclusions drawn from research data.
ch04Core SEM Techniques and Software
This chapter provides a comprehensive overview of core Structural Equation Modeling (SEM) techniques, including path analysis and confirmatory factor analysis, detailing how these methods can solve complex research questions in social science.
ch05p01Introduction to Path Analysis (part 1/3)
This chapter introduces the foundational concepts of path analysis, clarifying the distinctions between correlation and causation, and how various path models can be specified for research purposes.
ch05p02Introduction to Path Analysis (part 2/3)
This chapter explores the intricacies of path analysis (PA), focusing on the method's fundamentals, including variable specification, measurement issues, and the challenges in establishing causal relationships among observed variables.
- Path analysis provides valuable insights into causal relationships, despite challenges in establishing causation based merely on observed correlations.
- Model specification is crucial, as omitting relevant variables can lead to biased estimates of causal effects.
- The assumption of measurement error-free variables for exogenous measures is fundamental yet often unrealistic; researchers must accept the trade-offs involved.
- Thoughtful consideration of both directionality and potential confounding variables is necessary for credible conclusions from path models.
ch05p03Introduction to Path Analysis (part 3/3)
This chapter discusses the principles of path analysis, focusing on model parameters, types of path models, and the challenges of identification for correct model specification.
- A path model must not have more parameters than observations to ensure it can produce unique estimates.
- Properly distinguishing between free, fixed, and constrained parameters is essential for model clarity and effectiveness.
- Nonrecursive models necessitate careful analysis due to the complexity of feedback loops and correlated disturbances.
- Identification is a critical property of models; without it, analytical efforts are likely to fail.
ch06Introduction to Path Analysis
This chapter introduces path analysis as a powerful statistical technique for exploring causal relationships between variables, outlining its requirements and providing foundational knowledge essential for application.
- Path analysis provides a sophisticated approach to exploring causal relationships, grounded in observed variables and supported by rigorous theoretical frameworks.
- Both just-identified and overidentified models have distinct implications for hypothesis testing and data fitting, making understanding their nuances crucial.
- The choice of estimation method significantly influences the quality of analysis, with maximum likelihood estimation being the preferred and most widely applicable approach.
- Sample size and complexity interaction critically dictate the reliability of estimation; aim for high ratios of cases to parameters to ensure statistical stability.
ch07Details of Path Analysis
This chapter explores the intricate methods involved in path analysis, emphasizing how to interpret parameter estimates, assess effects, and evaluate model fit within structural equation modeling.
- Effective path analysis hinges on correct interpretations of parameter estimates to uncover relationships between health determinants.
- Achieving homogeneity among variances is crucial for the success of path analysis in addressing ill-scaled data challenges.
- A robust understanding of indirect and total effects in path analysis enhances insights into variable interactions.
- Employing a diverse array of fit indices offers a well-rounded assessment of model adequacy, underscoring the importance of statistical rigor.
ch08Measurement Models and Confirmatory Factor Analysis
This chapter examines the application of Confirmatory Factor Analysis (CFA) in structural equation modeling to identify measurement models that accurately reflect the relationships among observed indicators and latent variables, emphasizing the need for a robust model specification process that mitigates measurement errors.
- Implementing multiple indicators per construct enhances research validity by reducing measurement error.
- The distinction between unidimensional and multidimensional measurement must be made with theoretical substantiation to ensure accurate model specifications.
- CFA models require careful identification, and truly understanding the nature of relationships among indicators is paramount for establishing empirical soundness.
- The success of measurement models hinges on substantive principles, placing ethical responsibility on researchers to avoid misrepresentation in analyses.
ch09Models with Structural and Measurement Components
This chapter explores structural regression (SR) models, emphasizing their capacity to integrate path analysis and measurement models in assessing both structural and measurement relationships in data.
- Structural regression models combine elements of path analysis and confirmatory factor analysis, allowing for a more comprehensive evaluation of data involving latent variables.
- Valid measurement models are prerequisites for conducting reliable structural analyses; erroneous measurement can obscure true relationships.
- The two-step modeling approach facilitates a systematic validation process, ensuring the integrity of both measurement and structural components.
- Four-step modeling serves as an advanced method for diagnosing areas within SR models that require specification adjustments.
ch10Nonrecursive Structural Models
This chapter explores nonrecursive structural models, emphasizing their complexity due to feedback loops and the nuanced identification challenges they present compared to recursive models.
- Nonrecursive structural models provide a more nuanced understanding of complex interpersonal dynamics, allowing for mutual causal influences.
- Identification issues inherent in nonrecursive models can lead to erroneous interpretations if not properly addressed through systematic evaluation of the order and rank conditions.
- Utilizing supplementary estimation techniques, such as two-stage least squares, can equip researchers to overcome common pitfalls associated with nonrecursive analysis.
- A robust theoretical grounding is necessary when specifying models, as arbitrary changes for identification can compromise the integrity of causal claims.
ch11Mean Structures and Latent Growth Models
This chapter underscores the significance of incorporating mean structures in Structural Equation Modeling (SEM) and introduces latent growth models (LGMs) to analyze longitudinal data effectively.
- Incorporating mean structures into SEM reveals the full breadth of data analysis capabilities and enhances understanding of latent variable relationships.
- Neglecting means can lead to misinterpretations and fundamentally flawed conclusions in behavioral studies.
- Latent growth models offer a robust framework for analyzing change over time and capturing individual trajectories.
- The precision gained from integrating means into statistical models significantly improves the relevance and applicability of research findings.
ch13How to Fool Yourself with SEM
Structural Equation Modeling (SEM) is a powerful analytical tool, but its misuse can lead to serious misinterpretations of data, often stemming from incorrect specifications, data handling, analysis techniques, and interpretations.
- Specification is the most critical phase in the SEM process; errors made here can have cascading negative effects on the entire model.
- Relying solely on statistical measures for model validation is a significant error; substantive theoretical backing is equally crucial.
- The four categories of SEM pitfalls serve as a practical checklist to ensure thoroughness in data analysis.
- Measurement error can severely undermine the validity of results; psychometrically rigorous indicators are essential.
ch14Other Horizons
This chapter concludes the exposition on Structural Equation Modeling (SEM) by introducing advanced topics such as interaction and curvilinear effects, as well as multilevel SEM, while also directing readers towards further resources for deeper exploration.