What is PeopleAnalyst?

PeopleAnalyst is the front door for people-analytics research: 205+ works indexed and profiled, 40+ citation-grade findings extracted, and peer-reviewed behavioral science translated from academic to actionable — the missing manual for the people analytics you always meant to do.

What is people analytics?

People analytics is not a dashboard. It is behavioral science and statistical inference applied to workforce decisions — a discipline with its own methodology, spanning measurement, organizational design, talent, leadership, and analytics craft.

Why does AI in HR need measurement science?

AI is being deployed in high-stakes people decisions — hiring, performance, attrition — without the measurement science to evaluate whether it works or whom it harms. Construct validity, effect sizes, and criterion validity are the vocabulary for asking an AI vendor the right questions.

How is the research made accessible?

The evidence is indexed and searchable: 205+ works, 40+ citation-grade insight cards, and 8 research arcs, so the right finding reaches the right decision at the right time.

What separates good people measurement from assertion?

Good measurement has a method: construct validity, reliability, and effect-size interpretation are not optional — they are what separates evidence from assertion.

library / libf510f1896c1c631d

Statistical Rethinking Mcelreath

In a sentence

A course that re-trains researchers to approach statistics as a principled process of building, comparing, and critiquing generative models within a Bayesian framework to achieve causal understanding and predictive accuracy.

For researchers uneasy with the traditional statistical cookbook of p-values and canned tests, 'Statistical Rethinking' offers a complete, hands-on course in modern Bayesian data analysis. It reframes statistical modeling as 'golem engineering,' a craft of building custom models from first principles to answer specific scientific questions. Using a code-intensive approach with R and Stan, the book guides readers from the fundamentals of probability as counting possibilities to the construction of sophisticated tools like multilevel models and causal inference with Directed Acyclic Graphs (DAGs). By emphasizing practical implementation, prior predictive simulation, and principled model comparison, it empowers researchers to not only use statistics, but to truly understand, justify, and critique their own analytical work.

The four lenses

Science
Statistics
Systems
Strategy

The model

This model represents the book's central argument: that a set of 'rethinking' practices (Design Levers) improves the quality of a researcher's modeling process (Psychological/Behavioral States), which in turn leads to higher quality scientific outcomes. The model frames the book's teachings as a causal pathway to better scientific inference.

Generative Thinkingdesign lever

The practice of designing statistical models by first articulating a 'data story'—a causal or descriptive narrative of how the data could have been generated. This contrasts with selecting a pre-existing model from a menu of options.

Bayesian Updating Practicedesign lever

The application of Bayesian inference, which uses probability theory to logically update the plausibility of parameter values in light of data. It treats the entire posterior distribution as the estimate, fully propagating uncertainty.

Causal Reasoning Practicedesign lever

The use of formal causal models, such as Directed Acyclic Graphs (DAGs), to make causal assumptions explicit, identify potential confounding, and guide the design and interpretation of statistical models for causal inference.

Principled Model Comparisondesign lever

The practice of comparing and critiquing models based on their estimated out-of-sample predictive accuracy, using tools like information criteria (WAIC, PSIS) and posterior predictive checks, rather than null hypothesis tests or in-sample fit.

Adaptive Regularization Practicedesign lever

The use of model structures, particularly multilevel (hierarchical) models and thoughtfully chosen priors, to regularize parameter estimates, prevent overfitting, and pool information across data clusters.

Model Validity for Causal Inferencepsychological state

The degree to which a statistical model's structure is appropriate for estimating a specific causal effect, as determined by a corresponding causal model. A valid model correctly closes backdoor paths without opening new confounding paths.

Model Generalizability for Predictionpsychological state

The extent to which a model is expected to make accurate predictions on new data from the same generative process, achieved by effectively balancing the underfitting-overfitting trade-off through regularization and appropriate complexity.

Researcher Understanding of Modelpsychological state

The researcher's deep comprehension of their model's assumptions, mechanics, and implications, enabling them to interpret parameters correctly, visualize predictions, and diagnose problems.

Causal Inference Qualityoutcome metric

The accuracy, precision, and robustness of the causal claims derived from the statistical analysis, reflecting a successful isolation of a causal effect from confounding associations.

Predictive Performanceoutcome metric

The measured accuracy of the model's predictions on new, unseen data, typically quantified by a scoring rule like log-probability or mean squared error on a hold-out test set.

Scientific Insightoutcome metric

The degree to which the modeling process generates new knowledge, refines scientific theory, or reveals previously unknown patterns in a phenomenon, beyond mere statistical significance or fit.

How they connect

causal reasoning practice → influences model validity for causal inference
adaptive regularization practice → influences model generalizability for prediction
principled model comparison → influences model generalizability for prediction
generative thinking → influences researcher understanding of model
bayesian updating practice → influences researcher understanding of model
model validity for causal inference → predicts causal inference quality
model generalizability for prediction → predicts predictive performance
researcher understanding of model → influences causal inference quality
researcher understanding of model → influences predictive performance
causal inference quality → influences scientific insight
predictive performance → influences scientific insight

The story

The reader A researcher in the natural or social sciences who has a basic understanding of regression but feels uneasy and unconfident about conventional statistical practices (p-values, a zoo of tests) and wants a more intuitive, unified, and powerful framework for statistical modeling.

External problem

Standard statistical toolboxes are inflexible, confusing, and often ill-suited for the specific and novel research contexts that modern researchers face, making it difficult to analyze complex data correctly.

Internal problem

The researcher feels anxious about their statistical choices, fearing they are using the 'wrong' test, misinterpreting results, and lacking the ability to build the models they truly need to answer their questions.

Philosophical problem

It's just wrong that scientific training provides a confusing menagerie of black-box procedures instead of a unified, principled framework for building and understanding statistical models from the ground up.

The plan

Learn the fundamentals of Bayesian inference as a logical system of counting possibilities.
Master building and interpreting a wide range of models (linear, GLM, multilevel) using an explicit, formula-based language.
Employ formal tools for causal reasoning (DAGs) and model comparison (information criteria) to make principled analytical decisions.

Success

The researcher becomes a confident 'golem engineer,' able to design, build, and critique custom statistical models tailored to their specific research questions.
They can make more robust inferences, perform causal analyses with clarity, and transparently communicate their statistical assumptions and results.
They transform statistical anxiety into statistical wisdom, equipped with a powerful and flexible toolkit for modern scientific research.

At stake

The researcher remains stuck with an inadequate and confusing toolbox of statistical tests, leading to fragile inferences and continued anxiety.
They risk making serious analytical errors by misapplying standard procedures or failing to account for the causal structure of their data.
They will be unable to leverage modern statistical methods to answer their most interesting and complex research questions.

Chapter by chapter

ch03Sampling the Imaginary
This chapter explores how sampling from posterior distributions simplifies the interpretation of Bayesian statistics, illustrating intuitive ways to derive meaningful insights from complex data.
- Bayesian models become more accessible when framed in terms of natural frequencies rather than abstract probabilities.
- Sampling provides a robust framework for summarizing complex datasets, shifting focus from intricate calculus to straightforward counting.
- Point estimates may obscure uncertainty; therefore, presenting credible intervals offers a more honest reflection of findings.
- The efficacy of models improves when parameter uncertainty is propagated into predictive distributions via sampling.
ch04p02Geocentric Models (part 2/8)
This chapter reveals the complexities and nuances involved in modeling relationships between variables, specifically focusing on the challenges and methodologies of polynomial regression and spline models.
ch04p03Geocentric Models (part 3/8)
This chapter delves into the use of indicator and index variables in regression models, particularly in the context of categorical data, effectively illustrating their impact on statistical relationships and predictions.
- Using indicator variables introduces complexities in regression-based model interpretations, which must be navigated carefully.
- Models using index variables allow for efficient representation of multicategorical data, reducing overfitting risks while maintaining interpretative rigor.
- Significance testing of parameter differences goes beyond mere non-zero values; actual contrast calculations are necessary for credible inference.
- The careful selection and representation of categorical predictors can drastically alter model performance and interpretation, emphasizing the need for rigorous statistical practices.

Related in the library