library / lib45eb9ee01b943a9e
Statistics_ A Very Short Introduction (Very Short Introductions)
In a sentence
A concise tour of modern statistics that reframes the discipline as the exciting technology of extracting meaning and understanding from data rather than tedious arithmetic.
Statistics: A Very Short Introduction dismantles the dusty Victorian image of statistics and replaces it with a vivid portrait of a vibrant, computer-powered discipline that underpins medicine, government, commerce, science, and everyday life. David J. Hand takes a bird's-eye view of the whole field — from simple summaries of data, through the collection of good data, probability, estimation and inference, statistical models and methods, to the transformative role of computing — showing how each idea connects to the others as part of an integrated whole. Rather than teaching mechanical procedures, the book conveys statistical philosophy, the importance of data quality, the meaning of uncertainty, and the power of modern tools to reveal truths invisible to the naked eye. Anyone wanting to understand why statistics matters, how it really works, and why no educated citizen can afford to ignore it will find this an illuminating and surprisingly thrilling introduction.
The four lenses
- Science
- Statistics
- Systems
- Strategy
Tags
The model
A causal framework in which the design choices of data collection and modeling, mediated by data quality and the rigor of probabilistic/inferential reasoning, produce valid understanding and useful decisions while guarding against misleading conclusions.
Data Collection Designdesign lever
The deliberate design of how data are gathered, including experimental design, randomization, balance, and survey sampling strategies intended to maximize information for given cost and reduce bias.
Model Complexitydesign lever
The number of parameters and structural elaborateness of a statistical model chosen to represent the phenomenon, balancing between underfitting simplicity and overfitting elaboration per Occam's razor.
Data Qualitycontextual condition
The completeness, correctness, and representativeness of the raw data, encompassing absence of missing values, measurement errors, selection bias, and definitional ambiguity that would otherwise distort conclusions.
Probabilistic and Inferential Reasoningpsychological state
The correct application of the laws of probability and inferential principles, including conditional probability, independence, Bayes's theorem, and avoidance of fallacies, used to quantify and reason about uncertainty in data.
Estimation and Inference Accuracybehavioral pattern
The degree to which estimates and inferences drawn from samples closely reflect true population values, captured by properties such as bias, mean squared error, and confidence interval coverage.
Sample Sizecontextual condition
The number of observations collected, which through the law of large numbers and the Central Limit Theorem governs how precisely population quantities can be estimated and how confident conclusions can be.
Valid Understanding and Sound Conclusionsoutcome metric
The outcome in which analysis yields genuine insight into the phenomenon studied, supporting accurate prediction, effective decisions, and trustworthy scientific or commercial conclusions.
Misleading or Mistaken Conclusionsoutcome metric
The adverse outcome in which analysis produces false, biased, or overfitted conclusions due to poor data, flawed reasoning, or inappropriate models, eroding trust and producing harmful decisions.
How they connect
- data collection design → predicts data quality
- data quality → predicts estimation accuracy
- probabilistic reasoning → predicts estimation accuracy
- sample size → moderates estimation accuracy
- estimation accuracy → predicts valid understanding
- model complexity → influences misleading conclusions
- model complexity → influences valid understanding
- data quality − predicts misleading conclusions
- probabilistic reasoning − predicts misleading conclusions
A candidate measure
Statistics_ A Very Short Introduction (Very Short Introductions) — derived measurement candidates
Data Collection Design
presence of randomization; sampling method type; design rigor rating
self-report suitability: low
Model Complexity
parameter count; degrees of freedom; structural complexity index
self-report suitability: none
Data Quality
missingness rate; error/outlier count; representativeness index; unit consistency check
self-report suitability: low
Probabilistic and Inferential Reasoning
fallacy-avoidance rate; correctness of probabilistic derivations; expert review score
self-report suitability: medium
Estimation and Inference Accuracy
bias; mean squared error; confidence interval coverage; out-of-sample error
self-report suitability: none
Sample Size
record count (n)
self-report suitability: none
Valid Understanding and Sound Conclusions
out-of-sample accuracy; replication success rate; decision outcome quality
self-report suitability: low
Misleading or Mistaken Conclusions
replication failure rate; overfitting gap; bias magnitude
self-report suitability: low
The story
The reader A curious, educated non-specialist who wants to understand what statistics really is and how it shapes the modern world.
External problem
Statistics seems opaque, intimidating, and easy to misuse, making it hard to interpret data-driven claims.
Internal problem
They feel anxious, mistrustful, or bored by numbers and fear being misled by statistics.
Philosophical problem
It is wrong for educated citizens to be ignorant of the discipline that underpins science, medicine, government, and daily life.
The plan
- Let go of the old image of statistics as tedious arithmetic.
- Learn to summarize and describe data with simple statistics.
- Understand how to collect and judge the quality of data.
- Grasp the language and laws of probability.
- See how estimation, inference, and testing draw conclusions from data.
- Appreciate how models, methods, and computing tie it all together.
Success
- The reader sees statistics as an exciting tool of discovery, reads data-driven claims critically, and understands uncertainty and inference.
At stake
- The reader remains mystified by data, vulnerable to misleading statistics, and unable to participate fully as an informed citizen.
Related in the library
- Introduction to Survey Sampling (Quantitative Applications in the Social Sciences)shared: Statistics
- The Art of Statisticsshared: Statistics
- Data Mining for Business Analytics: Concepts, Techniques, and Applications
- Data Science from Scratch: First Principles with Python
- Data Warehouse and Data Mining
- Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow
Related in the literature
The measurement literature behind this signal — sourced, so you can defend it.
“McHale, Ian 104 McKendrick, Anderson 90 mean value 52–4, 55, 57–8, 60, 67, 68, 69, 72, 73, 79, 81, 92, 93, 95, 102– 4, 107, 111–2, 115–6, 120 misprints 37 misunderstandings 61–2 Monte Carlo simulation 82–5, 104–5 Morse code 85–6 Mosteller, Frederick 5 Multiplication Law, 18–20,…”
— Probability a Very Short Introductionmatch 65%
“67 SOCIAL MEDIA Very Short Introduction Join our community www.oup.com/vsi • Join us online at the official Very Short Introductions Facebook page. • Access the thoughts and musings of our authors with our online blog . • Sign up for our monthly e-newsletter to receive…”
— Big Data a Very Short Introductionmatch 61%
“Organizations: A Very Short Introduction VERY SHORT INTRODUCTIONS are for anyone wanting a stimulating and accessible way in to a new subject. They are written by experts, and have been published in more than 25 languages worldwide. The series began in 1995, and now represents a…”
— Organizations a Very Short Introductionmatch 60%
Resources: Probability a Very Short Introduction · Big Data a Very Short Introduction · Organizations a Very Short Introduction