What is PeopleAnalyst?

PeopleAnalyst is the front door for people-analytics research: 205+ works indexed and profiled, 40+ citation-grade findings extracted, and peer-reviewed behavioral science translated from academic language into actionable guidance. It is the missing manual for the people analytics you always meant to do.

What is people analytics?

People analytics is not a dashboard. It is behavioral science and statistical inference applied to workforce decisions — a discipline with its own methodology, spanning measurement, organizational design, talent, leadership, and analytics craft.

Why does AI in HR need measurement science?

AI is being deployed in high-stakes people decisions — hiring, performance, attrition — without the measurement science to evaluate whether it works or whom it harms. Construct validity, effect sizes, and criterion validity are the vocabulary for asking an AI vendor the right questions.

How is the research made accessible?

The evidence is indexed and searchable: 205+ works, 40+ citation-grade insight cards, and 8 research arcs, so the right finding reaches the right decision at the right time.

What separates good people measurement from assertion?

Good measurement has a method: construct validity, reliability, and effect-size interpretation are not optional — they are what separates evidence from assertion.

Data Analysis with LLMs

Immanuel Trummer · 2025

In a sentence

A hands-on guide showing developers and data scientists how to use large language models—across text, tables, images, audio, and graphs—to build effective, cost-efficient data analysis pipelines in Python.

Data Analysis with LLMs, by Cornell professor Immanuel Trummer, is a practical field manual for data practitioners who want to put modern language models to work. Starting from first principles (what a prompt is, how tokenization works, why few-shot examples help), the book walks readers step by step through Python mini-projects that classify text, extract structured information, cluster documents, translate natural language into SQL and Cypher queries, answer questions about images and videos, transcribe and translate audio, and build voice-driven database interfaces.

It then tackles the economic problem production teams face: how to get high-quality results without overpaying. Chapters on model selection, parameter tuning, prompt engineering, and fine-tuning demonstrate concrete cost-quality tradeoffs on a running sentiment-classification scenario.

The final section broadens the toolkit to GPT alternatives (Anthropic, Cohere, Google, Hugging Face), the LangChain agent framework, and LlamaIndex for multimodal retrieval, giving readers what they need to design sophisticated, maintainable AI pipelines. For software developers, data scientists, and curious hobbyists alike, the book turns the apparent magic of LLMs into systematic, replicable engineering practice.

The four lenses

Science
Statistics
Systems
Strategy