peopleanalyst

magazine · Thesis · the economy of work

Every decision about work joins on one field — the job — and almost no one has bothered to standardize it. That's not a gap. It's the most undervalued position in the data economy.

By Mike West

June 16, 2026

The Join Key

There is a field in nearly every dataset about working life that almost no one has bothered to standardize, and it happens to be the one everything else joins on.

It's the job.

Open any system that touches work — the applicant tracker, the HRIS, the payroll run, the org chart, the compensation survey, the labor-market dashboard, the workforce plan — and somewhere in every record sits a job: a title, a code, a level, a family. It is the spine the data hangs on. You compare pay by job. You benchmark headcount by job. You plan hiring, model attrition, audit representation, map careers — by job. When an analyst asks the most ordinary question in the field — are we paying this person fairly? — the entire answer depends on correctly matching one job to another. The job is the join key. It is the primary key of the whole economy of work.

And it is, almost everywhere, a string of free text that two companies fill in differently and no one reconciles. "Senior Software Engineer II" means one thing here and something two levels apart there. "Account Executive" covers a person closing six-figure enterprise deals and a person dialing a call list. The most load-bearing field in the data is the least disciplined. That is not a small gap. It is, I want to argue, the most undervalued position in the data economy — undervalued precisely because the thing sitting on it looks boring.

They say the job is busywork

Ask a hiring manager about the job description and watch the enthusiasm drain from the room. The job description is the thing HR makes you write before you're allowed to post the req. It's the document nobody reads after the candidate signs. It's compliance, it's overhead, it's the part of the work that feels like not the work. In the mental hierarchy of every operator I've met, the job sits near the bottom — a label, a formality, a box.

The market absorbed that judgment and built around it. The last decade's energy in talent technology went to skills — the skills graph, the skills cloud, the skills-based organization — on the premise that the job is a clumsy, dying unit and the skill is the real atom of work. It went to people — the profiles, the networks, the inferred expertise. It went to outcomes — the dashboards of what got done. All of it interesting. All of it built on top of a job field that stayed a string.

Here's the tell. Every one of those systems, to be useful, eventually has to answer for which job? Skills are powerful, but you don't hire a skill, pay a skill, or budget headcount in skills — you hire, pay, and budget roles, and the skills attach to the role. A skill without a job to attach to is a word. The industry chased the attribute and skipped the entity. It mistook the adjective for the noun.

The noun nobody owns

So why is the join key still free text, decades into the age of data? Because building a real one is genuinely hard, and the two camps that tried each solved a different half and stopped.

The governments built the public taxonomies — the occupational classification systems that the labor statistics run on, the ones that catalog the world of work in fewer than a thousand occupational categories.1 They are real, they are open, and they are coarse by design: built to count a national workforce, not to tell one company how its "Staff Product Designer" lines up against another's. They were never meant to carry a pay decision, and they buckle when you ask them to.

The compensation-survey houses built the other half — the proprietary job-leveling frameworks that large employers actually pay against, with their own families, their own levels, their own match logic.2 These are far richer than the public codes. They are also locked: each is its own dialect, mutually unintelligible, licensed by the seat, and engineered so that the structure stays inside the vendor. You can rent a view of the join key. You cannot own one.

Between the coarse-but-open and the rich-but-locked sits the thing nobody built: a canonical job architecture that is both — granular enough to carry a real decision, open enough to compute on, and neutral enough that it can speak every dialect without belonging to any one of them. It didn't get built because the field had quietly agreed the job wasn't worth it. The boring primitive defended itself by looking boring.

What the key unlocks

Here is the part that turns a classification chore into a strategy. Get the job architecture right — define a job as the honest intersection it actually is, a function crossed with a level, synthesized so it maps to every source instead of copying any one of them — and the rest of the economy of work composes onto it.

Pay composes onto it: not as a borrowed median from a survey cut, but as a model — what this function, at this level, is worth, adjusted for industry and company size and stage and country and metro, calibrated against public data rather than copied from it. Skills and competencies compose onto it: the KSA profile of a role, the thing the skills movement wanted, but anchored to the entity decisions are actually made about. Performance composes onto it. Supply and demand compose onto it — hiring rates, attrition, employment levels become benchmarkable reference points once they're keyed to a stable definition of what job. Representation composes onto it: you cannot ask whether a workforce mirrors its labor market until "the job" means the same thing on both sides of the comparison. Careers become paths between jobs — transfers along shared skill, rather than leaps between unrelated strings.

One backbone, and every service in the field is a query against it. That is what people mean, without quite saying it, when they imagine a public, authoritative place to ask what a job is, what it takes, and what it pays — the way you'd look up a company's fundamentals. The reason no such place exists is not that the demand isn't there. It's that the join key it would require was never built, because the people who could have built it didn't think the job was worth the trouble.

The shot they didn't take

I'll say the strategic part plainly, because the analysis earns it. The value in data accrues to whoever controls the keys that everything else joins on — and the most valuable key is the one that ties together the most questions while costing the least to defend. The job is exactly that key. Nearly all of human activity routes through work, and nearly all of economic life routes through the job; an architecture that holds the canonical definition of job, and offers services off it, sits at the confluence of more value than almost any other primitive in the field.

That position was available for an unglamorous reason: everyone looked at job descriptions, decided they were tedious, and looked away. The hilltop with the best view of the whole economy of work went unclaimed because the path up it was labeled boring. This is a recurring shape worth naming — the most valuable join key is usually the one the field has agreed to treat as a label. It's a Chesterton move: the fence everyone wants to tear down is load-bearing, and the chore everyone wants to skip is the asset.

The discipline to claim it is not mysterious. Synthesize the structure rather than copy any source, so it can translate between all of them. Keep the public codes as a translation layer between datasets, not as the architecture itself — a Rosetta stone, not the language. Make pay a model over real dimensions instead of a median frozen in a survey. Carry provenance on every number, so the whole thing can be trusted rather than merely consulted. None of these are exotic. They are simply the work the field declined to do on the primitive it declined to respect.

The unit AI has to reason about

There's a reason this matters more now than it did five years ago. We are handing the decisions of working life to systems that reason over data, and a system can only reason as well as its join key lets it. Ask an AI to compare two roles, price a hire, plan a reorg, or flag a pay inequity, and it inherits the same problem the analyst always had: it has to know that this job and that job are the same job, or it's confidently joining noise to noise. The free-text job field is where a great deal of machine reasoning about work quietly goes wrong. A canonical job architecture isn't infrastructure plumbing off to the side of the AI story. It's the part that decides whether any of the reasoning on top of it means anything.

The job was never the boring field. It was the primary key of the economy of work, sitting in plain sight, wearing the disguise of a chore. The opportunity was never hidden. It was skipped — and a thing that's skipped, unlike a thing that's hidden, is still there for whoever decides to walk up and take it.


This is a cross-portfolio thesis in the People Analyst program — the argument that "job" is the undervalued, undefended join key for the data of working life, and that building its canonical architecture is a position rather than a feature. It's the spine beneath the portfolio's compensation and job-structure work (anycomp, the compensation toolbox) and a sibling in posture to the Measurement Meets AI essays: the answer was sitting in plain methodological sight the whole time. This piece is thesis, not announcement — it makes no product claims; the architecture it describes is the work, not a launch.

Footnotes

  1. The U.S. Standard Occupational Classification (SOC) and the O*NET system built on it catalog the national world of work in fewer than a thousand detailed occupational categories — designed for labor-market statistics and cross-employer counting, not for the granularity a single employer's pay or leveling decision requires. They are public and authoritative, and deliberately coarse.

  2. The major compensation-survey and job-architecture providers (e.g., the leveling frameworks behind the large salary surveys) maintain proprietary job families, levels, and match methodologies that participating employers benchmark against. These are substantially richer than the public codes but mutually incompatible and licensed, by design — the structure remains inside the vendor rather than in the customer's hands.

Was this useful?

Anchored in

← All magazine pieces