methodology note · v1.0 · published 2026-07-03

How the Scoreboard works.

This note is public. It travels with every Scoreboard report and every issue of the State of Mid-Market Industrial AI index. If we change the method, we change the version number and say what moved.

Why this instrument exists

Most AI assessments are sales documents. A vendor or a consultancy scores your "readiness," finds you almost ready, and — conveniently — sells the thing that closes the gap. The diagnosis is written by the party that profits from a bigger treatment. That structure, not the technology, is why the failure numbers look the way they do.

The numbers are worth sitting with:

Every one of those failures was, at some point, a proposal on an executive's desk that nobody was equipped to interrogate. The Scoreboard exists to make that interrogation routine — twenty scored questions an operator can answer in a few minutes, scored against the documented ways these projects actually die. It is free, it stays free, and it is not a funnel into an implementation contract, because we don't sell implementation. That last part is in writing; see the charter section below.

I spent my career in audited, expensive-to-get-wrong industrial delivery, where no structure gets built from a drawing nobody checked. The Scoreboard applies that discipline — check the drawing before you build — to AI spending decisions. That background is the proof behind the instrument, not its market: Tektari serves mid-market manufacturers, distributors, logistics, and industrial-services operators.

What it measures

Four disciplines, five questions each, mapped one-to-one onto the failure evidence:

  1. Problem definition. Does the business problem exist on paper before the technology does — a written problem statement, a sponsor-signed baseline, a pass/fail line, a kill rule? This section instruments RAND's #1 failure cause.
  2. Data plumbing. Can your own team locate, trust, and trace the records the AI would run on? AI fitted to miscoded records repeats the miscoding faster and with more confidence.
  3. Vendor exposure. Who holds the pen — who wrote the scope, who defined "delivered," who measures it, who carries the cost of failure, and what do you keep if you walk? Most mid-market AI arrives on vendor paper (MIT NANDA found external partnerships behind roughly two-thirds of deployments), so this is where most of the risk gets signed.
  4. Workforce readiness. Can your leadership team defend the initiative under questioning, and can your frontline supervisors carry the change? This is the MI/PwC gap, measured on your own organization.

What it deliberately does not measure

How scoring and banding work

Each question has four answers scored 0 to 3, worst to best, in the open — the options aren't shuffled or disguised, because a respondent who games a free self-assessment only cheats themselves. Sections score 0–15; the total scores 0–60. All four sections weigh equally in version 1.0: we have no empirical basis yet for claiming one discipline predicts failure more than another, so we don't. If the data ever supports weighting, that becomes a new version, not a quiet edit.

Four result bands, by total score: Vendor's market (0–15), Pilot graveyard (16–30), Instrumented, with gaps (31–45), Buyer in control (46–60). The band descriptions in your report say what the score exposes you to in plain terms — for example, that your vendor can currently define your problem, grade their own work, and get paid either way. Every report also names your lowest-scoring section explicitly, because that is where your next dollar of failure is most likely to come from.

How peer comparison works — and why yours might say "not yet"

We compare you only against qualified peers: executive respondents (CEO/President, COO, CFO, or equivalent) at $50M–$500M manufacturers, distributors, logistics, and industrial-services operators. And we publish the pool size — n — every single time a comparison appears.

The reason is simple: a percentile computed on a pool of 30 is a costume, not a statistic. We would rather show you a coarse true number than a precise fake one, and an instrument about verification discipline doesn't get to fake its own statistical authority.

Anonymization and aggregation rules

The charter this sits under

The Scoreboard is part of Tektari's verification line, which operates under a published charter: we sell no implementation, we take no vendor commissions or referral fees, and verification fees never credit toward any other Tektari offer. Our prices are published. The paid step after the Scoreboard, for those who want it, is the Problem-Definition Audit — three weeks, $12,500 fixed, producing one AI use case, a sponsor-signed baseline metric, and a written vendor acceptance test your team can run without us; $2,500 of that fee is invoiced only when the scoped pilot passes its acceptance test. If the benchmark told you to buy something from the benchmark's author, you'd be right not to trust the benchmark. This one can't, by charter. Read the Verification Charter.

Versioning

This is instrument and methodology v1.0. The rules:

Questions about the method: hello@tektari.com. The full question set and scoring rubric are visible inside the instrument itself — there is nothing hidden to reverse-engineer.

Take the Scoreboard