Skip to content

§ 05 of 05 — Stage Five · Maintain

Identity as stewardship.

The four rooms before this taught how an identity is named, told apart from its namesakes, classified, and connected. This room is about what happens after — for decades — to keep that identity true. Names change. Records get corrected. Entities get merged, split, transliterated, and re-transliterated. Cataloging is not clerical work; it is the ongoing care of identity, meaning, relationships, and trust. Stage 5 walks one record's ledger of changes, then asks the harder question: what would it take to verify all this against something stronger than institutional reputation?

Exhibit A

One identity, forty-five years of kept promises

Each entry below is a real type of change a real authority record undergoes over its lifetime. The dates are anchored to public events where they are documented and are representative otherwise — what matters here is the shape of the stewardship work: who acts, what gets recorded, and why each kind of edit matters. Click any row to read the editorial note.

Stewardship ledger
Stephen King
9 entries · since 1979

VIAF 27066711 · LC n79018049

Published Merkle root · sha-256
Commits to all 9 entries
c32a9badd90afb549a545fb080b475a3c1868e4652f9a400d1ae1acaf22a3b30

Every entry below carries the sibling hashes needed to prove it is a member of this root. Expand any entry and click verify — the browser recomputes the root from the entry up the tree and confirms the match.

Curated · entries are real change-types; specific dates are anchored where public and representative otherwise

Live · Merkle root computed at build, verified in browser via SubtleCrypto

Each entry above is a piece of institutional labor performed for free, by people whose job titles do not say 'identity engineer'. The maintenance is what makes the identifier worth pointing at decades later. Without it, every persistent identifier is just a string that used to resolve.

The harder question

What would a verifiable trust layer over all this look like?

Stage 1 introduced an entityMd5 — a content fingerprint OCLC publishes alongside each WorldCat entity, used by downstream consumers to detect when a record has changed. The fingerprint is a useful primitive but it is change-detection, not tamper-evidence — and it is computed by the same institution whose record it attests to. A verifiable trust layer would have to do three things the current architecture does not.

01

Signed change manifests

Every committed change to an authority record is signed by the cataloger's institution. The signature attests not just that the record changed but that a specific actor — LC, DNB, VIAF — endorsed the change. Verification no longer requires trusting the host. The signature is checkable against the institution's published key.

02

Merkle-rooted history

The ledger of changes is committed to a Merkle tree, with each new entry chained to the previous root hash. A consumer holding a recent root can prove that any earlier entry has not been silently rewritten — and the cluster's history becomes auditable end-to-end with the storage cost of a single hash.

Demonstrated above on Authority Arc's own ledger — try the verify button on any entry.

03

Replicated commitments

The Merkle roots are co-signed and replicated across multiple authorities — LC, DNB, VIAF, and any peer that wants to participate — so no single host can revise the past unilaterally. Stewardship becomes verifiable in the cryptographic sense, not just the reputational one.

None of this is exotic. The primitives are forty years old. What is missing is a deployment that takes institutional stewardship seriously enough to commit to it cryptographically — to say, here is what we changed, here is who changed it, here is the chain that proves we did not rewrite the past. The catalog is already keeping the records. The next layer is keeping the records keepable.

Honest Capability

What this page actually does, and what it doesn't.

Demonstrated
Reproducible from the source code
  • A curated stewardship ledger for one real authority cluster, showing the kinds of changes that real catalog records undergo over their lifetime.
  • Editorial notes per entry on *why* each change type matters — what it preserves, who acted, and what the institutional logic of the move was.
  • A working Merkle tree over the ledger: the root is computed at build time over the canonical serialization of every entry; each entry carries its sibling-hash inclusion proof; the browser re-verifies via SubtleCrypto on click. This is the second pillar of the trust-layer proposal, applied to our own data.
  • A clear framing of what a federated verifiable trust layer over real institutional stewardship would additionally require: signed change manifests, replicated co-signed commitments.
Aspirational
What the page implies but does not prove
  • That the dates shown are the exact dates each event happened. Specific events anchored to public facts (e.g. King's 1985 acknowledgment of Bachman) are accurate; others are representative ranges.
  • That signed change manifests and federated peer commitments exist today in the bibliographic universe. They do not — VIAF and OCLC ship records but not signatures. The first and third trust-layer pillars remain a forward-looking proposal; only the Merkle-rooted history pillar is demonstrated here, and only over Authority Arc's own curated ledger.
  • That cryptographic stewardship would replace institutional stewardship. It would not. It would let institutional stewardship be *checkable* without requiring the consumer to trust the host on faith.
Faked, with cause
Narrative liberties, named honestly
  • Specific year-precise dates on the ledger entries are anchored where the underlying event is public (1985 pseudonym reveal, 2003 VIAF launch, 2012 Wikidata launch) and are illustrative otherwise. The change-types are accurate; some calendar-year placements are representative.
  • OCLC's published entityMd5 is real (visible on every Entities API response); the cryptographic shortcomings explained in Stage 1 are accurate. The signed-manifest and federated-replication pillars are editorial — there is no production deployment of signed authority manifests at OCLC or VIAF as of this writing.
  • The Merkle root demonstration runs over Authority Arc's own curated ledger, not over real-time edits to VIAF. The primitive is real (SHA-256, standard Merkle proofs, browser-side SubtleCrypto verification); the analogue at OCLC/VIAF scale is the proposal the rest of the trust-layer section frames.
A note from the curator

The most radical thing a library does is keep remembering.

Amodern technology stack maintains its identifiers for as long as the company that owns them remains solvent. A library catalog maintains its identifiers for as long as the institution remains. The difference matters because the time horizons are different by orders of magnitude — and because the work to bridge that gap is done in public, by people who are usually not paid commensurate with the responsibility.

The premise of this arc has been that the discipline of cataloging is a kind of identity engineering — older than the web, broader in scope than most modern entity-resolution systems, and conducted under a much harder constraint: the answers have to stay true, not just locally correct. Stewardship is the part that does not show up on the page. The identifier on top of every record is the visible end of a promise that thousands of people, over decades, have agreed to keep.

If the modern era brings any improvement to this picture, it is not algorithmic. The cataloging work itself will not be automated; it requires judgment about evidence, and the evidence itself is unstructured human testimony. What modern primitives can offer is the layer beneath the work — a way to make the stewardship checkable, to chain the changes to a public log, to let a consumer in 2080 verify that a record they see today is the record their predecessor wrote, signed by the institution that wrote it.


The five rooms of this museum end here. What it is meant to leave you with is not a feeling that any of this is solved — none of it is — but a better sense of which problems are old, which are new, and which ones the field has been quietly thinking about longer than any of the names you would normally associate with them. The catalog is one of the longest-running working identity systems on the planet. It is worth knowing what shape it has.