Identity is what survives when the name no longer carries it. The work is older than the web.
§ 02 of 05 — Stage Two · Distinguish
Which Stephen King?
Stage 1 took an identifier and told you what it pointed at. Stage 2 starts where two strings refuse to settle. Authority records disagree. Sometimes two records describe one person and the catalog has to merge them. Sometimes one name spans two people and the catalog has to keep them apart. Either way, the discipline that decides is the older institutional cousin of the identity-resolution work modern systems quietly reinvent from scratch.
Exhibit A
Watch the candidates appear — live, ranked, named
Type any name. The request goes straight to VIAF's AutoSuggest endpoint, which returns the clusters it has on hand that prefix- match your query — ranked by VIAF's own scoring against the indexed name forms across forty-plus national authorities. Each row is a candidate identity. You haven't decided anything yet — you have only seen what the catalog could plausibly mean.
Type to search.
Live · VIAF AutoSuggest API
Each row is a separate authority cluster. The work of Stage 2 is deciding which of them should still be separate.
Exhibit B
Two records. Same person, or two?
Three curated cases — a pseudonym merge, a cross-script transliteration merge, and a same-string split. For each, you see two authority records as a cataloger would: heading, dates, field, known works. Commit to an answer before reading the reveal; the lesson sticks only if you have already chosen.
Same person, or two?
Case twain-clemens
Record ALC · n79021164
Twain, Mark, 1835-1910
Dates
1835–1910
Nationality
American
Field
Author, humorist, lecturer
Curator
Library of Congress / NACO
Known for
Adventures of Huckleberry Finn
The Adventures of Tom Sawyer
Life on the Mississippi
Cross-references
Twain, Marc
Twen, Mark
Record BLC · n80050259
Clemens, Samuel Langhorne, 1835-1910
Dates
1835–1910
Nationality
American
Field
Author, journalist, riverboat pilot
Curator
Library of Congress / NACO
Known for
Letters from Hawaii (as Samuel L. Clemens)
Autobiography of Mark Twain
Cross-references
Clemens, Sam
Clemens, S. L.
Your call
Same person, or two?
Case tchaikovsky-transliterations
Record ALC · n80018500
Tchaikovsky, Peter Ilich, 1840-1893
Dates
1840–1893
Nationality
Russian
Field
Composer (Romantic-era Russian)
Curator
Library of Congress / NACO
Known for
Symphony No. 6 in B minor 'Pathétique'
Swan Lake (ballet, 1875–1876)
Eugene Onegin (opera, 1879)
Piano Concerto No. 1 in B-flat minor
Cross-references
Tchaikovsky, P. I.
Tchaikovsky, Pyotr Ilyich
Tschaikowsky, Peter
Record BDNB · 118621777
Čajkovskij, Pëtr Il'ič, 1840-1893
Dates
1840–1893
Nationality
Russian
Field
Komponist (Romantik, russisch)
Curator
Deutsche Nationalbibliothek
Known for
Sinfonie Nr. 6 h-Moll 'Pathétique'
Schwanensee (Ballett, 1875–1876)
Eugen Onegin (Oper, 1879)
Klavierkonzert Nr. 1 b-Moll
Cross-references
Tschaikowski, Pjotr Iljitsch
Čajkovskij, P. I.
Tchaikovsky, Peter
Your call
Same person, or two?
Case two-john-smiths
Record ALC · n50054268
Smith, John, 1580-1631
Dates
1580–1631
Nationality
English
Field
Soldier, explorer, colonial administrator
Curator
Library of Congress / NACO
Known for
The Generall Historie of Virginia, New-England, and the Summer Isles (1624)
A Description of New England (1616)
Cross-references
Smith, John, Captain
Record BLC · n79139831
Smith, John, 1797-1867
Dates
1797–1867
Nationality
American
Field
Painter (American landscape)
Curator
Library of Congress / NACO
Known for
View of the Susquehanna (1832)
Autumn in the Catskills (1845)
Cross-references
Smith, John (American painter)
Your call
Curated — record data and evidence summarized from real LC + VIAF entries
Field guide
The kinds of disagreement, named
Disagreement between authority records isn't a failure case — it is the normal condition of a working catalog. The discipline gives the recurring shapes their own names so a future cataloger can see, at a glance, which kind of disagreement they are looking at.
Pseudonym link
Type
Two name forms, one person.
The most documented disagreement: a writer publishes under one name and lives under another, and the catalog must decide whether to keep both records or unify them under one preferred heading. The link is typically provable from the person's own correspondence, a contemporary biographer, or a publisher's records.
e.g. Mark Twain ↔ Samuel L. Clemens · George Eliot ↔ Mary Ann Evans · le Carré ↔ Cornwell
Transliteration variant
Type
Same person, different alphabet.
A name moves between writing systems and accumulates spellings. Russian names in particular split across ALA-LC, BGN/PCGN, ISO 9, and library-local schemes — and across decades, since transliteration practice itself changes. The cluster must hold all of them without making any one the truth.
e.g. Чайковский → Tchaikovsky · Tchaïkovsky · Chaikovskii · Čajkovskij
Contested dates
Type
When the records disagree about a year.
Birth and death years are the strongest disambiguation signal — and they are also among the most commonly wrong. A 19th-century author may have a birth year that varies by three years across the national authorities, depending on which baptismal record, census, or self-report was used. The cataloger has to pick one or note the range.
e.g. Christopher Marlowe — 1564 across most authorities, but some 18th-century sources record 1563.
Posthumous merge
Type
Two records become one decades later.
An author publishes under a pseudonym in life and the link is only confirmed after their death (or, sometimes, after they themselves confirm it in old age). The catalog must merge two records that had been separately maintained — sometimes for half a century — into one cluster, while preserving the access points so existing citations still resolve.
e.g. Robert Galbraith ↔ J. K. Rowling, 2013 — three months of separate maintenance, then merged.
Posthumous split
Type
A merged record turns out to be two people.
The reverse case, which is rarer but cleaner to demonstrate: someone in the 19th century cataloged two distinct people under a single entry because the source documents looked similar enough. Later evidence (new archival material, a discovered passport, a contradictory date) forces the cluster to be split — and every record that pointed at the old cluster has to be rerouted.
e.g. Multiple known cases in medieval and early-modern hagiography where two saints with similar names were unified for centuries before being re-separated.
Anonymous attribution
Type
A work without a confident author.
Some works can only be attributed to 'Author of <other work>' or to a uniform title that floats outside any person cluster. The authority record becomes the link itself — a placeholder that may, decades later, get tied to a real person if new evidence surfaces.
e.g. 'Author of the Cloud of Unknowing' — a real LC heading. No person cluster, but a stable identity.
Honest Capability
What this page actually does, and what it doesn't.
Demonstrated
Reproducible from the source code
Live VIAF AutoSuggest queries with debounced typeahead, ARIA combobox semantics, and keyboard navigation.
Three curated cases — Twain/Clemens (pseudonym merge), Tchaikovsky transliterations (cross-script merge), and two different John Smiths (split) — each with the actual evidence catalogers would cite.
Vote-then-reveal interaction that requires the visitor to commit to an answer before the reveal renders.
An editorial field guide naming the recurring shapes of disagreement: pseudonym link, transliteration variant, contested dates, posthumous merge/split, anonymous attribution.
Aspirational
What the page implies but does not prove
That every case shown here has a single correct answer. In practice, many clusters carry an acknowledged probability of being wrong; VIAF documents the residual disagreement rather than pretending it resolved.
That AutoSuggest's ranking reflects truth. It reflects VIAF's confidence against indexed name forms — which is a function of how many authorities have already crosswalked a cluster, not whether the cluster is correct.
That the field guide is exhaustive. It is a starting taxonomy; real catalogs carry edge cases (composite identities, institutional pseudonyms, attributed-pseudonymous works) that don't fit any of these.
Faked, with cause
Narrative liberties, named honestly
The three curated cases use illustrative record IDs and a summarized evidence list, not raw MARC dumps. The decisions, life-date data, and transliteration systems are accurate to the real records; the rendering is a museum placard, not a cataloger's worksheet.
The 'painter John Smith' record (n79139831) is representative rather than a specific historical figure — there are several 19th-century painters named John Smith in LC; this card is composited from the demographic shape they share.
Real LC authority records do not include a 'Field' line as such; that label is condensed from the 670-source citations and 678-biographical/historical notes for legibility.
A note from the curator
The work is a kept promise, not a clean answer.
The uncomfortable truth about authority work is that most of it ends in probable, not certain. A catalog of fifty million names cannot afford to suspend belief on every disputed case forever. So the discipline does the thing every serious identity system eventually does: it makes a decision, names the evidence, and leaves a door open.
The door is the 4XX and 5XX fields in the MARC authority record — the see-from and see-also references — and the linked-data equivalents in modern formats. They say: we have decided these two names point at the same person, and here is the evidence; if you find a reason to think otherwise, here is where to write your objection. The decision is provisional in the sense that any institution is provisional. Tomorrow's evidence can change today's cluster.
This is also why VIAF does not call itself the authority. It is a consensus layer over forty-plus national authorities, each of which is itself a kept promise from a specific institution to the rest of the bibliographic universe. When two authorities disagree about whether two name forms point at one person, VIAF preserves the disagreement and ships both interpretations until somebody does the archival work to resolve it.
The next room is harder still. Once you have decided who a record is about, you still have to say what it is about — where in the structure of human knowledge it sits. There are two competing answers to that question, and they are not just formats. They are different epistemologies. Stage 3 puts them next to each other.