/records / CP-SIG-60 · Page model JSON · Portable export bundle

record CP-SIG-60slug cp-sig-60

Pseudonymity Collapse at Scale: ETH Zurich Measures What LLMs Can Infer from Retained Records

profileCounterpose Publicationcounterpose_publication
publisherCounterposecounterpose
statusActive
lifecyclePublishedanchorsAnchors not applicable
eligibilitypublication-eligible

computed at 2026-03-19T00:00:00Z

rendered at 2026-03-19T00:00:00Z

profileCounterpose Publicationcounterpose_publicationsource kindcaptured_counterpose_signal

Editorial republication

This page republishes captured Counterpose editorial from the GARPedia admitted pool. It is not a GARP-verified or governed record. The disclosures below are render-time only and assert no substrate verification, admission, or enforcement.

Verification posture (derived from this record)

  • This page is not source-verified by GARPedia.
  • No GARP receipt backs this content; it is not receipt-verified or cryptographically validated.
  • Claims here are republished editorial, not admitted as verified fact by GARPedia.
  • No registered source captures are anchored to this record.
  • No citation spans are present in this record.
  • No GARPedia lineage / audit events are recorded for this record.

Profile-driven rendering

Active profile

Counterpose Publication

Fixture key counterpose_publication · publication-context expanded

provenance
expanded
citation
expanded
source/reference
source references with citation context
redaction
no redaction boundary notice
lint
object lint not represented
receipts
receipt fixtures not represented
traversal
cross-artifact traversal links shown

Shown by fixture

  • expanded provenance posture
  • expanded citation posture
  • source references with citation context
  • cross-artifact traversal links shown

Not represented

  • No redaction boundary applies to this fixture.
  • No object lint findings are represented.
  • No receipt fixtures are attached to this record.

Counterpose Publication fixtures emphasize publisher, edition, source, receipt, and cross-artifact context for a public signal.

Profile rendering parameters are local synthetic fixtures; they do not create private reader views, login states, or role-gated access.

Signal body

# Pseudonymity Collapse at Scale: ETH Zurich Measures What LLMs Can Infer from Retained Records

Counterpose | CP-60 | March 19, 2026

A publication of Vega Commons Project, Inc.

---

Researchers at ETH Zurich published a study in March 2026 demonstrating that LLM-based systems can match anonymous social media accounts to real-world identities at 68 percent accuracy with 90 percent precision, a result the researchers describe as outperforming human investigators performing the same task. The system applied to publicly available data sources: Hacker News posts, LinkedIn profiles, Anthropic interview transcripts, and Reddit accounts across both identified and pseudonymous corpora.

Lead researcher Daniel Paleka described the findings as making it "very clear" that pseudonymous online activity cannot be assumed private when personal details are shared across platforms over time. The paper's conclusion, as reported: "Users, platforms, and policymakers must recognise that the privacy assumptions underlying much of today's internet no longer hold."

## Records as Re-identification Inputs

A custody surface is the set of records an AI system generates during operation that can be discovered, subpoenaed, or compelled through legal process. An interaction record is the log of what a user asked, what the system responded, and any reasoning the system performed. Prior entries in this corpus document the custody surface structurally: records exist, they can be compelled, privacy controls do not eliminate them. The ETH Zurich study provides the first quotable empirical metric for what happens when those records are aggregated across platforms.

Two features of the methodology bear directly on record governance. First, the data sources the study treats as inputs (posts across platforms, interview transcripts, identified and pseudonymous account pairs) are exactly the record categories that retention governance addresses. Records that do not exist in retained form on reachable infrastructure cannot be aggregated for re-identification. Records that do exist constitute inputs to re-identification regardless of the pseudonymity controls applied at any individual platform. Second, the study used Anthropic interview transcripts as a source, meaning records produced in an AI company's evaluation process contributed to the de-anonymization corpus. The mode of record creation places this directly within the domain that record governance addresses.

## Individual and Organizational Exposure

For organizations whose employees use enterprise AI tools, the re-identification capability operates at two levels. At the individual level, an employee who has maintained a pseudonymous professional presence (a common practice in regulated industries and security-sensitive roles) may have that identity linked to their real identity through LLM-enabled aggregation of interaction records across platforms, including records created through employer-provided tools. At the organizational level, an organization's collective AI interaction record, accumulated across employees, time periods, and AI platforms, constitutes an aggregable dataset from which organizational strategy, personnel decisions, and sensitive deliberations can be inferred even where no single record is compellable.

The aggregation risk compounds across vendors. No single vendor's retention posture addresses the re-identification risk because the risk operates across vendor boundaries. Multi-vendor deployment distributes records across multiple custodians, which reduces concentration risk but increases the aggregation surface available to an LLM-enabled re-identification system.

## Privacy Controls and Content-Level Exposure

The identification methodology did not require access to private infrastructure. It operated on data the subjects believed to be either public-but-attributable or public-but-pseudonymous. The linkage was performed by aggregating writing style, topical patterns, temporal behavior, and incidental identifying details across these layers. Content-telemetry separation, which addresses the division between deliberative content and operational metadata, does not reach this vector because the identifying information is in the content itself. Where content cannot be separated from its re-identifying properties, non-retention is the only operative control.

A subpoena or government demand for records concerning an individual who maintained a pseudonymous presence reaches records the individual did not believe were attributable to them. If an employer's AI tool retained interaction records, and those records are legally producible, the production may reveal identity connections the employee did not know existed.

The question the ETH Zurich study quantifies is whether 68 percent accuracy at 90 percent precision, using only public data and applicable to accounts their owners believed were anonymous, will change how organizations think about the records their AI tools retain, or whether the measurable collapse of pseudonymity will proceed without a governance response until it produces consequences in litigation or regulatory enforcement.

---

## Sources

| Source | Date | Description | URL | |--------|------|-------------|-----| | Paleka, Daniel et al. (ETH Zurich) | March 2026 | De-anonymization study, preprint | | | Uniladtech | March 17, 2026 | Reporting on the study's findings | https://www.uniladtech.com/tech/social-media/anonymous-social-media-accounts-could-be-exposed-by-ai-warned-770657-20260317 |

---

## Amendment Log

*No amendments to date.*

---

The observations presented reflect analytical assessment of publicly available information and do not constitute legal, insurance, or investment advice. Counterpose maintains no formal relationship with any vendor, regulator, or standards body referenced in this publication.

Audit timeline

No governed events are recorded against this record.

Cross-artifact traversal

Fixture-backed links across the rendered public artifacts. This panel is read-only and does not imply a live graph, search index, backend lookup, or verification action. Profile posture: cross-artifact traversal links shown.

Method

Method transparency for this rendered record. The panel surfaces the rules and posture under which the record was rendered, in plain view, alongside the audit chain that produced its current state.

Profile admission boundary

profileCounterpose Publication (counterpose_publication)

Captured Counterpose signal republished from the GARPedia admitted pool. The editorial body is projected verbatim and unverified: GARPedia records that the signal was admitted and renders it, but asserts no independent citation, source, or anchor verification over its claims.

Discovery posture

No discovery posture metadata is recorded for this record.

GARPedia does not run search or retrieval. The fields above are render-time metadata only; they do not configure any indexer.

Control discharge

No control-discharge notes were recorded for this rendered record.

GARPedia does not host operator controls. Mutation, review approval, publication approval, dispute adjudication, and other governed actions live in the GARP Workbench, not on this surface.