You built a website. Good design, real content, structured data, the works. You even added JSON-LD Person schema. You did everything the SEO blogs told you to do.

And when someone asks ChatGPT about your field, you do not show up.

Someone with a thinner website does. Someone with fewer credentials. Someone who, by every conventional metric, should rank below you.

What happened?

They exist on platforms that AI systems already trust. You don't.

This is not about SEO. This is not about backlinks. This is about the infrastructure layer underneath all of that. The layer where AI models decide what is safe to cite and what is not.

Three platforms sit at the core of that layer: ORCID, Zenodo, and OSF. If you are building entity infrastructure and you are not on these three, you have a gap. A real one.

Why AI systems have trust preferences

Large language models do not index the web the way Google does. They are trained on corpora. Massive datasets assembled from sources that the training team determined were reliable enough to include.

This distinction matters.

Google crawls everything, then ranks it. AI training teams select sources first, then train on them. The filtering happens before training, not after.

Which means: if your content does not exist on a source that made it into the training data, you are invisible. Not poorly ranked. Invisible. The model literally does not know you exist.

Now. Which sources make it into training data?

Academic repositories. Government databases. Established institutional platforms. Sources with persistent identifiers, structured metadata, and institutional backing. Sources where the cost of publishing bad information is high enough that the training team can assume a baseline of credibility.

Your personal website, no matter how well built, does not clear that bar. Not because it is bad. Because it has no institutional backing. No persistent identifier system. No review process. No organizational reputation at stake.

It is just you, saying things about yourself.

Key concept: AI trust is not earned through design or content quality alone. It is inherited from the platforms where your identity and work are registered. Institutional backing is a proxy signal that AI training pipelines use to filter sources.

The trust hierarchy, visualized

Not all platforms carry equal weight with AI systems. The distinction is not binary (trusted vs. untrusted) but a spectrum. Institutional affiliation, persistent identifiers, structured metadata, and open-access policies all contribute.

The chart below is illustrative, not a literal score from any AI company. But it reflects patterns I have observed across entity infrastructure work for three companies. Platforms with DOIs and institutional governance consistently outperform personal publishing when it comes to AI citation and training data inclusion.

Illustrative trust signal strength by platform type. Based on observed patterns in AI training data inclusion and citation behavior.

The gold bars are the three platforms this essay focuses on. The green bar (Google Scholar) is respectable but not user-controlled. The gray bars are where most people put their effort. That gap between 30 and 90 is the gap between "I claim things about myself" and "institutions vouch for my existence."

Platform 1: ORCID

What it is

ORCID (Open Researcher and Contributor ID) is a global persistent identifier for individuals. Think of it as a Social Security number for intellectual output. It is a 16-digit code that uniquely identifies you across every system that uses it.

Your ORCID page is not a social profile. It is a machine-readable identity record. It links to your publications, datasets, affiliations, and funding. And it is read by DataCite, CrossRef, Scopus, Web of Science, and the metadata systems that feed AI training pipelines.

Who runs it

ORCID is a non-profit organization. Its members include over 1,300 institutions: universities, publishers, government agencies, and research funders. Harvard, MIT, Nature Publishing, the European Commission, the NIH. These are not sponsors. They are governance participants. They fund the system because they rely on it.

When an AI training pipeline encounters an ORCID, it encounters an identifier backed by institutional consensus. Not a username. Not a handle. A governed identity.

Why AI trusts it

Three reasons.

First, ORCID eliminates the "John Smith problem." When training data contains ambiguous author names, models cannot reliably attribute work. ORCID resolves this. One person, one ID, across all publications. Clean attribution means clean training data.

Second, ORCID integrates with the metadata infrastructure that AI depends on. DataCite, CrossRef, Scopus. These are not optional systems. They are the plumbing of scholarly communication. If your work is registered through these systems with your ORCID attached, it enters the data supply chain that feeds models.

Third, ORCID records are curated. You control what appears on your profile, but the integrations verify claims against institutional records. If you list an affiliation, the institution can confirm it. If you list a publication, the DOI confirms it. The platform architecture creates a verification loop that personal websites cannot replicate.

How to get on it

Register at orcid.org/register. Free. Takes five minutes. You do not need to be an academic. You need to have produced intellectual work that you want to be discoverable.

After registration: connect your ORCID to any institution where you have published or worked. Add your works manually or through integration with DataCite, CrossRef, or Scopus. Enable the DataCite and CrossRef auto-update so new DOI-linked works appear automatically.

The most important step: use your ORCID everywhere. In Zenodo uploads. In OSF profiles. In conference submissions. In author bios. The more systems reference your ORCID, the stronger the identity signal becomes.

Platform 2: Zenodo

What it is

Zenodo is a general-purpose open-access repository. You can upload datasets, papers, presentations, software, reports, methodologies. Anything that constitutes intellectual output. Every upload gets a DOI, a persistent identifier that will resolve for as long as the system exists.

Zenodo is not a journal. There is no peer review. You upload, you get a DOI, it is publicly accessible. That is the point. It is an archive, not a gatekeeper.

Who runs it

CERN. The European Organization for Nuclear Research. The people who built the Large Hadron Collider. The people who invented the World Wide Web.

Zenodo runs on CERN's data infrastructure, funded by the European Commission through the OpenAIRE program. Your uploads are stored in the same data center that handles particle physics experiments. The system was built to last decades, not fiscal quarters.

This matters for AI trust because permanence is a signal. AI training teams know that a Zenodo DOI will resolve in five years. They cannot say the same about your WordPress blog.

Why AI trusts it

Zenodo checks every box that AI training pipelines care about.

Persistent identifiers. Every record gets a DOI via DataCite. DOIs are the currency of scholarly citation. They feed into CrossRef Event Data, which tracks citations across the entire academic web. When AI models are trained on academic literature, DOI-linked content is preferentially included because it is traceable.

Structured metadata. Zenodo's metadata is DataCite-compliant. Title, creators, description, keywords, license, related identifiers. Machine-readable, standardized, interoperable. This is exactly the format AI systems need to extract reliable entity information.

FAIR compliance. Findable, Accessible, Interoperable, Reusable. Zenodo was designed from the ground up to meet FAIR principles. These principles are not just academic ideals. They are the operational criteria that data curators use when assembling AI training datasets. FAIR-compliant sources get included. Non-compliant sources get skipped.

Institutional permanence. CERN has existed since 1954. It is funded by 23 member states. When an AI training team evaluates whether a source will remain stable, CERN-backed infrastructure ranks near the top. Your website is backed by your credit card. Zenodo is backed by a treaty organization.

How to get on it

Go to zenodo.org. Sign in with your ORCID (this is where the cross-referencing begins). Upload a work. It can be a whitepaper, a dataset, a methodology document, a presentation, a software release.

Key practices for maximum entity value:

  • Use your full legal name, consistently, across all uploads
  • Attach your ORCID to every upload
  • Fill in the "Related Identifiers" field to link your Zenodo records to each other and to external publications
  • Use the "Communities" feature to place your work in relevant subject communities
  • Choose an appropriate license (CC BY 4.0 is standard for non-software)
  • Upload versioned releases if you update your work. Zenodo supports DOI versioning.

You do not need to be publishing academic papers. A documented methodology is a valid upload. A structured dataset from your professional work is a valid upload. A conference presentation with real content is a valid upload. The barrier is not prestige. The barrier is documentation.

Platform 3: OSF (Open Science Framework)

What it is

OSF is a research management platform. It provides project workspaces where you can store files, link repositories, manage collaborators, create registrations, and host preprints. Think of it as a structured portfolio for intellectual work, with version control and DOI assignment built in.

Where Zenodo is an archive (you upload finished artifacts), OSF is a workspace. You can use it to document ongoing projects, publish working papers, share datasets, and create public registrations of your research or methodology.

Who runs it

The Center for Open Science (COS), a non-profit based in Charlottesville, Virginia. COS was founded in 2013 with funding from the Arnold Foundation and other philanthropic sources. Its mission is to increase the openness, integrity, and reproducibility of research.

COS operates on the same principle as ORCID and Zenodo: institutional non-profit governance, community-driven development, no commercial incentive to manipulate visibility or access.

Why AI trusts it

OSF occupies a specific niche in the trust hierarchy. It is the platform that connects the dots.

Cross-platform integration. An OSF project can link to your ORCID profile, your Zenodo uploads, your GitHub repositories, your Google Drive, your Dropbox. It becomes a central node that connects all your distributed work into one verifiable graph. For AI systems that are trying to build entity profiles, this is gold. One OSF project page can confirm that a specific ORCID holder produced a specific dataset on Zenodo and maintains specific code on GitHub. That is a multi-source verification chain.

Preprint servers. OSF hosts multiple preprint services across disciplines. Preprints are frequently included in AI training data because they are openly accessible, DOI-registered, and structured. Publishing a preprint on an OSF-hosted server puts your work directly in the path of training data collection.

Registrations. OSF allows you to create timestamped, immutable registrations of your work. This is a trust signal that most platforms cannot offer. A registration proves that a specific methodology or dataset existed at a specific point in time. For AI systems evaluating the provenance of information, timestamp verification matters.

Structured project metadata. Every OSF project has structured fields for contributors (linked to ORCID), tags, descriptions, licenses, and related resources. This metadata is harvestable. It feeds into the same data ecosystem that ORCID and Zenodo participate in.

How to get on it

Register at osf.io. Free. Link your ORCID immediately.

Create a project for each major body of work. For me, that means separate projects for: the Trust Chain methodology, the entity infrastructure documentation for my companies, and the bookbinding conservation work at Hibrkraft.

Key practices:

  • Make projects public once they contain substantive content
  • Link every relevant external resource (Zenodo DOIs, GitHub repos, published articles)
  • Use components to organize sub-areas within a project
  • Tag projects with relevant keywords (these feed into OSF's search and metadata harvest)
  • Create registrations for methodologies you want to timestamp

The platform comparison

Attribute ORCID Zenodo OSF
Primary function Persistent person identifier Open-access archive with DOIs Research project management and preprints
Operator ORCID Inc. (non-profit, 1,300+ member institutions) CERN / OpenAIRE (European Commission funded) Center for Open Science (non-profit)
Persistent identifier ORCID iD (16-digit) DOI via DataCite DOI + GUID per project
Metadata standard ORCID schema, DataCite integration DataCite Metadata Schema Dublin Core, DataCite compatible
AI trust factor Disambiguates identity across all systems FAIR-compliant, DOI-registered, CERN-backed permanence Cross-links distributed work, timestamped registrations
Who should use it Everyone with intellectual output Anyone with documents, datasets, or methods to archive Anyone managing multi-part projects or collaborations
Cost Free Free (up to 50 GB per record) Free (5 GB default, expandable)
Time to set up 5 minutes 10 minutes per upload 15 minutes per project
Academic required? No No No

How the three platforms cross-reference

The real power of ORCID, Zenodo, and OSF is not what each one does individually. It is how they connect. Each platform references the others through persistent identifiers, creating a verification graph that AI systems can traverse.

This is the same principle behind the Trust Chain Methodology. Identity, evidence, entity, velocity. These three platforms operationalize the first three layers in a way your website alone cannot.

graph TD A["ORCID
Person Identity"] -->|"creator ID"| B["Zenodo
Archived Works + DOIs"] A -->|"contributor profile"| C["OSF
Project Workspace"] B -->|"related identifier"| C C -->|"linked resource"| B B -->|"DataCite auto-update"| A C -->|"ORCID integration"| A C -->|"GitHub link"| D["GitHub
Code Repositories"] B -->|"GitHub integration"| D D -->|"release archive"| B A -->|"cited in"| E["CrossRef / DataCite
Global Citation Graph"] B -->|"DOI registered"| E E -->|"feeds"| F["AI Training Data
LLM Corpora"] A -->|"entity signal"| G["Your Website
JSON-LD Schema"] G -->|"sameAs link"| A style A fill:#222221,stroke:#c8a882,color:#ede9e3 style B fill:#222221,stroke:#c8a882,color:#ede9e3 style C fill:#222221,stroke:#c8a882,color:#ede9e3 style D fill:#222221,stroke:#6b8f71,color:#ede9e3 style E fill:#222221,stroke:#6b8f71,color:#ede9e3 style F fill:#191918,stroke:#c8a882,color:#c8a882,stroke-width:2px style G fill:#222221,stroke:#8a8478,color:#ede9e3

Follow the arrows. Your ORCID identifies you as the creator on Zenodo and as a contributor on OSF. Zenodo registers your DOIs with DataCite, which feeds CrossRef Event Data, which is harvested by AI training pipelines. OSF links to both your ORCID and your Zenodo records, creating a second verification path. GitHub integrates with Zenodo for software releases, adding a third path.

Your website sits at the edge of this graph, connected through JSON-LD sameAs links. It matters. But it matters as a node in a network, not as the network itself.

This is what I mean by closed-loop entity infrastructure. Every platform confirms the others. Every persistent identifier cross-references. The AI system does not need to trust any single source. It verifies across all of them.

What this means for non-academics

I can already hear the objection. "These are academic platforms. I am not an academic. This does not apply to me."

It does.

None of these platforms require academic affiliation. ORCID explicitly states that it serves "anyone engaged in research, scholarship, or innovation." Zenodo accepts uploads from anyone. OSF has no institutional requirement.

The word "research" is the sticking point. People think it means lab coats and grant proposals. It does not. Research means documented inquiry. If you have systematically investigated a problem and documented what you found, you have done research.

I run three companies. None of them are universities. But the work I do, designing systems across engineering, publishing, and search, is documented, structured, and verifiable. That makes it eligible for these platforms.

Here is what I have uploaded or plan to upload:

  • Trust Chain methodology documentation on Zenodo. A structured description of the four-layer framework I use across all three companies. It gets a DOI. It is citable. It is findable.
  • Entity infrastructure case studies on OSF. Documented examples of how entity verification works in practice, across different industries and company sizes.
  • Bookbinding conservation methods on Zenodo. The techniques Hibrkraft uses for leather journal construction and book repair. Documented methods with photos and specifications.
  • All of the above linked through ORCID. One identity, multiple outputs, institutional-grade persistence.

You do not need to be a professor. You need to be a practitioner who documents.

The window is open. Not forever.

This is the part where timing matters. As I wrote in the two-year window essay, the period between now and mid-2028 is when AI systems are most actively building their entity databases. The models are being trained. The knowledge graphs are being constructed. The sources are being selected.

If your identity and work exist on ORCID, Zenodo, and OSF during this period, you enter the training data. Once you are in, you are in. Models trained on data that includes your DOI-linked work will cite you. Models that missed you will not.

This is not a gradual decline in opportunity. It is a threshold. Either you are in the training data or you are not. And the window for getting in is right now.

The cost of entry is zero dollars and a few hours of documentation work. The cost of missing the window is permanent invisibility to an entire generation of AI systems.

I would do the documentation work.

A practical sequence

If you are starting from nothing, here is the order that makes sense.

Week 1: ORCID. Register. Fill out your profile completely. Add your employment history, education, and any existing works. This takes 30 minutes if you have your information ready.

Week 2: Zenodo. Upload your first record. Start with whatever you have: a methodology document, a whitepaper, a dataset, a presentation. Link your ORCID. Fill in all metadata fields. Get your first DOI.

Week 3: OSF. Create your first project. Link your ORCID and your Zenodo record. Add context: what the project is, what it contributes, who it serves. Make it public.

Week 4: Cross-reference. Update your ORCID to include your Zenodo DOI and your OSF project. Update your website's JSON-LD Person schema to include sameAs links to all three profiles. Run a Trust Chain audit to verify that all links resolve correctly.

Ongoing: Velocity. Upload new work regularly. Every significant document, dataset, or method goes to Zenodo. Every project goes to OSF. ORCID updates automatically through DataCite integration. The cycle compounds.

What about LinkedIn and Medium?

They are fine. Use them. But understand what they are and what they are not.

LinkedIn is a social network owned by Microsoft. Its data is proprietary. It can change access rules, paywall content, or restructure profiles at any time. AI training teams may include some LinkedIn data, but they cannot rely on it being stable or openly accessible. LinkedIn serves discovery. It does not serve verification.

Medium is a publishing platform owned by a private company. Content can be paywalled, deleted, or restructured. There is no persistent identifier system. No DOI. No structured metadata beyond basic HTML. Medium is a distribution channel. It is not infrastructure.

ORCID, Zenodo, and OSF are infrastructure. They are governed by non-profits with institutional mandates. They use persistent identifiers registered in global systems. They publish structured metadata in standardized formats. They are designed to last decades.

Use LinkedIn for networking. Use Medium for reach. Use ORCID, Zenodo, and OSF for existence.

The uncomfortable truth

Most people will not do this. Not because it is hard. Because it does not feel like marketing.

There is no dopamine hit from uploading a methodology document to Zenodo. No likes. No comments. No viral moment. You get a DOI and a metadata record. That is it.

But that DOI is now registered in DataCite's global registry. It is harvestable by CrossRef Event Data. It is discoverable by AI training pipelines. It is permanently linked to your ORCID. It will resolve in 2036 when half the websites built today have disappeared.

Entity infrastructure is not exciting work. It is not supposed to be. It is plumbing. And plumbing is what makes buildings functional, not paint.

I am a practitioner running three companies across engineering, publishing, and search. I build this infrastructure because I have seen what happens when you don't. You become a ghost in the machine. Present on the web, absent from AI. Findable by humans who already know your name, invisible to the systems that are increasingly doing the finding for them.

Three platforms. Zero cost. A few hours of setup. And you stop being invisible.

Your call.


Frequently Asked Questions

Do I need to be an academic researcher to use ORCID, Zenodo, or OSF?

No. All three platforms are open to anyone who produces intellectual work. ORCID serves "anyone engaged in research, scholarship, or innovation." Zenodo accepts uploads from any individual or organization. OSF has no institutional requirement. If you document methods, produce datasets, write whitepapers, or create structured knowledge in any professional capacity, you are eligible.

What should I upload to Zenodo if I am not publishing academic papers?

Methodology documents, whitepapers, industry reports, datasets, presentations, technical specifications, case studies, software releases. Anything that represents structured intellectual output. The key requirement is that it is documented well enough to be useful to others. A 10-page methodology document with clear structure and real content is a valid Zenodo upload. A vague capability statement is not.

How do these platforms actually get into AI training data?

AI training teams assemble corpora from sources they evaluate as reliable. Academic repositories like Zenodo are included because they meet FAIR principles (Findable, Accessible, Interoperable, Reusable), use persistent identifiers (DOIs), and have institutional backing. The metadata from these platforms feeds into DataCite and CrossRef, which are primary data sources for academic training corpora. Content linked through these systems is more likely to be included in training data than content on personal websites or social media platforms.

Does having an ORCID directly improve my Google ranking?

Not directly. ORCID does not influence traditional Google Search rankings. But it contributes to entity disambiguation, which affects how Google's Knowledge Graph and AI systems identify you. When your ORCID is linked in your website's JSON-LD Person schema via a sameAs property, it helps Google connect your website identity to your scholarly identity. This cross-referencing strengthens your entity profile across Google's systems, including AI Overviews.

How long does it take to see results from this kind of entity infrastructure work?

Months, not weeks. AI training cycles happen on the order of months to years. A DOI registered today may enter the next training data refresh in 3 to 12 months. The value is cumulative and long-term. This is infrastructure work, not a growth hack. The right comparison is not "how quickly will this boost my traffic" but "will AI systems know I exist in two years." If you start now, the answer is yes.

References

  1. ORCID. "What is ORCID." ORCID Inc., 2024. info.orcid.org/what-is-orcid
  2. Zenodo. "FAIR Principles." CERN / OpenAIRE, 2024. about.zenodo.org/principles
  3. Center for Open Science. "OSF." COS, 2024. cos.io/products/osf
  4. CERN Scientific Information Service. "Why use persistent identifiers?" CERN Library, 2020. sis.web.cern.ch
  5. Semrush. "AI Search Trust Signals: The Practical Audit (2026 Guide)." Semrush Blog, 2025. semrush.com/blog/ai-search-trust-signals

Linked from

Related notes

2026-03-28

The companies that show up in ChatGPT are the ones that bothered to be verifiable.