LinkML
LinkML is a modeling language for defining data schemas that can generate JSON Schema, SHACL, SQL DDL, Python dataclasses, and more from a single YAML source definition.
LinkML serves as the common schema language for custom metadata schemas in the con/serve ecosystem. By defining schemas in LinkML rather than in ad hoc JSON or YAML, tools gain automatic validation, documentation generation, and interoperability with linked data standards.
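To make the "single source, many outputs" idea concrete, here is a minimal sketch in plain Python. It is not LinkML itself and does not use the LinkML library: the `SCHEMA` dict, the `Citation` class, its fields, and the two helper functions are all hypothetical stand-ins, chosen only to show how one declarative definition can drive several generated artifacts.

```python
# Hypothetical, heavily simplified stand-in for a schema definition.
# Real LinkML schemas are written in YAML and are far richer than this dict.
SCHEMA = {
    "name": "Citation",
    "attributes": {
        "doi": {"range": "string", "required": True},
        "title": {"range": "string", "required": False},
        "year": {"range": "integer", "required": False},
    },
}

# Mapping from the toy schema's range names to target type names.
JSON_TYPES = {"string": "string", "integer": "integer"}
SQL_TYPES = {"string": "TEXT", "integer": "INTEGER"}


def to_json_schema(schema):
    """Derive a JSON Schema object definition from the schema dict."""
    attrs = schema["attributes"]
    return {
        "title": schema["name"],
        "type": "object",
        "properties": {
            name: {"type": JSON_TYPES[spec["range"]]}
            for name, spec in attrs.items()
        },
        "required": [name for name, spec in attrs.items() if spec["required"]],
    }


def to_sql_ddl(schema):
    """Derive a CREATE TABLE statement from the same schema dict."""
    columns = []
    for name, spec in schema["attributes"].items():
        not_null = " NOT NULL" if spec["required"] else ""
        columns.append(f"{name} {SQL_TYPES[spec['range']]}{not_null}")
    return f"CREATE TABLE {schema['name']} ({', '.join(columns)});"


if __name__ == "__main__":
    print(to_json_schema(SCHEMA))
    print(to_sql_ddl(SCHEMA))
```

In practice the generation step would be handled by LinkML's own generators (for example its `gen-json-schema` command) rather than by hand-written functions like these; the sketch only illustrates why keeping one source definition avoids drift between the JSON Schema and the SQL table.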
Used by:
- citations-collector – citation data model aligned with CiTO/FaBiO ontologies
- Experience Ledger – dataset identity and operational knowledge schemas
- concepts.datalad.org – the DataLad metadata vocabulary
Adopting LinkML as the standard schema language means any tool in the ecosystem can consume schemas defined by other tools, validate metadata at ingest time, and produce output that conforms to shared vocabularies.
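Validation at ingest time can be pictured roughly as follows. This is a sketch under stated assumptions, not the LinkML validator: the `SCHEMA` dict, the `validate_record` function, and the sample records (including the placeholder DOI) are hypothetical, and a real deployment would rely on LinkML's own validation tooling against the shared schema.

```python
# Hypothetical, simplified schema; a real LinkML schema would be a YAML file
# published by one tool and consumed by others.
SCHEMA = {
    "name": "Citation",
    "attributes": {
        "doi": {"range": "string", "required": True},
        "title": {"range": "string", "required": False},
        "year": {"range": "integer", "required": False},
    },
}

# Mapping from the toy schema's range names to Python types.
PY_TYPES = {"string": str, "integer": int}


def validate_record(record, schema):
    """Return a list of problems found in record; an empty list means valid."""
    errors = []
    attrs = schema["attributes"]
    for name, spec in attrs.items():
        if name not in record:
            if spec["required"]:
                errors.append(f"missing required field: {name}")
            continue
        if not isinstance(record[name], PY_TYPES[spec["range"]]):
            errors.append(f"{name}: expected {spec['range']}")
    # Reject fields the schema does not declare.
    for name in record:
        if name not in attrs:
            errors.append(f"unknown field: {name}")
    return errors


if __name__ == "__main__":
    # Placeholder records for illustration only.
    good = {"doi": "10.1234/example", "year": 2024}
    bad = {"year": "2024"}
    print(validate_record(good, SCHEMA))
    print(validate_record(bad, SCHEMA))
```

Because every tool checks incoming records against the same definition, a record rejected by one tool would be rejected by the others for the same reason, which is the interoperability benefit described above.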
citations-collector
Discovers citations across CrossRef, OpenCitations, DataCite, and OpenAlex; syncs with Zotero; acquires PDFs with git-annex provenance tracking; and stores everything in a DataLad dataset using a LinkML schema aligned with CiTO and FaBiO ontologies.