Crafting Domain Knowledge: The Pipeline to Accurate Ontology Generation

Saturday 01 February 2025


This article explains the prompt pipeline used in ontology generation. The prompts guide the generation of an ontology that accurately represents a domain and meets specific requirements.


The first set of prompts (1-6) focuses on specifying the ontology’s purpose, scope, target group, intended uses, functional requirements, and non-functional requirements. These prompts establish the foundation for the ontology and ensure it meets the needs of its intended users.
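
One way to picture the output of these six prompts is as a small structured record that later prompts can reuse as context. The sketch below is only an illustration: the class name OntologySpec, its field names, and the rendered text layout are assumptions made here, not the wording or format used in the original pipeline.

```python
from dataclasses import dataclass, field


@dataclass
class OntologySpec:
    """Hypothetical container for the answers to the six specification prompts."""
    purpose: str
    scope: str
    target_group: str
    intended_uses: list[str] = field(default_factory=list)
    functional_requirements: list[str] = field(default_factory=list)
    non_functional_requirements: list[str] = field(default_factory=list)

    def as_context(self) -> str:
        """Render the specification as plain text to prepend to later prompts."""
        def bullets(items: list[str]) -> str:
            # Fall back to a placeholder so every heading has a body.
            return "\n".join(f"- {item}" for item in items) or "- (none given)"

        return (
            f"Purpose: {self.purpose}\n"
            f"Scope: {self.scope}\n"
            f"Target group: {self.target_group}\n"
            f"Intended uses:\n{bullets(self.intended_uses)}\n"
            f"Functional requirements:\n{bullets(self.functional_requirements)}\n"
            f"Non-functional requirements:\n{bullets(self.non_functional_requirements)}"
        )
```

Keeping the specification in one place means each later stage can be given the same context without restating it by hand.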


The next set of prompts (7-11) involves generating triples based on competency questions, entities, and properties. This step is crucial in creating a comprehensive ontology that accurately represents the domain’s concepts and relationships.
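
To make the step from competency question to triples concrete, here is a minimal sketch. It assumes a made-up question ("Where does an organism live?") that has already been reduced to two classes and one linking property. The ex: namespace, the helper names, and the one-triple-per-line output are all assumptions for illustration; in the pipeline itself the language model produces the triples.

```python
from dataclasses import dataclass

# Prefixes used in the output; "ex" is a hypothetical namespace.
PREFIXES = {
    "ex": "http://example.org/onto#",
    "owl": "http://www.w3.org/2002/07/owl#",
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "rdfs": "http://www.w3.org/2000/01/rdf-schema#",
}


@dataclass(frozen=True)
class Triple:
    subject: str
    predicate: str
    obj: str


def triples_for_question(domain_class: str, prop: str, range_class: str) -> list[Triple]:
    """Declare two classes and the object property that links them."""
    return [
        Triple(f"ex:{domain_class}", "rdf:type", "owl:Class"),
        Triple(f"ex:{range_class}", "rdf:type", "owl:Class"),
        Triple(f"ex:{prop}", "rdf:type", "owl:ObjectProperty"),
        Triple(f"ex:{prop}", "rdfs:domain", f"ex:{domain_class}"),
        Triple(f"ex:{prop}", "rdfs:range", f"ex:{range_class}"),
    ]


def to_turtle(triples: list[Triple]) -> str:
    """Serialize the triples as simple one-statement-per-line Turtle."""
    header = [f"@prefix {prefix}: <{iri}> ." for prefix, iri in PREFIXES.items()]
    body = [f"{t.subject} {t.predicate} {t.obj} ." for t in triples]
    return "\n".join(header + [""] + body)


if __name__ == "__main__":
    print(to_turtle(triples_for_question("Organism", "livesIn", "Habitat")))
```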


Prompts 12-13 focus on introducing data properties and populating the ontology with meaningful individuals. These steps add depth and specificity to the ontology, enabling it to capture nuanced details about the domain.
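
As a rough illustration of what a populated individual with data properties could look like, the sketch below writes one individual as Turtle with typed literals. The ex: names, the sample values, and the choice of XSD datatypes are invented for this example, and the output assumes the ex:, rdf:, owl: and xsd: prefixes are declared elsewhere in the file.

```python
def turtle_literal(value) -> str:
    """Map a Python value to a Turtle literal (bool is checked before int)."""
    if isinstance(value, bool):
        return f'"{str(value).lower()}"^^xsd:boolean'
    if isinstance(value, int):
        return f'"{value}"^^xsd:integer'
    if isinstance(value, float):
        return f'"{value}"^^xsd:double'
    # Plain string: escape backslashes, quotes and newlines.
    escaped = str(value).replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")
    return f'"{escaped}"'


def individual(name: str, cls: str, data: dict) -> str:
    """Write one named individual with its data property values."""
    lines = [f"ex:{name} rdf:type owl:NamedIndividual , ex:{cls}"]
    lines += [f"    ex:{prop} {turtle_literal(value)}" for prop, value in data.items()]
    return " ;\n".join(lines) + " ."


if __name__ == "__main__":
    # Hypothetical individual and property names.
    print(individual("sample_001", "Sample", {"temperatureCelsius": 37.0, "isSterile": True}))
```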


Prompt 14 adds triples related to the ontology’s IRI, label, version, and description in natural language. This step provides context and metadata about the ontology itself.
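
A sketch of such a metadata block follows. Using rdfs:label, owl:versionInfo and rdfs:comment for the label, version and description is an assumption made here; the source does not say which vocabulary terms the pipeline uses, and the example IRI is a placeholder. Prefix declarations are assumed to exist elsewhere in the file.

```python
def quote(text: str) -> str:
    """Escape a string for use as a Turtle literal."""
    escaped = text.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")
    return f'"{escaped}"'


def ontology_header(iri: str, label: str, version: str, description: str, lang: str = "en") -> str:
    """Build the triples that describe the ontology itself."""
    return "\n".join([
        f"<{iri}> rdf:type owl:Ontology ;",
        f"    rdfs:label {quote(label)}@{lang} ;",
        f"    owl:versionInfo {quote(version)} ;",
        f"    rdfs:comment {quote(description)}@{lang} .",
    ])


if __name__ == "__main__":
    print(ontology_header(
        "http://example.org/onto",  # placeholder IRI
        "Example Ontology",
        "0.1.0",
        "A toy ontology used for illustration.",
    ))
```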


Prompts 15-16 refine the ontology’s structure by adding descriptions for classes and properties using the annotation property rdfs:comment. These prompts also provide examples from existing resources to improve the ontology’s complexity and hierarchical structure.
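
A simple way to see whether this step has done its job is to list the classes and properties that still lack an rdfs:comment. The check below is a sketch that is not part of the described pipeline. It assumes the ontology is available as (subject, predicate, object) tuples with prefixed names.

```python
# Entity kinds that are expected to carry a description.
DESCRIBED_TYPES = {"owl:Class", "owl:ObjectProperty", "owl:DatatypeProperty"}


def missing_comments(triples: list[tuple[str, str, str]]) -> list[str]:
    """Return declared classes and properties that have no rdfs:comment."""
    declared = {s for s, p, o in triples if p == "rdf:type" and o in DESCRIBED_TYPES}
    commented = {s for s, p, _ in triples if p == "rdfs:comment"}
    return sorted(declared - commented)


if __name__ == "__main__":
    example = [
        ("ex:Organism", "rdf:type", "owl:Class"),
        ("ex:Organism", "rdfs:comment", '"A living thing."'),
        ("ex:livesIn", "rdf:type", "owl:ObjectProperty"),
    ]
    print(missing_comments(example))
```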


The final set of prompts (17-19) involves verifying the ontology against syntax errors, inconsistencies, and pitfalls. This step ensures that the generated ontology is free from common issues and adheres to proper RDF/Turtle syntax.
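
The source does not list the individual pitfalls that are checked, so the sketch below picks one plausible example as an assumption: properties declared without a domain or a range. It works on (subject, predicate, object) tuples and does not validate syntax. A real syntax check would need a proper Turtle parser, which is not shown here.

```python
PROPERTY_TYPES = {"owl:ObjectProperty", "owl:DatatypeProperty"}


def properties_missing_domain_or_range(
    triples: list[tuple[str, str, str]],
) -> dict[str, list[str]]:
    """Map each declared property to the axioms (domain, range) it lacks."""
    properties = {s for s, p, o in triples if p == "rdf:type" and o in PROPERTY_TYPES}
    with_domain = {s for s, p, _ in triples if p == "rdfs:domain"}
    with_range = {s for s, p, _ in triples if p == "rdfs:range"}

    report = {}
    for prop in sorted(properties):
        missing = [
            axiom
            for axiom, seen in (("rdfs:domain", with_domain), ("rdfs:range", with_range))
            if prop not in seen
        ]
        if missing:
            report[prop] = missing
    return report


if __name__ == "__main__":
    example = [
        ("ex:livesIn", "rdf:type", "owl:ObjectProperty"),
        ("ex:livesIn", "rdfs:domain", "ex:Organism"),
    ]
    print(properties_missing_domain_or_range(example))
```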


Taken together, the prompt pipeline offers a structured route to generating accurate ontologies.


Cite this article: “Crafting Domain Knowledge: The Pipeline to Accurate Ontology Generation”, The Science Archive, 2025.


Ontology Generation, Prompt Pipeline, Competency Questions, Entities, Properties, Triples, RDF/Turtle Syntax, Metadata, Annotation Property, rdfs:comment


Reference: Nadeen Fathallah, Steffen Staab, Alsayed Algergawy, “LLMs4Life: Large Language Models for Ontology Learning in Life Sciences” (2024).

