Abstract
Keywords
(Hucklenbroich, 2014)
Introduction
Disease is a central focus in biological and medical communities, a challenge for ontological analysis, and a concern of the utmost importance for all persons. Forming a general description, if not a definition, of
This is not to say that there should be a single definition, for it may do more harm than good (Hesslow, 1993). As a general term or concept,
Within biomedical informatics disciplines, there is an apparent need to manage disease-related biological data, establish clear vocabularies therein, and form computational models that accurately reflect clinical and general biomedical knowledge. Ontological engineering may support clinicians and researchers of disease toward those goals. Commonly cited potential benefits include: reduction of ambiguity; data sharing; annotation; semantic interoperability; automated reasoning; and clinical decision support. There is also potential to advance the education and pedagogy of disease etiology and development. Consider disease ontology and disease data visualization applications, for instance. Philosophical ontology and analysis contribute by providing concept analysis and development, logical rigor and general insights into understanding the pathological phenomena (or our conceptualizations of them) generically referred to as ‘disease’.
The River Flow Model of Diseases (RFM for short), originally introduced in Mizoguchi et al. (2011), is an ontological theory of disease. The RFM has been developed using the HOZO (2013) ontology editor. RFM classes are part of YAMATO: Yet Another More Advanced Top-level Ontology (Mizoguchi, 2010), which is part of the Japan Medical Ontology Development Project for Advanced Clinical Information Systems (Imai et al., 2009; Clinical Medical Ontology, 2013). Both ontological models are under development and no claims to completeness are made. The RFM is a causal account of disease for use in biomedical ontology and clinical settings, wherein medical practitioners, informaticians, data modelers, and ontologists organize, retrieve, and reason over the growing (Mihăilă et al., 2013) sea of biomedical data. Kozaki et al. (2015) describe an application of the RFM, an online graphical user interface (Disease Ontology, 2013; Disease Chain LD, 2014) displaying the causal structure of various diseases in terms of causal chains. The RFM aims to provide a general account that is consistent with clinicians explicit and intuitive beliefs on causality by employing the concept of
Mizoguchi et al. (2011) define
That
Given this, and given that causal thought and language are fundamental, widespread, practical, and explanatory, employing causal notions or categories in an ontological account of disease is a step in the right direction toward comprehensible, useful and scientifically accurate descriptions and formal representations. This approach provides an intuitive and accurate way of conceptualizing and describing disease entities. It reflects and appeals to our knowledge and experience of causality in the world; and the fundamentality and centrality of causal reasoning (Lagnado, 2011). “Causality is something we must reason with constantly in life” (Kleinberg & Hripcsak, 2011, p. 3).
This paper stresses the importance of representing causality in ontological models of disease. In so doing, we aim to contribute to the discussion of causality in ontology development. Within this context, we clarify and develop the RFM. An updated
The paper is divided as follows. Section 2 introduces top-level categories adopted by the RFM/YAMATO. Section 3 presents a philosophical discussion of causality and causal chains independent of any particular theory, but focusing on disease causation. Section 4 explains the RFM river analogy. Section 5 unpacks the RFM disease definition. A revised definition is presented, whereby a disease is described as interrelated causal chains of pathological entities in an organism. We also demonstrate similarities between the RFM and another ontological theory of disease, discussing strategies for interoperability between the two. Section 6 offers two generic (theory-independent) definitions of disease. Finally, concluding remarks are in Section 7.
The word ‘entity’ is used to encompass all that can or does exist at least at the particular level: objects (and their parts), processes, properties, qualities, attributes, features, functions, systems, states, etc. ‘Biological entity’ and ‘pathological entity’ are similarly used, but constrained to the domains of biology and pathology. ‘Pathological’, like ‘abnormal’ is undefined, but associated with (potential) harm, dysfunction, poor health, etc. Pathological entities, then, include: disorders; pathological states, processes, features; etc. Double quotes are used for quotations from references, single quotes for individual words or phrases. Key phrases may be either italicized or in bold. Capitalized, italicized and bold relational terms mark (existing or candidate) ontological predicates (relations). Bold, camel-cased category terms mark (existing or candidate) ontological categories.
Background concepts
Some widely used philosophical categories and distinctions, including the universal–particular distinction (Simon et al., 2006) are adopted. Similarity and generality have historically been distinguished from particularity, the former being described using the notion of universals (categories, kinds, types, classes), the latter with particulars (individuals, tokens, instances). Heart disease, as a general, universal or repeatable entity, is distinguished from a specific case of heart disease affecting an individual human being.
The domain-neutral ontological categories, These categories reflect the traditional metaphysical perspectives of Endurantism and Perdurantism (Lowe, 2002, p. 49), philosophical theories on persistence that classify entities in the world as essentially either kinds of objects or kinds of processes. Borgo and Mizoguchi (2014) present a first-order formalization of
These ontological perspectives are considered insufficient by themselves to accurately represent the world. Neither is ontologically prior to the other. Rather, each is dependent on the other (Galton & Mizoguchi, 2009, p. 72). Object and process – or object-like and process-like aspects – co-exist (Mizoguchi, 2004) and are existentially dependent on one another. They are
Using distinctions from (Galton & Mizoguchi, 2009), a given object is described as being the “interface between internal and external processes”. This description is insightful because it communicates our dynamic and changing world, and emphasizes the interconnectedness and mutual interdependence between what are called
The next section is a non-exhaustive3 Counterfactual dependence, for example, is often associated with causation, but not discussed here.
The literature on causation is extensive4 See Schaffer (2008) and Causation Annotated Bibliography (2008) for a sample.
Causation predicate, its inverse, and candidate domain and range
Common formal properties of a causation relation
A causation relation is typically understood as having the following formal properties (Table 2), both of which are consistent with, but do not exhaust, causal scenarios at biological scales. Both are translatable into a logical formalism, such as first-order predicate logic. Here, they are implicitly universally generalized (∀).
In what follows ‘→’ signifies implication, ‘¬’ negation, ‘∧’ conjunction, ‘∃’ the existential quantifier (read: there exists), ‘<’ temporal precedence is undefined here but can be interpreted along the lines of Allen’s Interval Calculus (Allen, 1983), and ‘A#’ enumerates causal axioms.
If one desires an irreflexive causal relation, communicating the idea that the relata do not cause themselves, then we have (A3):
Philosophical theories posit general or type-level causation holding between domain-neutral categories, such as Event, Process, Property, State and Fact. They also posit instance- or token-level causation between instances of these categories. However, there is no philosophical agreement on “how to relate the type and token levels” (Kleinberg & Hripcsak, 2011, p. 3). Given that YAMATO classes quantify over particulars (Borgo & Mizoguchi, 2014), at least a token-level causal account is to be formulated for the RFM.
Generally speaking, one can formulate distinct theories in which domain-specific entities of different ontological categories are candidate
The question of the type of causal relata remains open in the causation literature. However, relata
If the domain and range of a causal predicate is, indeed, constrained to
Change is implied in production and causation. That something caused something else involves one or more changes, but exactly what kind of change varies.
Furthermore, biomedical investigation into disease informs us that there exist various
The foregoing causal considerations, among others, should be taken into account when representing disease causation and ontology. A generic bio-causal account should also address the
Given that causation involves recurrence and regularity, it is reasonable to believe that the category of
Biological activity and the stability of entire living systems have been described as being dependent upon the interdependent and interconnected character of separate processes and events (Lillie, 1940, p. 331). That is, the repeatability of
Various, if only partial, pattern-based descriptions of disease types are conceivable: disease as (Hucklenbroich, 2014)5 Hucklenbroich considers
We should note, however, that one can question whether there is mind-independent unification between all the parts/elements of a pattern (and causal chain) that would justify the characterization as
The ordinary idea of a causal chain (C-C for short) and its application in discourse conveys a connected and ordered sequence of entities, and is often metaphorical. C-C’s are typically conceived of as some occurrence causing another, which then causes another, and so on over time. Visual representations of causal chains often take the form of unidirectional arrows connecting alphanumeric characters or shapes. The latter, also called ‘nodes’, represent causal relata, whereas arrows represent causal relations. A C-C is not a mere sequence or succession of occurrences just as the relationship between cause and effect is neither mere correlation nor regular constant conjunction. Rather, the connection is a stronger relationship between that which the links in the chain represent (the causal relata). In a sense, C-Cs are
Both C-Cs and sequences involve a following of one entity after another; they are ordered and have direction. The
By contrast, a literal chain taken end for end does not have a preferred direction. It exhibits directional symmetry such that opposing spatial paths can be traced along it. This is due to the compositional structure of the chain: being a material (and thus spatially-extended) object. A C-C is representative of a spatio-temporal path from the beginning of the causal sequence to the end. Many causal chains, then, are lawfully connected, ordered and directed sequences of entities. For example,
Each link (node) in a C-C may be viewed as the cause/effect for the subsequent/preceding link.6 As such, Cause and Effect categories may be modeled as
These links need not be interpreted as representing discrete (rather than continuous) causal relata. Just as the links in an actual chain overlap one another through the holes they form, so can the relata overlap. A causal scenario may involve continuous, simultaneously occurring, causally-connected or overlapping processes with vague (if any) spatio-temporal boundaries. This is often the case in biology where crisp spatio-temporal boundaries are difficult to discern, and where complex causal interactions takes place over time.
Toward formalizing a generic Causal Chain class (C-C FOL), a first-order formalization is as follows. For the moment, assume relata are processual entities, ignore temporal arguments, and keep the parent class of Causal Chain open. Lower case letters signify individuals, bold terms classes, and italicized terms relations. ‘instance_of’ signifies the instantiation relation between classes and their instances. (C-C FOL) says that instances of causal chains have occurrent parts that cause one another in some manner, sequence, etc.
To the extent that the relata are parts of a C-C, the mereological relation of Depending on the account, what actually overlaps may be causal processes, a temporal part, an interval, etc.
Now, if we do not assume parts are the same type as the whole, then another relation is needed. If
For example, assuming occurrent relata: a continuant C-C is
Finally, the concept of disease C-Cs appears similar to that of biological pathways.
A representation of the full causal complexity inherent in biology and medicine may be a directed web or network of pathways and nodes, but for medical practitioners selectively focusing on specific biological interactions will likely be the task of the day. This focus is not uncommon in causal attribution and causal determination in general. After all, “a ‘complete’ causal account of an effect would be unmanageable or even impossible ever to achieve. The causal chain, or, to use a more holistic metaphor, the causal web, may extend indefinitely” (Raisanen et al., 2006, p. 294).
Like Causation, the general category of Causal Chain is broad, and applicable to most, if not all, domains and levels of granularity. A C-C class and one or more causation predicates (relation terms) can therefore take a place in the class and relation hierarchy of domain-neutral ontologies.
A disease is like a river. So the analogy goes, but how are they similar? The most general similarity according to the River Flow Model of Diseases is that both entities are of the same ontological category: Continuant. The central characterizing feature of rivers, however, is distinctly Occurrent: flowing waters. Consider the Heraclitean (Graham, 2011) proverb that one cannot step twice into the same river. It communicates the reality of change in and of the world, and uses a conspicuously dynamic natural phenomenon to do so. A more precise interpretation of this proverb conveys the idea that a river (anything whatsoever) is constantly changing, or involves change of one sort or another while retaining its distinctiveness. A yet stronger interpretation is that a river maintains its identity for but an instant … or not at all.
Yet we often speak of individual rivers. We are able to visit and later refer to the same river – or the unfolding geographic processes we call ‘rivers’ – time and again. To the extent that the general term ‘river’ and the corresponding concept reflects a natural kind, and is indispensible in hydrology, fluvial geomorphology and related scientific disciplines, it reflects an ontological category for scientific domain ontologies.
Whether rivers are more accurately described as a process, a continuant, or otherwise, the phenomenon in question seemingly exhibits some constancy and stability such that we identify persisting conditions and properties. We believe that
This last interpretation is consistent with the ideas adopted by the RFM, e.g., that an object (here, a river) is “a point of stability” between or in virtue of processes (Galton & Mizoguchi, 2009, p. 25). Both communicate the
We identify the physical boundaries of rivers while being cognizant of the fact that those boundaries objectively change as flowing waters continuously shape and carve their way into the surrounding terrain. Rivers “exhibit tremendous variability in the quantity, timing, and temporal patterns of river flow, and this profoundly influences their physical, chemical, and biological condition” (Allan & Castillo, 2007, p. 31). This constant change in the measurable properties of depth, shape, temperature, content (sediment, nutrients, biota), and current velocity (Allan & Castillo, 2007; Hebert & Ontario, 2008) is observable over time. Hence, rivers are described as persisting in time through changes in these, and other, properties. The RFM, therefore, ontologically categorizes rivers, like waterfalls (Galton & Mizoguchi, 2009), as
The
Diseases, like rivers, are continuants in the RFM. A disease is held to persist as one and the same entity while affecting an organism over time. Personal physicians describe a patient’s disease in continuant-like terms, tracking an instance of a disease over time while observing changing signs and symptoms.
There is also a distinct process-like character to diseases. They develop over time, harmfully affect body parts, functions and systems, all of which are amenable to study and measurement just like the features of rivers. These pathological changes are processual parts composing the course of the disease. Just as a river affects its surroundings, such as the surfaces of the host terrain it flows over and through, so a disease affects the host organism.
The external–internal process distinction applied to rivers is also applied to disease. The
Constraints on the properties rivers and diseases have are placed, in part, by the surrounding environment (Allan & Castillo, 2007, p. 32) – the wider ecosystem and the organism, respectively. The direction and flow of water, for example, is determined by certain conditions and laws of nature. In an analogous way, diseases develop in a biologically principled manner with a myriad of potential causally efficacious conditions and effects.
The changes that occur during the course of each disease instance vary from patient to patient. That is, the development of a disease takes different symptomatic and processual paths. They can spread in a multitude of ways, affecting different areas of the body. These variations consist of different causal chains. Distinct patients may exhibit different physical manifestations (signs) and experience different symptoms just as distinct rivers uniquely affect their surrounding environments. Rivers change size, shape, and exhibit branching topological properties. So the analogy goes, a disease itself branches (spreads) into different parts of the body.
A disease type may manifest in different ways – in different disease courses of causal chains – but the idea is that instances of that type will nevertheless exhibit
Formal ontological accounts of disease
Existing ontological theories have put forth distinct accounts of disease, the River Flow Model being one of them. In this section, its disease definition (Mizoguchi et al., 2011) is unpacked and developed further using the preceding analysis. We first explain the original natural language (NL) definition of the RFM Disease class. Second, we discuss additional classes found in the RFM implementation, but not mentioned in the NL definition. We change some to more clearly reflect the concept of causal chains. We do not provide formal definitions for the C-C subclasses, however. The difference between the original definition as found in RFM publications and its current implementation is an important discrepancy to address because ideally they should cohere. The changes we present are a step toward greater consistency. Third, a revised definition is presented to better match the actual classes in the YAMATO hierarchy, of which RFM classes are a part. Fourth, the RFM is compared with the Ontology of General Medical Science’s (OGMS, 2014) disease account. We offer potential strategies for their interoperability.
The River Flow Model disease definition
The RFM is neutral with respect to the cornucopia of theories of causation. It focuses on knowledge and intuitions of causality at the mesoscopic scale of daily occurrences, as understood in biology and at the relevant biological scales. If we accept that “[…] in medical contexts causality talk has the same character as everyday causality talk” (Johansson & Lynoe, 2008, p. 127), and that this mirrors (to some degree) the causal structure of the world, then we can apply causal notions to finer biological scales.
A causation relation (
We unpack the original natural language disease definition (RFM-NL) by discussing the relevant ontological categories as they appear therein.
A dependent continuant constituted of one or more causal chains of clinical disorders appearing in a human body and initiated by at least one disorder (Mizoguchi et al., 2011, 2012).
The supposed ontological classes mentioned in this definition are:
The actual formal implementation of the RFM, however, utilizes classes not mentioned in the NL definition (and vice versa). RFM-NL is at least acceptable as a shorthand, expressing the main idea of the theory, for the more complicated implementation.
As a subclass of Continuant, a Equivalent to BFO: Dependent Continuant (version 1.1).
To say a disease is

The constitution of the RFM causal structure class and causally-linked occurrents. The rounded rectangles signify that, depending on the disease, each causal sequence of occurrents can be represented as a particular causal chain. (Colors are visible in the online version of the article;
‘ The parts of a whole process stand in the

RFM classes and relations in YAMATO. Solid and dashed
We now present the additional categories and relations that are part of the RFM implementation. Although Causal Chain, Disorder and
The RFM uses different kinds of Causally-linked Occurrents to distinguish causal scenarios. For example, the ordinary conception of causal chains is one of sequential happenings. Our
A growing blood clot – a process with a clot as a participant – simultaneously causes the reduction of the cross section of the respective blood vessel, which simultaneously causes a reduction in oxygen supply to some organ. If left untreated, organ death will occur. At least three distinct processes occur simultaneously.10 Further analysis may lead to replacing a
In the dependent entity portion of the YAMATO, Equivalent to BFO: (Specifically) Dependent Continuant (BFO, 2013).
The implementation – including OWL2 (OWL, 2014)12 OWL files by Kouji Kozaki and Riichiro Mizoguchi:
‘State’ has been defined as one or more time-indexed properties (Yamagata et al., 2013) that may or may not change over time. It is a snapshot of a process. A state category, like a C-C category, abstracts or groups together certain realities to be represented as a single whole. Thus, bio-causal systems can be described and symbolically represented as units.
Disease C-Cs are
If external agents are included in a wider causal description of etiopathogenesis, they effectively cause (if only partially) the initiating disorder. For example, invading infectious foreign bodies, pathogens, harmful doses of radiation, or some combined set of factors, cause the formation of a disorder (or some other pathological entity). This external (set of)
“Appearing” expresses See Smith et al. (2005) for a proposed formal definition.
As pathological entities affect an organism, a disease spreads and the C-C changes. Depending on how the disease course unfolds – that is, what organism parts, properties and systems are affected and where – the causal chain may branch or fork … just like a river.
Note that for dynamic scenarios, subclasses can be asserted with a temporally-indexed ternary
Consistency in ontology development is important to, among other things, avoid confusion. In order to have a NL definition that coheres with the implementation (and vice versa), we present two revised RFM disease definitions. They are stated in genus–species–differentia form in order to foster a computable formalization. The first revised definition takes RFM-NL and includes locational relations, Abnormal/Pathological State and the parent class of Disease.
An
A FOL formalization of RFM-A is similar to RFM-FOL, but with the obvious substitutions. (i) + (ii) from (C-C FOL) express the causally-linked occurrents, and (iii) expresses the composition of causal chains.
The second definition closely matches the YAMATO hierarchy and implementation by including the classes and relations actually used in the implementation. As such this definition is more easily used as a computable definition. Using abbreviated class names, we have the following.
An
Let us note two points. First, logical definitions for each class removes the need for explicitly including all the relations in these definitional variations. For example, an axiom describing ACS14
An important class neither mentioned in the definition nor found in the current implementation (yet described at length in publications) is
To the degree that there are essential characteristics of disease, the CCCs of a disease are those causal chains
For example, cases (instance) of non-latent type 1 diabetes will involve elevated glucose and a deficiency of insulin, which give rise to a common set of effects on certain kinds of biological entity (blood vessels, kidneys, eyes). CCC serves to help identify a clinical case as an instance of a disease, distinguishing it from others. They are therefore necessary conditions, and minimum identity criteria for diseases. CCCs have Occurrents as parts, and are therefore parts of the Causally-linked Sequences of Occurrents. To express this idea we assert (A5).
Disease causal chains represent the causal structure of pathology through the full temporal range of a disease. They express the causal relationships of past to presently ongoing pathological changes. For example, RFM’s YAMATO: Events (but not Processes) are completed temporally extended wholes. Disease Causal Chains in Continuant and Occurrent terms
The question ‘What is the causal structure of disease?’ can be partially answered with a description of the relevant causally-connected pathological entities (i.e. disease causal chains). An exhaustive causal representation of a disease that reflects the changes affecting a patient over time – the overall etiopathogenesis of the disease – will, needless to say, be highly intricate. Contributing causal factors, (quasi)simultaneous causation, reciprocal causation, and bio-causal feedback mechanisms will be involved. In short, considering only sequential/linear causal scenarios does not do justice to biological reality, which is more interrelated and continuous (Schulz & Johansson, 2007). Desiderata for any causal account of disease includes the ability to represent this complexity. Toward that end, systems biology may inform representations of this bio-causal complexity. (Renton, 1994, p. 82, italics added)
Other attempts at a general account of disease include the Ontology of General Medical Science (OGMS, 2014). It draws ideas from Scheuermann et al. (2009), and uses the BFO as its top-level ontology. It distinguishes disease from disease courses, a distinction reflecting the two ontological perspectives adopted by its top-level. OGMS:
The realizable nature of dispositions is intended to account for the fact that a disease may not manifest itself, i.e., without the organism exhibiting observable signs and symptoms, at all times it exists in the organism. A disease, then, is conceived as something that is potentially manifested via pathological processes. Once manifested, it causes other disorders. Causality in OGMS is unstated, implicit or stated indirectly. OGMS:
As discussed in Section 3, production is causative. Etiological and pathological processes stand in causal relations16

OGMS classes with suggested areas for causal relations (red). Rectangles and rounded rectangles represent continuant and occurrents, respectively. (The colors are visible in the online version of the article;
One or more causation predicates are also potential additions to, say, the Relation Ontology (RO, 2014), which is used by OGMS and BFO. Mihăilă et al. (2013, p. 3) note that although general physical causation is of interest in biocuration efforts, terms such as ‘cause’ rarely appear in domain ontologies. Terms such as ‘regulation’ and ‘stimulation’ are used instead. Both the comment annotation for RO: See:
Before we discuss strategies for interoperability between the RFM to OGMS, there is one concern to mention. Although dispositions capture the idea of disease manifestation/realization, categorizing all disease as disposition appears false. Medical practitioners work with concrete biological realities, and are naturally concerned with domain-specific phenomena. Far from being the sort of entity that biologists study, dispositions (at least in a pre-theoretic sense) are intuitively of a more intangible and perhaps inaccessible nature. How are dispositions detected when they are not manifested, for instance. It may appear too detached (or abstract) from the concrete biomedical reality practitioners encounter on a daily basis. Moreover, medical discourse using ‘disposition’ or ‘predisposition’ does not necessarily indicate reference to the posited corresponding ontological categories, and does not necessarily reflect widespread (or even individual) belief in diseases
That being said the RFM generally aims for interoperability with other ontological accounts of disease, while putting forth an alternative and autonomous account (a point we will return to in Section 5.5). If individual disease accounts succeed in providing representations of some, but not all, aspects of disease, then an application of one to the other may yield a more thorough representation of disease.
There are a few points of overlap between the RFM (and thus YAMATO) and OGMS (and BFO). As noted in Section 2, the River Flow Model of Diseases shares upper-level categories and distinctions with BFO and DOLCE. It specifically shares the classification of disease as a (Specifically) Dependent Continuant with OGMS (Mizoguchi et al., 2011, p. 64). A partial one-to-one mapping is therefore feasible. Another similarity is with respect to causal notions.
The RFM explicitly applies causal concepts to disease ontology development by using classes that can be formalized with one or more causation predicates. A formal account of cause–effect pairs and causation in DOLCE is found in Lehmann et al. (2004). The BFO discusses types of causal unity in BFO 2.0 (2013), but not causation or causal relations. Each ontological theory seems to share an understanding of the general idea of
For example, according to the RFM and YAMATO, some internal processes of a solid material object help sustain (or maintain) its existence. Internal interactions and molecular bonds maintain its structural integrity. This is on conceptual par with the second kind of causal unity described in BFO 2.0 (2013), namely “causal unity via internal physical forces”, as well as Topological Unity as described by DOLCE (Guarino & Welty, 2000).
Generally speaking, causal unity helps make something a physical, but at least a material, object. As Bird (2014, pp. 24–25) states, a physical object is more than just the mereological sum or physical proximity of its parts; “some kind of causal unity among those parts is also required”. Material objects are those composed of some portion of matter. Physical objects are, more broadly, spatio-temporal wholes.
The unity of disease causal chains is partly in virtue of the causal connectedness of the interacting, dysfunctional parts, processes, and the surrounding environments. According to the RFM, a disease, like a river, is an example of this unity.
Looking more closely at OGMS classes we see another similarity. OGMS:
Strategies toward harmonization between disease accounts
Although OGMS defines Disorder as a Material Entity (a BFO: Continuant), the RFM would represent them as Abnormal States in causal chains (Causally-linked Occurrents). These states can be interpreted as the
Consider another idea. If OGMS persists with a dispositional classification, its Disease class can be described as an Abnormal Causal Disposition, which would be similar (perhaps equivalent) to some RFM classes.
In any event, at least two OGMS classes are candidate causal relata in RFM causal chains. The class
Some Disorders may be asserted as

The causal structure of a generic disease causal chain. The dotted yellow lines from the Causal Chain node represent an expansion/zooming. Squares are causally-linked biological continuants. Rectangles are causally-linked processes. Red arrows signify one or more causal relations. Two opposing arrows indicate
The top part of the yellow rounded-rectangle displays a disease causal chain in terms of pathological processes, with the lower part in terms of causally-linked biological continuants. If one seeks to describe a DCC using only processes, then the top part is appropriate. Together, top and lower portions demonstrate the combining of process-like and object-like ontological perspectives (Table 3). If processes presumably play the preferred role of causal relata.
At least part of an OGMS:
To partially address any incompatibility between YAMATO: Process and BFO: Process, consider this. If either (or both) BFO or YAMATO generalize their Process (or Occurrent classes), then they may be in a better place toward harmonization. Alternatively, if either (or both) disease account assert a Pathological Occurrent superclass (to represent types of disease occurrents) in their disease models, then an equivalence between the respective classes appears viable. This class can, for instance, subsume RFM: Abnormal State as well as OGMS: Pathological (Bodily) Process. Classes from other biomedical ontologies, such as the Gene Ontology, the Infectious Disease Ontology (IDO, 2013), and the Disease Ontology (DO, 2013) should be applicable in a similar fashion. Further research is in order.
To the extent that the RFM concentrates on the instance-level,18 This would be consistent with philosophical accounts holding causation to be of the token, not type, variety. Either simultaneous with
For example, if Arterial Hypertension is a disposition, then its realization process would stand in certain causal relations to other parts of the disease course directed to heart disease. If treated, the hypertension is not manifested via some elevated blood pressure process, i.e., the respective C-C or causal process does not occur. If untreated, there is high blood pressure: the C-C occurs.
The problem with this example is that if Arterial Hypertension simply is high blood pressure (not a measurement of it) or that which causes it, then instances of hypertension are continuously ongoing, abnormal processes of blood flow. There may be no observable symptoms, but these processes still occur
The River Flow Model does not require disposition classes to define disease, but allows for them. Moreover, talk of dispositions is unnecessary for
One or more disease C-Cs at one or more scales is constantly influencing the organism. When they cause specific pathological effects at the same (or more coarse-grained) levels, signs and symptoms become observable to the physician and patient. In short, some pathology may always be present.
Consider the blood clot example. The clot decreases the cross section of the blood vessel. The result is an abnormally narrow channel for blood to flow through. Rather than a dispositional account according to which the disorder affords the existence of a disposition, we simply state the causal realities. The clot – the existence of which already establishes some pathology and the partial causal basis for subsequent pathological entities – causes certain abnormal processes, e.g., abnormal blood flow processes. It disrupts what otherwise is normal blood flow. The dysfunctional blood flow processes continuously unfold for the duration of the clot’s existence regardless of the degree of deviation from healthy blood flow. They merely do not cause certain signs or symptoms at some level.
OGMS would describe this scenario as one of an unrealized disposition, but this may be unnecessary for some diseases because talk of
Regardless of the classification or ontological description of disease, ultimately what is important is the degree to which the ontological characterization can help support the prevention and curing of illness/disease.
Two general points about disease causation and ontology are easily accepted. (i) The pathogenesis of diseases is comprehensible, lawfully unfolding in time according to knowable principles, and thus has discoverable causes. This affords the manifestation and perception of recurrent but variable, discoverable and determinable patterns, which help form putative disease kinds. (ii) Some dysfunctional or disordered organism parts and processes are causally connected. They stand in certain causal relations (causal … dependence, necessitation, determination, etc.) to one another. They are interconnected such that changes in or
Clinicians and biomedical researchers study not merely disordered organs and their characteristics, but also the dysfunctional or pathological processes occurring in the patient. Biomedical specialists may also be unconvinced by having a specific entity (e.g., disorder), classified/characterized in a specific way, take an exclusive role in disease representation. A generic disease C-C or causal structure class, then, communicates the causal interconnectedness of some pathological entities, their causal efficacy and the way they temporally unfold. In other words, it expresses the causal aspect of pathology.
With this in mind, more inclusive causal and pattern-based definitions of disease can be formed by using higher-level categories such as Pathological Entity, Condition (as in footnote 20), and so on. For example, a broader disease definition that allows for any pathological entity playing a causal role in the disease development is this: (
This definition retains the intuitions that disease is existentially dependent and causally efficacious without committing to a more specific classification (e.g., pathological causal structure, disposition, process, etc.). It encompasses RFM and OGMS conceptions of disease, while not necessarily requiring all the relations used in the RFM conception of disease (similar to Fig. 4). A problem with this formulation, however, is that it may be too broad. In replacing ‘Pathological Entities’ with ‘Pathological Occurrents’ (and defining it appropriately), one constrains the relata while allowing for both Abnormal States and Pathological Process classes. It also allows for domain-specific specializations, e.g. biological occurrents, mental occurrents, etc.
In Section 3.1 the category of Pattern was mentioned as being related to causation, and thus as being potentially applicable to the representation of disease causation. To briefly explore one of the pattern-based characterizations, consider this generic disease definition (the terms of which are undefined). (
To the extent that Patterns and Dispositions are potentially realizable entities, we may also assert the following translation or equivalence. A disease disposition is equivalent to a disease causal pattern. The realization of the former is then equivalent to that of the latter.
To speculate on an application, consider this. Envision a multi-scale graphical simulation in which the causal influence of a disease is visualized at different scales, and superimposed on a simulation of the organism body. Both typical and atypical (but possible) disease courses can be represented through all temporal modalities: past, present and future. The usefulness of such a system is as a pedagogical tool for students of biology and medicine, and a tool medical researchers and practitioners. It would help in visualizing how various diseases commonly unfold in time, and the unique permutations of effects and disease courses. It could stimulate comprehension of: factors believed to be causally-relevant for the onset, development, and remediation of a disease; specific effects of disease entities, the primary changes characterizing a disease, and specific causal relationships involved.
The preceding discussion has offered some ideas toward better incorporating the causal aspect of etiopathogenesis in ontological accounts of disease. As a final, very important, point, note that the descriptions of disease discussed in this paper do not mention the causal roles played by other important factors in pathology development. Examples include: diet, mental states, stress-levels, lifestyle, social conditions, etc. These are essential factors to represent in any ontological account of disease because they are causally related to (often pivotally so) susceptibility to, onset, progression and curing of disease.
Conclusion
We have revisited the river, and begun to navigate the currents of causality in disease ontology. The River Flow Model of Diseases (RFM) is an ontological account representing the causal structure of pathology. It differs from other ontological theories in that it explicitly takes causation into account. Diseases, such as Diabetes, are described as being constituted of causal chains of pathological occurrents in an organism. Disease causal structures are modeled as wholes that are constituted by abnormal states and processes (causal relata) composing causally-linked occurrents (causal chains) that may temporally unfold in a sequential or simultaneous manner.
This paper provided a general discussion of causality and causal chains vis-à-vis disease ontology; and a clarification and development of the RFM, including a disease definition that more closely coincides with the formal implementation of the theory. By providing formal axioms, we have begun a first-order formalization of the RFM. We also demonstrated the commonalities between the RFM/YAMATO, OGMS and the BFO, and discussed avenues for their harmonization.
The RFM is in accord with some “desirable features” for disease ontologies suggested by Bodenreider and Burgun (2009). For example, although OWL has been unnecessary for applications of the RFM in Japan, OWL translations of the RFM are under development. This is in accord with the feature of having a standard and friendly format. In reusing existing ontological category terms, the RFM is on course toward interoperability with other ontologies, another suggested feature.
Further development of the RFM should include: clear natural language and formal definitions of YAMATO classes, curation efforts for greater comprehensibility; additional development of RFM causal relations; a complete first- or higher-order formalization; and applications to other disease ontologies. Specific research topics include systems biology and biological pathways, the former of which overlaps with the RFM in that both appear to have holistic approaches. Representing pathological causal feedback mechanisms and cyclical causality is also highly relevant, and should be included within the scope of the RFM (and other disease accounts).
Criteria for the success of the RFM, like other ontological theories of disease, include: faithfulness in representing disease entities; consistency and coherence; applications to biomedical data and specific diseases, and the degree to which it assists medical practitioners and researchers.
Just as neighboring rivers are likely to flow into the same ocean, so the “flow” (goal) of theories of disease must be toward supporting the ocean of research on preventing and curing disease. In other words, a shared purpose among medical, informatics and ontology communities involved in the study, management, and representation of biomedical data and disease entities should be to heal.
