Abstract
Keywords
Introduction
The use of qualitative research methods in the social and human sciences considerably increased as of the 1990s (Chamberlain, Stephens, & Lyons, 1997; Flick, 2007; Miles & Huberman, 1994; Murray, 2000; Silverman, 1993; Smith, Harré, & Van Langenhove, 1995). This dissemination quickly reached the health field through nursing, medicine, and public health. This renewed interest in qualitative research promoted the publication of numerous “quality criteria” grids/guidelines (we use the words
As part of an ongoing research (this document represents part of a larger research funded by the Swiss National Science Foundation (www.snf.ch), 2011-2014: “Quality of Qualitative Research in the Health Sciences: Which Evaluation Criteria?”), we examined 133 evaluation grids in five major health science fields: “medicine,” “public health,” “nursing,” “psychiatry and psychology,” and “research methodology.” To better understand their variability and its origins, we conducted a computer-assisted lexicometric analysis, our premise being that the diversity of the grids should not only be linked to the various disciplinary fields but is also intrinsic to various qualitative research conceptions among grid designers. In this article, we start by presenting a brief status report on the health science research field, in terms of quality criteria. We then present our methodology. In the third part, we present our main results. Finally, we discuss the inherent limitations of the various research conceptions highlighted within the different disciplines they represent.
Problem: A Shift in the Quarrels From the Last Century?
In a context of an increase in the publication of qualitative studies both in books and recognized scientific journals, debates on the relevance of developing (or not) specific criteria for qualitative methodologies have evolved continuously (Devers, 1999; Dixon-Woods, Shaw, Agarwal, & Smith, 2004; Mays & Pope, 2000; Yardley, 2008). Certain authors deem that the validity and reliability concepts stemming from quantitative methodologies are suitable to reach a high level of rigor in qualitative research (Miles & Huberman, 1994; Morse, Barrett, Mayan, Olson, & Spiers, 2002; Porter, 2002); however, another group of authors finds that applying the “reliability” and “validity” criteria directly is unacceptable for qualitative studies. This last group argues for differentiated criteria for qualitative work based on specific philosophical and methodological foundations. They argue their position by insisting on the structured thought needed in all paradigm transfer, especially to apply the same criteria to social and human sciences as those applied to natural sciences (Cicourel, 1964; Garfinkel, 1967; Hamberg, Johansson, Lindgren, & Westman, 1994; Schutz, 1962; Van den Hoonaard et al., 2004). Somewhere in the middle, a third group of authors argues that evaluation criteria can be transferred from quantitative to qualitative methods, as long as they are adapted (Pope & Mays, 2006).
In following these debates, we notice that the discrepancies regarding the quality criteria seem to reflect different conceptions of qualitative research (Santiago-Delefosse, 2007; Schneider, 1998). Yet, these discrepancies may explain why it is so hard to build consensus that will allow for the evaluation of qualitative research quality within the scientific community, especially because they remain obscure in literature on quality criteria grids for qualitative research.
Some work highlights the variability of existing grids (Cohen & Crabtree, 2008; Flick, 2007; Mays & Pope, 2000; Popay et al., 1998) and the difficulty in comparing them. The inconsistency of the number and objectives of the criteria used, the absence of consensual and explicit definitions, and the absence of construction consistency between the grids of one disciplinary field were identified in a previous publication (Santiago-Delefosse, Bruchez, Gavin, & Stephen, 2015). Therefore, research conceptions underlying these grids may be questioned as well as the epistemological and paradigmatic positions that guided their designers. This questioning is still highly relevant as the lack of consensus regarding a certain number of criteria, even minimal, is an argument to which article and fundraising evaluators refer and which motivates the rejection of projects.
Given these facts, our research focused on analyzing the links between the health science disciplinary fields and the major paradigms guiding grid designers in determining the criteria used to evaluate qualitative research. We used Kuhn’s (1962) general definition, which states that a paradigm is a set of disciplinary convictions shared by the scientific community at a given time during its history. Any science is stabilized by sharing the same paradigm, which marks its field and investigation procedures. This paradigm is developed by groups of socially dominating researchers who control the transmission of knowledge through research funds, positions, and publications. This definition captures the epistemological and paradigmatic choice that is made in all research conducted, whether this choice is deliberate or the outcome of reflection, or much more implicit.
Method
Make-Up of the Corpus
In total, 133 grids (checklists or guidelines) were examined in the various health fields, from the MEDLINE, PsychInfo, CINAHL, PeruniL, ScienceDirect, and Web of Sciences databases, using the keywords “health,” “qualitative research,” “assessment,” “appraising,” “quality criteria,” “peer review process,” “guideline,” “evaluation,” and “standards.” Data were used according to the following inclusion criteria: (a) grids published for health scientists (medicine, nursing, health psychology, health sociology and anthropology, public health, etc.), (b) guidelines that were sufficiently developed and commented, and (c) work published by health journals, institutes, and organizations whose scientific quality has been validated.
Whether they are taken from scientific journals (instructions to authors), articles for authors and researchers to improve their qualitative work or as an evaluation support tool, or methodology manuals, most of the grids examined are presented as lists of criteria to meet to produce “good qualitative research” that is rigorous and reliable. Most of these grids are accompanied by a text explaining the criteria (definitions and uses) and/or a theoretical or methodological text that is sufficiently detailed to support the quality criteria analysis.
Most of the grids examined come from publications dating from 1993 to 2011, which correspond to the increase in debates on the quality of qualitative research in the 1990s. Most grids come from medical journals (45), nursing literature (39), research methodology journals/literature (20), public health literature (19), and psychiatry/psychology literature (10). Most evaluation grids were published in medical and nursing literature.
We submitted our corpus to two analyses: a written analysis with inter-evaluator agreement that allowed us to determine coding categories for our second analysis and a computer-assisted lexicometric analysis (Alceste®).
Thematic Analysis of the Content and Theoretical and Methodological Foundations of the Analysis Categories
Using Alceste® requires prior coding of the text to show the software which variables must be identified in the text. To do this and identify the variables that would be most relevant to our questioning, we conducted a first thematic analysis of the grid content. This first content analysis helps to identify the “contextual variables” specific to each text (i.e., author, discipline, and paradigm; “paradigm” here refers to a system of fundamental beliefs belonging to the researchers and used as the basis for their theoretical and methodological choices in the conception of the grids). The variables are then coded to make them compatible with the computerized analysis. The content analysis was conducted independently by three researchers. The results were triangulated and discussed collectively to come to an inter-evaluator agreement (Braun & Clarke, 2006; Robert & Bouillaguet, 2002).
The reference framework used for this content analysis was the one used for the work conducted by Denzin, Guba, and Lincoln. Indeed, besides their heuristic scope, they are the most often cited in literature on the importance of paradigms and epistemologies in qualitative methods (Denzin & Lincoln, 1994/2011, Santiago-Delefosse & Rouan, 2001).
In 1994, Denzin and Lincoln proposed a classification that included two major paradigms subdivided into four epistemological research positions: the positivist/post-positivist paradigm (including the positivist or post-positivist epistemological approaches) and the interpretivist/constructionist paradigm (including the critical or constructivist epistemological approaches). Notice that in 2011, they added a “participative” epistemological position to the interpretivist/constructivist paradigm. A certain number of criteria differentiate these epistemological and paradigmatic approaches. Among these criteria, three major axes caught our attention as they can be operationalized and help to compare the criteria in the grids: (a) the form and nature of reality (following the preexisting or co-constructed one), (b) the researcher’s type of position regarding research (outside observer vs. participant involved in the co-construction), and (c) the use of the research (discovery of inherent truths vs. work on subjectivity and the impact of its construction on the world, work on the co-construction of realities, work on dominating forces and empowerment). We added a fourth axis regarding the presence of ethical questions, which also indicate the researcher’s position toward the research (absent when the research is considered an objective report on what exists vs. always present when the researcher, his or her values and the context are considered a part of the research at all times).
We analyzed our corpus by focusing on these four major axes, which help to report on the conceptions underlying the research in the 133 grids examined. As a result, we identified five paradigmatic and epistemological approaches: The first category was defined as “post-positivism”; the second one, “post-positivism/interpretivism” bringing together grids that combine both approaches; the third category entitled “constructivism/interpretivism” was built upon a similar basis; the fourth one was “critical constructivism”; and the fifth one was the “post-modern” approach including grids with criteria dealing with radical relativism/constructivism (considering, for example, that qualitative research is too singular to be evaluated). No grid was identified in the “positivist” epistemological approach.
Presentation of Alceste® and Relevance of the Lexicometric Analysis Based on Our Hypothesis
We used an analysis produced by Alceste® to process the contents of this corpus. Alceste® is textual or statistical data analysis software, originally designed by Max Reinert, of the CNRS, in France (Reinert, 1990, 1993). It has been used in the human and social sciences since the 1990s. It counts words in a frequential manner and helps to obtain analysis units that are based on formal criteria. It uses an inductive and recursive approach and helps to identify co-occurrences, or word associations in a sentence, by processing the words according to their similarities and differences. Technically, Alceste® breaks the corpus into fragments that are relatively similar in size, referred to as “context units” (e.c.u. the smallest statistical units created by the software, based on a compromise between the syntactic form [proper punctuation] and the statistical constraints [these units must be of comparable size]). These same fragments are then reclassified statistically and split into classes whose specific vocabulary is as different as possible. These classes are called “lexical worlds” and are considered to present an “image” of the representations contained in the text, the ideas and the corpus’ major themes (Garric & Capdevielle-Mougnibas, 2009). Second, it defines a link to what software developers refer to as “contextual variables” (data elements known by the researchers and used as instructions for the software so that the lexical analysis can be conducted with these variables in mind). These variables are used to identify the texts and are related to their content. The researchers introduce them according to their relevance to the research questions. The classes identified by the software must then be examined and linked to the co-occurrences to give them meaning and explain their differences (Aubert-Lotarski & Capdevielle-Mougnibas, 2002). Notice that the software’s results are based on frequencies rather than weighting, which could be considered a limitation.
We deemed it relevant to use this software because of its textual classification abilities, given that we hypothesized that the differences between the grids were not solely linked to disciplinary fields in the health sciences, but represented paradigmatic and epistemological positions within certain disciplines, connecting certain grids of disciplines that differed from those from which they stemmed. If this hypothesis was validated, we would obtain a classification of grids that would be more connected to paradigmatic/epistemological positions than to disciplinary fields. This does not mean that certain disciplinary fields cannot be consistent among one another (i.e., the older and more solid ones).
The following variables were used for the lexicometric analysis conducted as part of this work:
The five health disciplinary fields in which grids were published: medicine, nursing, public health, psychiatry/psychology, and research methodology.
The five paradigmatic and epistemological approaches as identified in the thematic analysis of the content: post-positivism, post-positivism/interpretivism, constructivism/interpretivism, critical constructivism and post-modernism.
Table 1 presents the “contextual variables” used and adapted for Alceste® coding and stemming from our written content analysis.
Contextual Variables Used for Alceste® Coding.
Results
During the written analysis and according to the contextual variables used, we obtained a distribution of grids by “disciplinary field” and “paradigmatic and epistemological approach” (Table 2).
Distribution of the Grids by Disciplinary Field and Paradigmatic and Epistemological Position.
There are only a few post-positivist (pp1) grids in the corpus. They generally propose similar criteria from a positivist perspective: There are some in medicine, research methodology and public health, one in nursing, but none in psychology/psychiatry. The post-positivist/interpretivist approach (pp2) is the most important in our corpus (57/133) and concerns grids that include criteria close to the positivist field and more interpretative criteria: They are mostly found in medical journals. Constructivist/interpretivist grids (c1) are not as numerous (31/133) and propose criteria specific to qualitative research, integrating contextualized constructivism and the researcher’s interpretative position: They are mostly found in the fields of nursing and medicine. Grids classified in the critical constructivism category (c2) are slightly less numerous (27/133) and include criteria specific to qualitative research, with a constructivist approach: They are found in all disciplines, predominantly in nursing. Finally, there is a small group of five grids that is harder to classify in the previous categories, with a profile that is often explicitly geared toward a post-modern approach (c3) and published mainly in nursing.
We note that the grids are mainly split into two large groups. The “post-positivist/interpretivist (pp2)” group represents half of the corpus (57/133), and then the sum of the two “constructivist (interpretivist and critical)” also represents half of the corpus (58/133). Consequently, the issue regarding the discrepancies between the grids may stem from the difference between these two large groups, especially in the definition of the two “interpretivist” positions.
Lexicometric Analysis and Distribution of the Corpus Among Six Classes
Once Alceste® processed the corpus, the latter was split into six distinct classes. Figure 1 details the class distribution and indicates the five most significant terms (χ2) given by the software for each of the six classes as well as the contextual variables related to them: discipline and paradigmatic approach. To make it easier to read the analysis, we also numbered the quadrants of the factorial design (A, B, C, D).

Distribution of the six classes and contextual variables.
The factorial design (Figure 1) shows the interrelations between the classes and their respective positions in the quadrants. Certain classes, namely, Classes 1 and 5 in Quadrant C, and Classes 2 and 6 in Quadrant B, are closely linked to one another, almost interwoven. As for Classes 3 and 4 (Quadrants A and D), they are more well-defined and independent. We thus notice that the classes do not necessarily cover the disciplines, as there are two psychology/psychiatry groups with different approaches, as well as two nursing grids (Classes 2 and 4) or public health grids (Classes 4 and 6) characterized by different paradigmatic approaches. We also notice that Class 4 includes grids from three disciplines: medicine, public health, and nursing.
The factorial design also indicates a bipartite distribution between the grids underlying approaches that are essentially constructivist (c1) and focused on the researcher’s qualities as an integral part of the research in quadrants of C and D (Classes 1, 4 and 5), and the grids with more post-positivist approaches that highlight the “good research practices” needed to ensure the research’s validity and objectivity, in Quadrants A and B (Classes 2, 3, and 6).
Dendrogram of the Six Classes and Significant Terms
The examination of the significant terms (χ2) present in each class can explain the specificities in the factorial design and identify those of each class. This analysis highlights the specific identity of the grids and the differences/similarities between them. The dendrogram (Figure 2) shows how the six classes are organized and linked.

Dendrogram: Distribution of the corpus into six classes with significant terms (χ2).
The six classes are split into two main branches including on one hand Classes 1, 4, and 5 (Quadrants C + D) and on the other Classes 2, 6, and 3 (Quadrants A + B).
In the first branch (Quadrants C + D), two classes containing psychology/psychiatry grids stand out: Class 1 with its constructivist/interpretivist approach (c1) and its criteria focused on experience, the meaning of things and comprehension; and Class 5, which is more focused on validation criteria (triangulation, member-checking) and a constructivist approach mixed with post-positivist criteria (c1, pp2). Some medicine, nursing, and public health grids (Class 4) can also be found in the constructivist/interpretivist group (c1), using criteria that are more based on practices, care, research use value, and clinical contributions.
The second branch (Quadrants A + B) includes grids dealing with a post-positivist approach: On one hand, public health grids (Class 6) with a post-positivist approach, using certain criteria specific to the interpretivist field (pp2) and describing mostly sample selection methods and inclusion/exclusion criteria, and on the other nursing grids (Class 2) focused on literature, work framework, research process, and being a little more related to the constructivist field (pp2, c2). As for the research methodology grids in Class 3, they present very diverse approaches, ranging from post-positivism to post-modern (pp1, c3, c2), with their content focused on practices, tools and research techniques. In that respect, these grids, compared with the other groups, seem to be more linked through criteria based on methodological rigor alone, than through the discipline or paradigm.
It is worth noting that Quadrant D (Classes 1 and 4) mainly corresponds to the constructivist/interpretivist position (c1) while Quadrant B (Classes 2 and 6) mainly corresponds to the post-positivist/interpretivist position (pp2). These two quadrants include criteria from grids from four of the five disciplinary fields; however, Quadrant C corresponds to certain criteria from the psychology grids only (Class 5) and Quadrant A to certain criteria from the research methodology grids (Class 3).
These initial results show that the characteristics of the grids do not necessarily cover a classification based on the disciplines and that their differences or similarities are more linked to the paradigmatic and epistemological positions that underlie them. Only two classes (5 and 3) seem to be more determined by disciplinary proximity (Class 5 to a lesser extent though). Consequently, these differences must be examined to determine their meaning. The following analysis provides further detail on the six classes based on their elementary context units (e.c.u., semantic content) and contextual variables (disciplines and paradigms) to better understand their weight in the differentiation of the grids.
Analysis of the Classes According to the Significant Terms and Contextual Variables
Quadrants C and D (Classes 1, 5, and 4): “Mainly” constructivist approaches
Class 1 (psy, c1)
This class includes a group of grids that stems mainly from psychology/psychiatry (psy, χ2 = 60), with a constructivist/interpretivist approach (c1, χ2 = 8). This group of grids is characterized by terms such as
This class proposes specific evaluation criteria for qualitative research. They are linked to the meaning produced, in a context explained by researchers who clarified the implications, and demonstrated their training in terms of interpretative theories. In this group of grids, and in keeping with the constructivist paradigm as defined, the goal of the research is to understand and “reconstruct” its initial subject, as a consensus between the researchers and participants, and be open to new interpretations. As such, the validation of the research places a high value on the researchers’ training and experience, as well as on the consideration of their own “biases” and the characteristics that are likely to influence the research. The criteria proposed in these grids are usually adapted to qualitative or mixed methodologies.
Class 5 (psy, c1, pp2)
A second group of psychology/psychiatry grids (χ2 = 3), different from the first, holds a distinctive place in this class. This group is also dominated by a constructivist/interpretivist approach (c1, χ2 = 9), but with a post-positivist-interpretivist component (pp2, χ2 = 4) also present, which provides this group with an approach that is increasingly marked by research bias validation and reduction notions (e.g., Meyrick, 2006 or Stiles, 1999). This class presents significant terms such as
In this group, as per the constructivist paradigm, research is considered to be a collaborative and collective co-construction endeavor whose researcher and participants are tools in the search for consistency and the integrity of the reports and inferences. The content and vocabulary used by the quality criteria of the grids, however, also indicate a need to objectify the research through validation techniques stemming from more positivist approaches. The criteria proposed in these grids are adapted to qualitative or mixed methodologies.
Class 4 (med, nurs, pubh, c1)
A group of grids in medicine (χ2 = 10), public health (χ2 = 5) and nursing journals (χ2 = 2) holds a distinctive place in Quadrant D, while still being close to Classes 1 (psy, c1) and 2 (nurs, pp2, c2). For example, it includes the Cohen and Crabtree (2008) grid published in medicine, the Walsh and Downe (2006) grid for nursing and the Dixon-Woods et al. (2004) grid published in public health. Various significant terms given by the software, such as
Most grids in the class highlight the importance of the connection between the study conducted and the existing knowledge, the fact that any new knowledge can contribute to new conceptualizations or questions, lead to the development of basic theories or hypotheses, question existing theories or provide methodological information leading to improvements in the practice. Certain criteria, especially in public health, focus on the social mandate, moral obligation as a rationale for health sciences. Like for the grids in Class 1, Class 6 displays a paradigmatic constructivist approach in which the goal of the research is the “reconstruction” of its subject and the development of new interpretations. The criteria proposed in these grids are adapted to qualitative or mixed methodologies.
Quadrants A and B (Classes 2, 6, and 3): “Mainly” post-positivist approaches
Class 2 (nurs, pp2, c2)
This class includes a group of grids stemming from the nursing field (χ2 = 11), mainly displaying a post-positivist/interpretivist approach (pp2, χ2 = 4), with criteria stemming from a critical constructivist approach (c2, χ2 = 3). For example, there are Cobb and Hagemaster (1987), Russell and Gregory (2003) and Beck’s (2009) grids. This class’s significant lexical content includes terms such as
In a mainly post-positivist/interpretivist perspective, the criteria in these grids focus on the reasoning of the researcher and methodological rigor, with, as its validation terms and conditions, the faithful reporting of the logic of the steps, sampling and data collection. The criteria proposed in these grids are generally adapted to quantitative or mixed methodologies.
Class 6 (pubh, pp2)
This class is closely intertwined with the one formed by the nursing grids described previously (Class 2). It also includes grids with a post-positivist/interpretivist approach (pp2, χ2 = 14), but mostly stemming from public health journals (pubh, χ2 = 13), such as the grids by Tong, Sainsbury, and Craig (2007), Daly et al. (2007), or the Critical Appraisal Skills Programme (CASP; 2006). This group of grids is composed of criteria that differ from those in the public health grids of Class 4 (med, nurs, pubh, c1), and are more closely linked to methodology and sampling issues. The main significant terms of this class highlight the technical considerations surrounding sampling and its justification:
Often closely linked to positivist criteria, the criteria in Class 6 marks the need to represent the population (sometimes related to statistical analysis) to ensure the transferability of the results. Like for the grids in Class 2, step description and data collection are also preferred in guaranteeing the rigor of the research as a whole. The criteria in these grids are usually generally adapted to quantitative or mixed methodologies both because of the paradigmatic approach (pp2) and the discipline concerned (pubh).
Class 3 (meth, pp1, c3, c2)
This class is well specified and does not overlap onto any other class. On the disciplinary front, there are many grids stemming from research methodology articles and/or literature (meth, χ2 = 94). From a paradigmatic and epistemological point of view, however, it remains very diversified, with an essentially post-positivist predominance (pp1, χ2 = 29), but also including the presence of criteria linked to a post-modern (c3, χ2 = 9) or critical constructivist approach (c2, χ2 = 6). For example, there are grids by Flick (2006) or Sandelowski and Barroso (2007). The most significant terms in this class are linked to the quality of the data (from their collection to their retention) and the training of the researchers in terms of the methods used:
The grids in this class provide a thorough description of the research process, emphasizing the data collection tools and data retention and protection methods. Indeed, data storage must allow for independent control, and the specific description of the process must show how the data are processed and how they correspond to the goal of the study. In keeping with the post-positivist paradigm as described, concern for validation is reflected through issues related to the proof and quality control (of the data, processing, and training of the researcher). There are also ethical criteria among the quality criteria. In this class, this criterion constantly refers back to the consent of participants as well as to the information they receive on the objectives and process of the research, more than to the researcher’s position. Generally speaking, the content of the grids refers back to general criteria for qualitative research, and to criteria specifically adapted to quantitative or mixed methodologies. Finally, this class stands out from the rest because of its roots in issues related to methods, dominant both among the authors of the grids stemming from positivist approaches and constructivist approaches. This may explain its diverse character, given that it includes grids from post-positivist (pp1), post-modern (c3), and critical constructivist (c2) paradigms.
Our results show a certain order among the six classes. In Quadrants C and D, there is a first group that includes grids whose dominating theoretical model is identified as constructivist, that is, they base their relationship with reality on the idea that the human being is constructed collectively, and that the best we can do is interpret in a more or less shifting social, cultural, or temporal context. The criteria of this group of grids present the researcher as an integral part of his or her study subject, and in constant interaction with it. As a guarantee of the “authenticity” of the research data and results, these grids emphasize the faithful reporting of the experiences lived by participants and the clarification of the researcher’s personal implications and biases, as well as his or her experience and training in terms of interpretative theories. The second group (Quadrants A and B) includes grids whose dominating theoretical model was identified as post-positivist, with the addition of criteria stemming from post-positivist and interpretivist approaches for some. The criteria in this group of grids suggest looking at reality as a series of existing facts and/or events irrespectively of the researcher. The goal is to discover or at least approximate. Therefore, methodological rigor is a guarantee of validity, with emphasis put on the logic of thought and the transparent description of the research process.
Table 3 summarizes the results and highlights how the grids are split according to a paradigmatic and epistemological continuum, all the while showing that the boundaries between post-positivism and constructivism are not clearly specified. This result is even more important in that the sum of grids pp2 (post-positivism/interpretivism) and c1 (constructivism/interpretivism) is 88/133. As such, the “interpretive” position seems to represent the quality criteria grids for qualitative research the most. Moreover, given that the software Alceste® sorted the data “Interpretation” into two separate categories, we acknowledged that there are at least two distinct meanings for this concept. The first is more or less solipsistic, that is, the researcher knows and can interpret the data using the right theory. The second is more specific in that the interpretation is actually a co-constructed language between the researcher/subject context. In the first case, the interpretation is linked to the examination of a preexisting reality. Linked with the literature and theory, it must explain a situation using an appropriate sample selection method. In the second case, the interpretation is linked to meaning, in connection with cultural comprehension; it calls on the researcher’s reflexivity and commitment, helps to clarify the practice, and allows for useful applicability for patients. More generally, we also notice that few grids were classified in/under the “post-positivist” and “post-modern” paradigms. By contrast, a greater number were included in the columns that bring together mixed approaches. An important finding stemming from the classification of grids based on their underlying research conceptions was their distribution along a continuum from post-positivism to post-modernism, including interpretivist and constructivist approaches, and representing a Bell curve (Table 3).
Distribution of the Paradigmatic and Epistemological Positions According to a Continuum: Summary of the Results.
Discussion
Our content analysis and Alceste® lexicometric analysis confirm the presence of major discrepancies between the quality criteria grids examined and their relationship with the various paradigmatic and epistemological approaches identified in the grids. References to the major research paradigms or epistemological positions of the researchers are generally implicit, and are not easily identified because of the complexity of the content and structure of the grids. Indeed, on one hand, the same approach can be identified in grids belonging to different classes, and on the other hand, different approaches can be merged into the same grid. Although the six classes stemming from the lexicometric analysis can be placed in a consistent order based on the vertical partition of the previous factorial design (Figure 1) and the significant terms related to them (Figure 2), this distribution remains global. Indeed, the dispersion of items from the same grid to different classes reveals a lack of uniformity in the structure of the grids and the underlying theoretical approaches in the content of the criteria. This diversity shows how unclear the underlying paradigmatic and epistemological approaches in the evaluation grids are and how complex their links are. Consequently, it is hard to draw a clear line between them and to place them in only one category. We can hypothesize that the authors of the grids themselves seem to have overlooked this construction diversity. Yet, it may “objectively” explain the experts’ “subjective” difficulty when they say that they do not have the tools needed to evaluate qualitative research. The latter notices the lack of consistency between the different grids and even sometimes within the same criteria grid. Still, beyond this lack of consistency, our results highlight various key points.
The 133 grids can be categorized on a continuum. First, the grids underlie theoretical approaches close to quantitative work, with the criteria “translated” into qualitative, while being rather focused on the description of the research steps, with an emphasis on methodology and virtually no criteria referring back to the theories, epistemology, and values. Second, grids underlie theoretical approaches and criteria that are more specific to qualitative work, with reference to the researchers’ training and their research position, little or no description of the steps, and an emphasis on the values, epistemology, and theoretical analysis.
The notion of interpretation is a pivotal point that differentiates the research positions, while contributing to their semantic confusion. It seems that interpretivism can be given at least two meanings. It can be an interpretation by the researcher, according to his or her theories regarding the words spoken and the observations made of a preexisting reality, and can also be a co-constructed search for meaning between the researcher (reflexive) and actors, according to a context and in a practical use objective. This last definition could also correspond, in part, to the “participative” position proposed by Lincoln, Lynham, and Guba (2011).
We had considered the call for “ethical” criteria in the grids as an indicator of the researcher’s position with regards to the research, its absence indicating an objectivist position and its presence a position that takes into account the values of the research and the impacts. Our results question this premise. It also seems here that the presence of criteria linked to ethics in the grids indicates at least two different meanings, which would also make this axis a pivotal point among the grids. Indeed, the term
Finally, our results show the need to differentiate “mixed methodologies” and “mixed epistemologies” as they do not have the same consequences. The call for mixed methodologies conducted rigorously is now more established (Creswell, 2013; Creswell & Plano Clark, 2010; Paterson & Pentland, 2008; Tashakkori & Teddlie, 2010). On the contrary, the reference to mixed epistemologies is more complex and only slightly identified. Yet, the compatibility of the paradigms is not only a question of philosophy but also refers back to the practical use of the grids as certain criteria are contradictory. Among these grids, some intersect with various paradigmatic and epistemological positions, and their authors propose a combination of the quality criteria that correspond both to positivist and constructivist approaches, with, for each grid, a conception that slightly differs from the combination to adopt. The fact that criteria linked to different epistemologies can coexist in the same grid, visibly unbeknownst to their author, reveals the existence of confusion between the terms used and questions the validity of their use. This confusion can stem from (a) the different meanings of a term (as seen for the terms
Conclusion
The results of our research explain in large part the difficulties in coming to a consensus on the quality criteria for qualitative research. The conception diversity, both inter- and intra-grid, prevents the coherence of the grids. Our results show that the grids examined are not representative of a discipline but rather of a position with regards to qualitative research and objectives inherent to each one. They thus translate the vision that the authors of the grids have of qualitative research, one that is partial at times, and almost always explicit, as per the diverse results obtained. Yet, the goal is now to provide researchers with explicit and adequate tools to assess their work, and to provide experts (and readers) with grids that allow them to evaluate the quality of qualitative work in various health research fields according to comparable criteria. Although standardizing the grids is not the objective, their acknowledgement requires clarifying the terms and comparing them to show the validity of their use and their scientific reliability. The identification of the paradigms underlying the quality criteria conducted in this document is a first step toward understanding the diversity of the qualitative research evaluation grids in the health sciences. As this article is only one part of a more extended research project conducted from 2011 to 2014, these findings therefore cover one aspect of our study. Another part of the research project included the participation of international experts in qualitative research in the health sciences. Each expert and their peers (56 persons) assessed the grids in their own fields. As a result, these experts considered 12 quality criteria consensual. The main outcomes have been published on a website aimed at the dissemination of this research: https://wp.unil.ch/qualityofqualitativeresearch. However, despite its insightful results, our research has two main limitations. First, this study was carried out on the basis of 133 quality grids published in the health sciences field, but we did not conduct a systematic meta-analysis on all the grids published in the field. This may have influenced the results with regard to the distribution of grids by disciplines and the number and diversity of the criteria. Second, we chose Denzin and Lincoln’s (1994/2011) classification, close to our own approach, and this choice resulted in our classification of the epistemological approaches underlying the grids that would have perhaps differed if based on other fundamental works on paradigms in qualitative health.
In conclusion, more work remains to be done, however, for as long as the concepts considered to be major by the authors of the grids are not clearly explained, the grids will only be of limited use in improving and establishing qualitative research in the health sciences.
