Abstract
“The potential for semantic confusion is greatest with interdisciplinary research” (West, Griffin, and Gardner, 2007, p. 428)
Introduction
In recent years much has been written about the evolution of the human capacity for language, with significant progress on a number of issues, for example how evolution can help explain language universals (e.g., Kirby and Hurford, 2002), how individuals negotiate upon the meaning of signals (e.g., Galantucci, 2005), and how and why the human vocal tract most likely evolved (e.g., Fitch, 2000). However, the social evolution of human communicative behavior has remained at the stage of speculation, with several hypotheses (e.g., Dessalles, 1998; Dessalles, 2000; Dunbar, 1997; Fitch, 2004; Hurford, 2007; Knight, 1998; Miller, 2000; Pinker, 2003; Ulbaek, 1998) but little data advanced. Worse, much of this speculation misuses or misunderstands the key concepts and terms of social evolution theory, and as a result a number of the positions taken are incoherent. The purpose of this paper is twofold. First, to highlight a number of specific examples of such misunderstandings; and second, to show that when social evolution theory is correctly applied it produces simple, parsimonious explanations. As I hope to show with a range of examples, there is a real need for such an analysis, for confusion within evolutionary linguistics on matters of social evolution theory is remarkably commonplace.
In order to prepare the ground for the arguments to come, the next section makes explicit a crucial distinction, between
Ultimate and proximate explanations
The difference between ultimate and proximate explanations is a rich source of possible confusion, and so should be clearly delineated at the outset. The difference is typically explained as the difference between the
Ultimate explanations describe the processes by which the present set of behaviors observed in a population arose and are maintained. It is here that terms like kin selection, reciprocity and the like — the technical terminology of social evolution theory — find meaningful use: as processes that explain the continued presence of behaviors (like altruism) which would be otherwise unexpected under the laws of natural selection. Indeed, such ultimate, process-driven explanations of social behavior are, essentially, the
In fact, ultimate and proximate explanations can be further divided: the former includes both
All of these explanations are desirable. Indeed, they complement each other. Moreover, because the ultimate and the proximate are different
However, such confusion does often arise. To illustrate, consider recent work on the evolution of cooperation in humans, and specifically the idea of
This is not the only example, by any means. The next section surveys some of the various hypotheses that have been proposed in response to the question of why humans should talk to each other at all, and shows that conflation of proximate and ultimate explanations is remarkably commonplace. However, when we keep the distinction clear then a simple and very general answer presents itself: we talk to influence the future behavior of others.
Why talk?
When we talk we pass information onto our conspecifics, some of which will be of value and use to them. Yet talking carries production costs; if nothing else there is the opportunity cost of the time spent in conversation (Maynard Smith and Harper, 1995). Why, then, do we talk? Of course, communication most obviously allows us to coordinate our behaviors with far more ease than otherwise, but our communicative instincts go far beyond this. We have a “robust and passionate urge of some kind to communicate” (Bates, 1994, p. 139) with the result that a surprisingly large proportion of the time we spend in conversation is spent on those relatively mundane and trivial matters that we call gossip (Dunbar, Duncan, and Marriott, 1997). Why, then, are we such enthusiastic users of this skill? This question may sound naive at first blush, but no evolutionary answer is immediately forthcoming. Indeed, any real naivety here lies in “linguistic communism” (Bourdieu, 1991, p. 43): the assumption that humans will simply talk to each other for no reason other than the altruistic sharing of information. Correspondingly, several researchers have correctly identified the matter of why we talk at all as a key question for evolutionary linguistics (Dessalles, 1998; Hurford, 2007; Knight, 1998).
The well-known processes of kin selection and/or reciprocal altruism have both been proposed as solutions, or partial-solutions, to this problem. (Reciprocal altruism is in fact a misnamed theory, for it is not a type of altruism at all. On the contrary, what the theory purports to explain is why superficially altruistic acts are in fact
Moreover, there are further reasons to suppose that speaking has benefits for the speaker, even when unreciprocated. One is our anatomical divergence from other primates: our vocal tract is significantly different to both our primate cousins and our ancestors, mainly due to the descended larynx, while our ears are not. Although it has been argued that the descended larynx is an adaptation for speech (e.g., Lieberman, 1984), the most compelling explanation is that it evolved to exaggerate size, as it has in other species: a descended larynx produces greater formant dispersion, which is negatively correlated with size (Fitch, 1999, 2002). Vocalizations thus produce some payoff for the speaker, at least in some contexts. As for ears, if information transfer were the function of language, then we could reasonably expect our ears to have developed adaptations to better capture all the input that they can, so that background chatter was no longer just noise but a set of many distinct information sources (Miller, 2000). Yet this is not the case. Of course, these thoughts are pure speculation, but the reasoning is certainly not fanciful. At the very least, they are questions that must be addressed by those who suppose that talk is simply the provision of useful information.
A computational model poses a similar question: if the payoff to communication were simply that the listener gains information, then we should predict more true synonymy and less homonymy (Hurford, 2003). The model works as follows. Agents send and receive signals from one another and are evolved according to one of two possible selection pressures: their fitness is defined either as their ability to signal their intended meaning correctly, or as their ability to deduce intended meaning correctly. Under the first condition the resulting language is free of true synonymy and rich in homonymy, just as we observe in all natural languages. Under the second condition, however, the resulting language was rich in synonymy and free of homonymy, a situation never observed among natural languages. The emergent languages hence resemble real languages only when selection is placed on the agent's productive capabilities, and not their receptive capabilities. This suggests, like the anatomic observations above, that speaking is not the altruistic act of information sharing that a naïve reading would suppose it to be. Instead, the speaker is likely to have gained something from the interaction, just as the listener will have done. It is therefore hard to conclude that the stability of speaking is dependent on reciprocation, a process by which individual speech acts are costly but are maintained through a process of exchange. On the contrary, they suggest that the speaker must receive some net payoff from each speech act. In light of these arguments, and the lack of cheater detection discussed above, reciprocation seems a highly unlikely explanation of our enthusiasm to converse (Scott-Phillips, 2006).
One scholar similarly observes that “a human speech-community is not a personal mutual aid network but is typically an extended group transcending the limits of affiliation on the basis of residence, economic co-operation or kinship” (Knight, 1998, p. 75). However, this observation is then followed by just the sort of error discussed earlier: conflation of the proximate and the ultimate. Specifically, it is suggested that the problem was overcome by a “profound coalitionary restructuring [of human sociality]”. This is a proximate explanation — the structure of human sociality is a
This is not an isolated example. Consider the suggestion that our syntactic abilities are based upon the same cognitive mechanisms that must underpin reciprocal altruism (Bickerton, 1998). The assumption here is that because reciprocity is about who did what to whom and when, then there must be some cognitive representation of the corresponding social calculus, and it is this representation that it is suggested was exapted for use in syntax: “thematic analysis
If it is hard to envisage how reciprocity could occur without a representation of who did what to whom and when, then ask the same question about, say, incest avoidance. Our brains are not pre-wired to represent our family-tree. The evidence instead suggests that a simple heuristic, namely “Do not have sex with anybody who grew up in the same household as you”, does the job to a high-enough degree of approximation (Shepher, 1971, 1983; Wolf, 1970, 1995). Similarly, if humans do engage in reciprocity, there is no reason to suppose that it will not be based upon some similar heuristic (possibly, ironically enough, one that makes use of linguistic markers like, say, accent (Nettle and Dunbar, 1997)). It is not the case that “
This conflation of the proximate and the ultimate, which, as observed in the previous section, renders such claims technically incoherent, is not the only error that evolutionary linguists have made with social evolution theory in general and reciprocity in particular. Consider the following: “If we inform only those people who are likely to return the favor, both of us can gain the benefits of trade…” (Pinker, 2003, p. 28). This is the reciprocity hypothesis (c.f. “…benefits of trade…”). This is then followed by “…it seems clear [though no data is offered — TSP] that we do use our faculties of social cognition to ration our conversation to those with whom we have established a non-exploitative relationship” (Pinker, 2003, p. 28). This is a more general statement, and not the same thing as the reciprocity hypothesis, which specifically states, when it is used correctly, that we
That is not to deny that the establishment of equitable relationships was an important landmark in the phylogeny of the language faculty. It is just that such relationships are unlikely to have arisen within the tight and specific constraints of reciprocity. A more general explanation is more likely, and in the next section I will argue for a major role for reputation. Before then, however, I wish to briefly comment on two of the selfish suggestions as to why we talk without relying on reciprocation, both drawn from the evolutionary psychology literature. Doing so will enable us to see more clearly that the “incredibly boring” (Pinker and Bloom, 1990, p. 708) conclusion that language evolved under a pressure to communicate complex propositions is most likely the correct one.
First is the thesis that gossip acts as a form of social glue in the same way that grooming does in non-human primates (Dunbar, 1993, 1997). Although this certainly has a degree of plausibility, two comments should be made. The first is that gossip/grooming is by no means all that we use language for. We frequently talk to others even without any desire to build or maintain social relationships. Indeed, we occasionally talk to people we would like to break ties with. Second, and more pertinently, this hypothesis does not explain why language need be as complex as it is: if social grooming was the sole function of language then grunts could perform the same task, but instead we have a far more cognitively complex and evolutionarily costly communicative mechanism (Pinker, 2003).
A second prominent idea is sexual selection. The suggestion that this process played a prominent role in the evolution of language goes back as far as Darwin (1871) but it, too, cannot tell the whole story, a fact acknowledged by its most prominent modern-day advocate (Miller, 2000). After all, we use communication for far more than attracting mates. That does not rule it out completely, of course. Neither do the following two arguments, both of which have recently been argued to speak against the sexual selection hypothesis (Fitch, 2004): first, that sexually selected traits are usually sexually dimorphic, with the displaying sex (typically, but not necessarily, the male) expressing such traits exclusively, or at least to a far greater degree, than the choosing sex (typically, but not necessarily, the female); second, that sexually selected traits are not usually expressed until puberty.
With regard to the first point, the degree to which we should expect to witness sexual dimorphism is in large part determined by the mating system employed by the species, and in particular the relative levels of parental investment in child rearing: the more investment a parent has to make, the less likely they are to display sexually selected traits (Andersson, 1994; Cartwright, 2000). In humans the norm includes a significant role for
In conclusion, then, we may only be equivocal: sexual advertisement may, like gossip, have a role to play. Yet still, these two possibilities (and any possible effects of kin selection) do not at all exhaust the range of domains in which we speak. In short, we put language to an indefinably large number of uses. This is true of many human traits: “hands may be very good for throwing rocks, shooting arrows, and pressing triggers, but we not infer that manipulability therefore arose as an adaptation for success in aggression” (Gould, 1991, p.53). Indeed; instead we simply conclude that such manipulability is an adaptive end in its own right; a versatile tool with a wide range of applications. Similarly, we use a range of linguistic devices — imperatives, declaratives, requests and the like — to fulfill a diverse range of sociobiological functions, amongst them sexual advertisement and grooming. What these functions have in common is that all involve the alteration of another's behavior. Rather than claim that any single one is
Does such a conclusion actually answer our initial question? We talk to influence the behavior of others, but if that is the whole story then why should others pay attention to our utterances? Indeed, what stops us from simply making claims without any substance, simply so that others will believe them and act accordingly? In the conventional terms of social evolution theory, how is honesty maintained in such a system? That question is the focus of the next section.
Honesty
The theories discussed in the previous section — kin selection, reciprocal altruism, etc. – are those used by social evolutionists to discuss animal social behavior in its many forms, including communication. But communication also brings its own, unique problem, and with it its own set of theories. That problem — honesty, or reliability — is arguably
Before we consider this matter for the case of human communication, a brief terminological aside is merited. In its everyday use,
We may now ask what keeps human communication stable. That is, what prevents dishonesty becoming so widespread that trust in others' utterances simply breaks down? This question is surprisingly little addressed by language evolution researchers, mainly because those who have thought about language in the context of social evolution theory have, as discussed in the previous section, mainly argued that linguistic use can be explained through reciprocity or kin selection, processes that address the problem of honesty before it is asked. But if we accept that these are unlikely explanations of our willingness to talk then we must look elsewhere for a Darwinian explanation of honesty. Outside of social evolution theory it is sometimes thought that the handicap principle (Grafen, 1990; Zahavi, 1975; Zahavi and Zahavi, 1997) is the only process by which cheap signals may remain honest, but this is not the case: cost-free honesty does exist in nature, and not just in humans. Many male passerines, for example sparrows, typically display dominance badges on their plumage; the larger the badge, the greater the bird's RHP (Resource Holding Potential; an indexical measure of all factors that influence fighting ability (Parker, 1974)). However, there appears to be no cost associated with the badge, and no obvious barrier to falsification (Rohwer, 1975; Whitfield, 1987).
What, then, are the alternative processes that may explain such a phenomenon? Several have been identified by social evolution theorists (Maynard Smith and Harper, 2003):
Of these, it is clear that indices cannot offer a solution to our question, since linguistic form is unrelated to meaning. The handicap principle seems similarly unlikely: utterances are metabolically cheap. What of the other three suggestions? A first observation is that they are not strictly independent of one another. A loss of reputation, for example, whilst ostensibly coming under the banner of repeated interactions, could also be seen as a punishment. In fact, something very similar has been proposed as a solution to the problem at hand.
The crucial idea behind this proposal is that costs are paid not by honest individuals as a guarantee of their honesty (
This is different to the dominance badges of sparrows because sparrows will pay costs
In other words, if signals can be verified with relative ease, and if there are costs attached to being revealed as a liar, then cost-free signaling can be evolutionarily stable. In situations where these conditions do not hold, we should expect signals to be costly. Specifically, this will apply in interactions where the protagonists are unlikely to meet again or when communicating about otherwise unverifiable information (Lachmann, Számadó, and Bergstrom, 2001). Of course, dishonesty may still be the more appropriate strategy if the expected payoff is sufficiently large that it outweighs the expected costs. In other words, deception may still pay under certain circumstances, specifically when the benefits associated with doing so outweigh the social costs that will follow.
Hence, honesty can be ensured if dishonesty is costly. One important implication of this analysis is that it allows for signals to take an arbitrary form (
I know of no attempts to test this idea with human subjects. In fact, evolutionary linguistics has thus far shown little interest in Lachmann, Számadó and Bergstrom's work, despite its evident power and plausibility. This is not due to a lack of exposure. The work was published in a major journal (
Conclusions
I have sought to make three crucial points. First, I have made clear the distinction between ultimate and proximate explanations of behavior: the first is concerned with the process of evolution, the second with its product. These are different types of explanation that, when used correctly, will support and reinforce each other. However, the two are often conflated within the language evolution literature, a practice which leads to epistemically incoherent claims.
Second, even when understood correctly, social evolution theory is often over-applied in evolutionary linguistics. A specific example is reciprocity, which in many cases appears to be used in a fuzzy way to refer to general social cognition. However, a brief analysis of linguistic behavior shows that its specific claims — that speaking is costly when unreciprocated — are highly unlikely to be true.
This conclusion opens up the question of honesty. Contrary to a common belief outside social evolution, the handicap principle is not the only explanatory candidate here. My third central point is therefore that one specific alternative — that individuals pay costs for
It is hard to escape the conclusion that social evolution theory has been too often misused by language evolution researchers. This is regrettable, for when applied correctly it is a powerful explanatory tool. It is hoped that this article has shown how that power might be correctly applied to language.
