Monday, July 29, 2019

Simulation of sound change (Open problems in computational diversity linguistics 6)


The sixth problem in my list of open problems in computational diversity linguistics is devoted to the problem of simulating sound change. When formulating the problem, it is difficult to see what is actually meant, as there are two possibilities for a concrete simulation: (i) one could think of a sound system of a given language and then model how, through time, the sounds change into other sounds; or (ii) one could think of a bunch of words in the lexicon of a given language, and then simulate how these words are changed through time, based on different kinds of sound change rules. I have in mind the latter scenario.

Why simulating sound change is hard

The problem of simulating sound change is hard for four reasons. First of all, the problem is similar to the problem of sound law induction, since we have to find a simple and straightforward way to handle phonetic context (remember that sound change may often only apply to sounds that occur in a certain environment of other sounds). This is already difficult enough, but it could be handled with help of what I called multi-tiered sequence representations (List and Chacon 2015). However, there are four further problems that one would need to overcome (or at least be aware of) when trying to successfully simulate sound change.

The first of these extra problems is that of morphological change and analogy, which usually goes along with "normal" sound change, following what Anttila (1976) calls Sturtevant's paradox — namely, that regular sound change produces irregularity in language systems, while irregular analogy produces regularity in language systems. In historical linguistics, analogy serves as a cover-term for various processes in which words or word parts are rendered more similar to other words than they had been before. Classical examples are children's "regular" plurals of nouns like mouse (eg. mouses instead of mice) or "regular" past tense forms of verbs like catch (e.g., catched instead of caught). In all these cases, perceived irregularities in the grammatical system, which often go back to ancient sound change processes, are regularized on an ad-hoc basis.

One could (maybe one should), of course, start with a model that deliberately ignores processes of morphological change and analogical leveling, when drafting a first system for sound change simulation. However, one needs to be aware that it is difficult to separate morphological change from sound change, as our methods for inference require that we identify both of them properly.

The second extra problem is the question of the mechanism of sound change, where competing theories exist. Some scholars emphasize that sound change is entirely regular, spreading over the whole lexicon (or changing one key in the typewriter), while others claim that sound change may slowly spread from word to word and at times not reach all words in a given lexicon. If one wants to profit from simulation studies, one would ideally allow for a testing of both systems; but it seems difficult to model the idea of lexical diffusion (Wang 1969), given that it should depend on external parameters, like frequency of word use, which are also not very well understood.

The last problem is that of the actual tendencies of sound change, which are also by no means well understood by linguists. Initial work on sound change has been carried out (Kümmel 2008). However, the major work of finding a way to compare the major tendencies of sound change processes across a large sample of the world's languages (ie. the typology of sound change, which I plan to discuss separately in a later post), has not been carried out so far. The reason why we are missing this typology is that we lack clear-cut machine-readable accounts of annotated, aligned data. Here, scholars would provide their proto-forms for the reconstructed languages along with their proposed sound laws in a system that can in fact be tested and run (to allow to estimate also the exceptions or where those systems fail).

But having an account of the tendencies of sound change opens a fourth important problem apart from the lack of data that we could use to draw a first typology of sound change processes: since sound change tendencies are not only initiated by the general properties of speech sounds, but also by the linguistic systems in which these speech sounds are employed. While scholars occasionally mention this, there have been no real attempts to separate the two aspects in a concrete reconstruction of a particular language. The typology of sound change tendencies could thus not simply stop at listing tendencies resulting from the properties of speech sounds, but would also have to find a way to model diverging tendencies because of systemic constraints.

Traditional insights into the process of sound change

When discussing sound change, we need to distinguish mechanisms, types, and patterns. Mechanisms refer to how the process "proceeds", the types refer to the concrete manifestations of the process (like a certain, concrete change), and patterns reflect the systematic perspective of changes (i.e. their impact on the sound system of a given language, see List 2014).

Figure 1: Lexical diffusion

The question regarding the mechanism is important, since it refers to the dispute over whether sound change is happening simultaneously for the whole lexicon of a given language — that is, whether it reflects a change in the inventory of sounds, or whether it jumps from word to word, as the defenders of lexical diffusion propose, whom I mentioned above (see also Chen 1972). While nobody would probably nowadays deny that sound change can proceed as a regular process (Labov 1981), it is less clear as to which degree the idea of lexical diffusion can be confirmed. Technically, the theory is dangerous, since it allows a high degree of freedom in the analysis, which can have a deleterious impact on the inference of cognates (Hill 2016). But this does not mean, of course, that the process itself does not exist. In these two figures, I have tried to contrast the different perspectives on the phenomena.

Figure 2: Regular sound change

To gain a deeper understanding of the mechanisms of sound change, it seems indispensable to work more on models trying to explain how it is actuated after all. While most linguists agree that synchronic variation in our daily speech is what enables sound change in the first place, it is not entirely clear how certain new variants are fixed in a society. Interesting theories in this context have been proposed by Ohala (1989) who proposes distinct scenarios in which sound change can be initiated both by the speaker or the listener, which would in theory also yield predictable tendencies with respect to the typology of sound change.

The insights into the types and patterns of sound change are, as mentioned above, much more rudimentary, although one can say that most historical linguists have a rather good intuition with respect to what is possible and what is less likely to happen.

Computational approaches

We can find quite a few published papers devoted to the simulation of certain aspects of sound change, but so far, we do not (at least to my current knowledge) find any comprehensive account that would try to feed some 1,000 words to a computer and see how this "language'' develops — which sound laws can be observed to occur, and how they change the shape of the given language. What we find, instead, are a couple of very interesting accounts that try to deal with certain aspects of sound change.

Winter and Wedel for example test agent-based exemplar models, in order to see how systems maintain contrast despite variation in the realization (Hamann 2014: 259f gives a short overview of other recent articles). Au (2008) presents simulation studies that aim to test to which degree lexical diffusion and "regular" sound change interact in language evolution. Dediu and Moisik (2019) investigate, with the help of different models, to which degree vocal tract anatomy of speakers may have an impact on the actuation of sound change. Stevens et al. (2019) present an agent-based simulation to investigate the change of /s/ to /ʃ/ in.

This summary of literature is very eclectic, especially because I have only just started to read more about the different proposals out there. What is important for the problem of sound change simulation is that, to my knowledge, there is no approach yet ready to run the full simulation of a given lexicon for a given language, as stated above. Instead, the studies reported so far have a much more fine-grained focus, specifically concentrating on the dynamics of speaker interaction.

Initial ideas for improvement

I do not have concrete ideas for improvement, since the problem's solution depends on quite a few other problems that would need to be solved first. But to address the idea of simulating sound change, albeit only in a very simplifying account, I think it will be important to work harder on our inferences, by making transparent what so far is only implicitly stored in the heads of the many historical linguists in form of what they call their intuition.

During the past 200 years, after linguists started to apply the mysterious comparative method that they had used successfully to reconstruct Indo-European on other language families, the amount of data and number of reconstructions for the world's languages has been drastically increasing. Many different language families have now been intensively studied, and the results have been presented in etymological dictionaries, numerous books and articles on particular questions, and at times even in databases.

Unfortunately, however, we rarely find attempts of scholars to actually provide their findings in a form that would allow to check the correctness of their predictions automatically. I am thinking in very simple terms here — a scholar who proposes a reconstruction for a given language family should deliver not only the proto-forms with the reflexes in the daughter languages, but also a detailed test of how the proposed sound law by which the proto-forms change into the daughter languages produce the reflexes.

While it is clear that this could not be easily implemented in the past, it is in fact possible now, as we can see from a couple of studies where scholars have tried to compute sound change (Hartmann 2003, Pyysalo 2017, see also Sims-Williams 2018 for an overview on more literature). Although these attempts are unsatisfying, given that they do not account for cross-linguistic comparability of data (eg. they use orthographies rather than unified transcriptions, as proposed by Anderson et al. 2018), they illustrate that it should in principle be possible to use transducers and similar technologies to formally check how well the data can be explained under a certain set of assumptions.

Without cross-linguistic accounts of the diversity of sound change processes (ie. a first solution to the problem of establishing a first typology of sound change), attempts to simulate sound change will remain difficult. The only way to address this problem is to require a more rigorous coding of data (both human- and machine-readable), and an increased openness of scholars who work on the reconstruction of interesting language families, to help make their data cross-linguistically comparable.

Sign languages

When drafting this post, I promised to Guido and Justin to grasp the opportunity when talking about sound change to say a few words about the peculiarities of sound change in contrast to other types of language change. The idea was, that this would help us to somehow contribute to the mini-series on sign languages, which Guido and Justin have been initiated this month (see post number one, two, and three).

I do not think that I have completely succeeded in doing so, as what I have discussed today with respect to sound change does not really point out what makes it peculiar (if it is). But to provide a brief attempt, before I finish this post, I think that it is important to emphasize that the whole debate about regularity of sound change is, in fact, not necessarily about regularity per se, but rather about the question of where the change occurs. As the words in spoken languages are composed of a fixed number of sounds, any change to this system will have an impact on the language as a whole. Synchronic variation of the pronunciation of these sounds offers the possibility of change (for example during language acquisition); and once the pronunciation shifts in this way, all words that are affected will shift along, similar to a typewriter in which you change a key.

As far as I understand, for the time being it is not clear whether a counterpart of this process exists in sign language evolution, but if one wanted to search for such a process, one should, in my opinion, do so by investigating to what degree the signs can be considered as being composed of something similar to phonemes in historical linguistics. In my opinion, the existence of phonemes as minimal meaning-discriminating units in all human languages, including spoken and signed ones, is far from being proven. But if it should turn out that signed languages also recruit meaning-discriminating units from a limited pool of possibilities, there might be the chance of uncovering phenomena similar to regular sound change.

References
Anderson, Cormac and Tresoldi, Tiago and Chacon, Thiago Costa and Fehn, Anne-Maria and Walworth, Mary and Forkel, Robert and List, Johann-Mattis (2018) A cross-linguistic database of phonetic transcription systems. Yearbook of the Poznań Linguistic Meeting 4.1: 21-53.

Anttila, Raimo (1976) The acceptance of sound change by linguistic structure. In: Fisiak, Jacek (ed.) Recent Developments in Historical Phonology. The Hague, Paris, New York: de Gruyter, pp. 43-56.

Au, Ching-Pong (2008) Acquisition and Evolution of Phonological Systems. Academia Sinica: Taipei.

Chen, Matthew (1972) The time dimension. Contribution toward a theory of sound change. Foundations of Language 8.4. 457-498.

Dan Dediu and Scott Moisik (2019) Pushes and pulls from below: Anatomical variation, articulation and sound change. Glossa 4.1: 1-33.

Hamann, Silke (2014) Phonological changes. In: Bowern, Claire (ed.) Routledge Handbook of Historical Linguistics. Routledge, pp. 249-263.

Hartmann, Lee (2003) Phono. Software for modeling regular historical sound change. In: Actas VIII Simposio Internacional de Comunicación Social. Southern Illinois University, pp. 606-609.

Hill, Nathan (2016): A refutation of Song’s (2014) explanation of the ‘stop coda problem’ in Old Chinese. International Journal of Chinese Linguistic 2.2. 270-281.

Kümmel, Martin Joachim (2008) Konsonantenwandel [Consonant change]. Wiesbaden: Reichert.

Labov, William (1981) Resolving the Neogrammarian Controversy. Language 57.2: 267-308.

List, Johann-Mattis (2014) Sequence Comparison in Historical Linguistics. Düsseldorf: Düsseldorf University Press.

List, Johann-Mattis and Chacon, Thiago (2015) Towards a cross-linguistic database for historical phonology? A proposal for a machine-readable modeling of phonetic context. Paper presented at the workshop "Historical Phonology and Phonological Theory [organized as part of the 48th annual meeting of the SLE]" (2015/09/04, Leiden, Societas Linguistica Europaea).

Ohala, J. J. (1989) Sound change is drawn from a pool of synchronic variation. In: Breivik, L. E. and Jahr, E. H. (eds.) Language Change: Contributions to the Study of its Causes. Berlin: Mouton de Gruyter, pp. 173-198.

Pyysalo, Jouna (2017) Proto-Indo-European Lexicon: The generative etymological dictionary of Indo-European languages. In: Proceedings of the 21st Nordic Conference of Computational Linguistics, pp. 259-262.

Sims-Williams, Patrick (2018) Mechanising historical phonology. Transactions of the Philological Society 116.3: 555-573.

Stevens, Mary and Harrington, Jonathan and Schiel, Florian (2019) Associating the origin and spread of sound change using agent-based modelling applied to /s/- retraction in English. Glossa 4.1: 1-30.

Wang, William Shi-Yuan (1969) Competing changes as a cause of residue. Language 45.1: 9-25.

Winter, Bodo and Wedel, Andrew (2016) The co-evolution of speech and the lexicon: Interaction of functional pressures, redundancy, and category variation. Topics in Cognitive Science 8:  503-513.

Monday, July 22, 2019

Two problems concerning the use of Ancient DNA


Last week I wrote a piece for The Wine Gourd blog, called The role of Wine Influencers — more of the same. I discussed the modern concern in the wine industry with social media Influencers, who use Facebook, Instagram, Twitter, Youtube, etc to promote wine — when LeBron James drinks a wine it will sell a whole lot better (presumably on the principle that “You may not be able to play like LeBron, but you can drink like him”).

My conclusion was that the wine industry has always had what are now called Micro- or Nano-Influencers, involving endorsements from people and organizations who possess an expert level of knowledge as well as social influence. For example, professional wine critics have always fitted this bill, notably Robert M. Parker Jr.

So, the existence of social media Wine Influencers is nothing new — it is simply the modern equivalent of something old.


Well, my blog post here is about the same idea in Ancient-DNA phylogenetics — the idea that, in spite of the claim that modern techniques provide new advantages, we may in fact simply be repeating ourselves. Modern issues are simply modern versions of the same old issues.

First problem

The first issue that I would like to raise is that of molecular data. This is seen as the crucial element of modern studies of ancient remains. Even the recent re-creation of the vineyard of Leonardo da Vinci (La Vigna di Leonardo, in Milan) involved the finding of sufficient DNA from the vineyard land, which was bombed during World War II, to identify the grape cultivar that was grown by da Vinci (Inside Leonardo da Vinci's vineyard).

The issue is that DNA studies, based on direct studies of genotype, are subject to all of the same data-analysis issues as are studies of phenotype (such as morphology, anatomy and ultrastructure).

One classic example is the supposed discovery in the 1980s of the phenomenon of Long-branch Attraction (LBA) in molecular studies. Here, if many shared nucleotide changes occur on distantly related branches of a phylogenetic tree, these branches may actually be reconstructed as sister lineages during the phylogenetic analysis. However, this is simply an example of parallelism, a phenomenon that had previously been known for decades in phylogenetic analyses of phenotype.

Many currently recognized practical problems in genotype studies, such as LBA and compositional biases, are merely specific examples of how analogy appears in molecular biology. Analogy will create convergences and parallelisms, and these will confound the attempt to detect homology.

So, reconstructing evolutionary history using molecular biology is a priori neither better nor worse than using any other source of data, because the same limitations apply. It is simply another type of data.

Second problem

The second issue that I would like to raise is that genome data are a type of Big Data, and the idea that Big Data will apparently solve all ills with data analyses. The idea seems to be that, if you can collect enough data, then you must be lead to "the truth".

This is nonsense — data are just numbers, and numbers can mislead, no matter how many there are. Data need to be interpreted by a human mind, if they are to tell that mind anything useful. The only thing that changes with the use of Big Data is the order in which the steps of the data analysis and interperation occur.

In the Old Days (ie. when I was a student), what we did was:
  1. develop an experimental question
  2. think about potential problems
  3. collect targeted data
  4. analyze the data
  5. interpret the data, to answer the question.
These days, with Big Data, what people do is:
  1. collect a very large amount of data
  2. analyze the data, and try to interpret it
  3. think of a question that the data might answer
  4. discover the potential problems later.
All that is really different is the order, along with which steps are confounded with which other steps.

I don't see that this is necessarily any better; it is just different. So, don't pin your hopes on Ancient DNA genome-scale data to solve problems with your work.

Other issues

Anyone working with Ancient DNA knows that there are oddles of other problems. Some of them are discussed for the general public by Gideon Lewis-Kraus, writing on January 17 2019 for The New York Times Magazine: Is Ancient DNA Research revealing new truths — or falling into old traps? The answer is, of course, "both".

Monday, July 15, 2019

Untangling vertical and horizontal processes in the evolution of handshapes

Justin Power


[This is a guest-post by Justin Power, and the 3rd part of our miniseries on sign language manual alphabets]

In Guido’s most recent post in this miniseries on manual alphabet evolution in sign languages, he discussed the role of character mapping on networks in phylogenetic inference. He pointed out how we used this approach to infer evolutionary pathways of languages, and why this step in exploratory data analysis is important, given the complexity of the underlying signal in this data set.

In this new post, I take up the topic of hand-shape evolution in more detail, explaining some of the complexities involved in studying sign language evolution. I will specifically look at how we can identify both vertical and horizontal processes in the evolution of hand-shapes.

Introduction

We know very little about how signs and hand-shapes actually evolve. There have been a few studies — most of them from decades ago — comparing American Sign Language in videos and dictionaries from the early 20th century with then contemporary forms (Frishberg 1975; Battison et al. 1975). One study in particular argued that, as a sign language emerges in a community of signers, crystallizing into a stable linguistic system, the signs evolve in a quasi-teleological way from earlier, more gesture- or pantomime-like forms to more language-like forms, cutting similar evolutionary pathways leading to more constraints on articulation and to general systematization.

But what happens (in this story) once sign languages become linguistic systems? Do they continue evolving, as happens in spoken languages? If yes, how? Investigating these kinds of questions was one of my motivations for tracking down historical examples of manual alphabets for over a dozen sign languages. The pay-off (besides the thrill of the treasure hunt) is that, by tracing hand-shapes through historical examples and comparing them with contemporary sign languages, we can infer the vertical and horizontal evolutionary processes affecting sign languages and hand-shape forms.

Vertical and horizontal aspects of hand-shape evolution

Consider part of the Neighbor-net from our paper (see Part 1) including the Austrian-origin and Russian groups in the figure below. Russian 1835 is the earliest manual alphabet in our sample published in Russia (St. Petersburg); and Danish 1808, in the Danish subgroup, was published in Copenhagen.


While the two manual alphabets are found in different neighborhoods in the graph, they share a number of hand-shapes, some of which were (and still are) shared widely throughout Europe, for reasons that we discuss in the main paper.

One such hand-shape represents the Latin / Cyrillic letter "A" in both Danish 1808 and Russian 1835, as illustrated in the timeline here.


Note the position of the thumbs at the bottom of the figure: in both early examples, the thumb is adjacent to the bent index finger. In an example from Danish SL in 1907 (and subsequently in 1926 and 1967), the position of the thumb has shifted across the index finger. For Russian SL, too, the position of the thumb in the contemporary hand-shape representing the Cyrillic letter A has crept across the index finger to the front of the fist (the hand-shape in the figure is my attempt to reproduce the source; see here for the real thing).

There are two points to note here in connection with evolutionary processes. First, these changes in thumb position appear to have a vertical aspect: as signers in a community used these hand-shapes and transmitted them to later generations, they also modified the forms in subtle ways, perhaps unconsciously in a process with analogies to sound change in spoken language.

Second, the changes also include a horizontal aspect: the forms evolved in similar ways, as the two signing communities converged on the same shape (apparently) independently, possibly due to similar articulatory or perceptual pressures. The horizontal aspect of this process contributes to signal incompatibility in the dataset underlying the network — the more convergence there is, then the less tree-like will be the Neighbor-net (in this case, the more spiderweb-like).

Convergence

In addition to the preceding example, a typical case of convergence can be seen in the independent creation of similar hand-shapes to represent the Greek and Cyrillic letter "Г".


Beginning again with the main Neighbor-net in the figure immediately above, we see that Russian 1835 and contemporary Greek SL are found in different neighborhoods, with Greek in the French-origin group. The two languages, however, share the Г-representing hand-shape (the Russian form is from Fleri 1835, while the Greek form is, again, my own hand; see here for the real one). Because Greek SL is the only language in the French-origin group to share this hand-shape with the Russian group, there is a clear suggestion of a horizontal process that resulted in similar hand-shapes across unrelated languages. The most likely processes here are convergence due to the independent creation of iconic representations of the written letter; or lateral transfer — called borrowing in linguistics — via some historical instance of contact between signers of the two languages. [My intuition is for the former explanation.]

Borrowing

The final example deals with a clear case of borrowing. The figure below shows the time- / taxon-filtered Neighbor-net, including historical manual alphabets up to about 1840 (see Part 2), but only annotated with the relevant languages.


The two earliest manual alphabets in our dataset were published in Madrid in 1593 (de Yebra) and 1620 (Bonet). In neither case do we see any trace of a hand-shape representing the letter "W", which was not needed to represent these Latin alphabets. Later, too, manual alphabets published in Spain in 1815, 1845, and 1859 still did not include the letter "W". In contrast, in Austrian 1786 and French 1800 (as well as other languages), hand-shape forms representing the letter W are found in the earliest examples we have for those languages. Some 160–230 years later, however, we find similar forms for "W" in contemporary Austrian, French and Spanish SLs. We deduce that contemporary Spanish SL did not inherit the "W" hand-shape from the 19th century Spanish manual alphabets. Instead, the hand-shape may have been borrowed from some other language, possibly French SL given its influence on deaf education in Europe, or possibly later from the International Sign manual alphabet (also part of the French-origin Group).

Conclusion

As these examples show, there are different types of horizontal processes contributing to conflicting signal in the data set. Using the splits network graphs together with historical examples of manual alphabets, we can untangle the horizontal signal in many cases. The approach has also given us some insight into the evolutionary processes contributing to the diversity of contemporary sign languages, a topic that we plan to investigate more fully.

Cited literature, further reading and data
  • Battison, Robin, Harry Markowicz, & James Woodward (1975) A good rule of thumb: Variable phonology in American Sign Language. In Ralph W. Fasold & Roger W. Shuy (eds.), Analyzing Variation in Language: Papers from the Second Colloquium on New Ways of Analyzing Variation, Part 3, pp. 291–302. Washington D.C.: Georgetown University Press.
  • Bonet, Juan Pablo (1620). Reduction de las letras y arte para enseñar a ablar los mudos. Madrid: Francisco Abarca de Angulo.
  • Fleri, Viktor I. (1835) Глухонемые, рассматриваемые в отношении к их состоянию и к способам образования, самым свойственнымих при. St. Petersburg:Типография А. Плюшара.
  • Frishberg, Nancy 1975 Arbitrariness and iconicity: Historical change in American Sign Language. Language 51(3): 696–719.
  • Yebra, Melchor de (1593) Libro llamado Refugium Infirmorum: Muy util y prouechoso para todo genero de gente : En el qual se contienen muchosauisos espirituales para socorro de los afligidos enfermos, y para ayudar à bien morir a los que estan en lo ultimo de su vida ; con un Alfabeto de S. Buenauentura para hablar por la mano. Madrid: Luys Sa[n]chez
A comprehensive reference list can be found in our pre-print at Humanties Commons. The raw data and analysis files are available via GitHub.

Other posts in this miniseries

Monday, July 8, 2019

Character cliques and networks – mapping haplotypes of manual alphabets


[This post is the second part of our miniseries on the origin and evolution of sign language manual alphabets]

One aspect of exploratory data analysis (EDA) is for us to try to understand how our data relate to our inference(s). This is especially important when the signal from our data is increasingly complex. Sign language manual alphabets are such a case.

In our first post about sign language manual alphabets, I introduced the principal networks that we used to classify sign languages. Here, I'll describe our character mapping procedure and why we did it as part of our EDA framework, in order to establish scenarios for the origin and evolution of sign languages.

Characters and mapping

We encoded each hand-shape used to signify a certain concept, such as the letters included in the standard Latin alphabet "a", "b", "c", .... "x", "y", "z", as a binary sequence – the presence or absence of a certain COGID (we will explain and discuss this in a later post). These binary sequences can be seen as an analogy of the genetic code, as a sort of 'linguistic haplotype', and their evolution can be mapped onto a network based on the entire dataset.

For instance, our matrix has three binaries (haplotypes) for the concept [g] in the oldest set of sign languages (pre-1840), two of which can be found in the earliest alphabets in our dataset: those of Yebra 1953 and Bonet 1620. Russian 1835, the oldest Cyrillic alphabet, uses a somewhat different hand-shape for its counterpart of the Latin "g", the Cyrillic "г".

For the concept [g], we thus have three taxon cliques, each defined by a distinct binary/haplotype: the 'Yebra haplotype', the 'Bonet haplotype', and the 'Cyrillic haplotype'.

By mapping these haplotypes on the network, as shown in the next figure, we can see that there is a small edge bundle reflecting the basic split between the Yebra and Bonet haplotypes.

Hand-shape drawings are taken from the original manuscripts.

We can also see that the Russian haplotype either evolved from the Yebra haplotype kept in the older Austrian-origin Group, ie. is an adaptation of the Yebra haplotype, or that it is a genuinely new invention — note the similarity of the Russian hanshape with the letter г.

We repeated this procedure for all 26 concepts of the standard Latin alphabet, to get an idea of how often the encoded linguistic haplotypes fit with the overall pattern visualized in the inferred Neighbor-nets (ie. the neighborhoods as defined by edge bundles). This is shown in the next figure.

The arrows indicate inferred evolutionary processes (replacement or invention).

Using this network mapping(which, in principle, uses the logic of parsimony/median networks), we can make direct inferences about the general mode of evolution.

For instance, even though Russian 1835 uses a different set of hand-shapes (ie. is defined by partly unique haplotypes), the hand-shapes for the concepts [p] and [z] are exclusively shared with the Austrian-origin Group. The biological equivalent would be: the 'Austrian haplotypes' are a uniquely shared derived feature reflecting a putative common origin of the Austrian and Russian lineages — ie a potential linguistic synapomorphy. We also can see that all haplotypes shared by Russian and all ([a][c][f][r][u][y]) or part ([b][e][i][k][n][o][x]) of the French-origin Group, an alternative source that may have inspired this early Cyrillic alphabet, lack this quality.

We can also make inferences about:
  1. which hand-shape is the original one (O);
  2. lineage-specific / diagnostic hand-shapes, eg. At. = Austrian, Da. = Danish (using two letter abbreviations);
  3. which hand-shapes are shared but apparently derived, eg. At.-Fr. are hand-shapes / haplotypes shared by members of the Austrian- and French-origin groups not found in the Yebra or Bonet alphabets — C stands for cosmopolitan, non-original handshapes common in various lineages, including British-origin Group, and D represents derived but rare hand-shapes without any clear lineage-affiliation; and
  4. alphabet-unique (ie. represent a linguistic autapomorphy.
In addition, we can explore certain details, including patterns (character-based taxon cliques) that are at odds with the overall reconstruction. The latter are to be expected, because the graph is planar (2-dimensional) but the processes that shaped sign alphabets are likely to be multi-dimensional. For instance, our networks failed to resolve the affinity of the contemporary Norwegian Sign Language, the reason for which can be seen in the following character map.


Note the position of Norwegian 1955, which is still part of the Austrian-origin Group (like older manual alphabets used in the late 19th century in Norway). However, it is already influenced by international standardization — eg. concepts [k], [p], and [z] use(d) French hand-shapes. Hence, Norwegian 1955 shares quite a high number of lineage-diagnostic hand-shapes with Danish 1967 and the Icelandic Sign Language. These, and others, were further replaced in its contemporary counterpart (Norwegian SL) by hand-shapes borrowed from various lineages — eg. [c],[f] from the nearly extinct Austrian-origin Group, [p] from the Russian Group, [k] same as in the Spanish Group) — as well as unique hand-shapes, including hand-shapes evolved from earlier forms or those that have been genuinely invented.

Why we map character evolution along networks

In many cases, we only have one set of data, in order to draw our conclusions based on the graph(s) we infer. We cannot test to which degree our data (the way we scored the differentiation patterns) and inferences are systematically biased. Thus, we want to explore which aspects of our inference are supported by character splits, and establish taxon cliques and evolutionary pathways for the characters (scored traits). Lacking an independent source of data, the latter would involve circular reasoning — ie. mapping the traits along a tree derived from those same traits.

By inferring a tree, we crystallize one pattern dimension out of the data, although more often than not this will be a comprise from multidimensional signals. A network, such as a Neighbor-net, has two dimensions, and hence our mapping can consider two alternatives at the same time — this enables us to make a choice, if we have to. Another practical advantage of a Neighbor-net is that it is quick to infer, so that we can easily reduce the data set and use a more focused graph for the map.

In cases where 2-dimensional graphs don't suffice, there are still Consensus networks, which would allow mapping character evolution based on a sample of many alternative trees.

We could even eliminate the circular reasoning while maintaining a relatively stable inference framework. Deleting a character or several characters (or recoding them: see eg. Should we try to infer trees on tree-unlikely matrices?) can easily lead to a new tree topology, although it has less effect on the structure of a Neighbor-net. When we would need to worry about circular reasoning for mapping a certain concept, or two concepts that may have interacted, we just base our Neighbour-net on a distance matrix calculated from a reduced character matrix, and then map only those concepts not considered for the inference.

Other posts in this miniseries

Monday, July 1, 2019

Stacking networks based on sign language manual alphabets


This post is the first of a mini-series on sign language manual alphabets. While the evolution of spoken languages has been studied intensively using phylogenetic methods, sign languages have not, as yet.

In this post we will first introduce our readers to a set of stacked networks, and how it assists in establishing ancestor-descendant relationships in a pretty straightforward (but not trivial) case: the evolution of manual alphabets in sign languages. In the next post, I will demonstrate the use of networks for character mapping and putting forward hypothesis about ancestor-descendant relationships.

In 2004, Spencer et al. (Two papers you may want to read...) showed that Neighbor-nets outperform tree inferences when it comes to explicit ancestor-descendant relationships. The data set they used was quite particular: copies of written text. Here, scribes copy a text, and then other scribes, some of them ignorant of the language of the text they are copying, copy the copies. In the paper, the sequence of copies was recorded (the 'true tree'), and then the various texts were transferred into phylogenetic matrices, in order to infer trees and networks, and then this result was compared to the 'true tree'. The best fit of the data to the truth was the Neighbor-net.

This is a compelling conclusion, because, as a planar network and in contrast to median networks, Neighbor-nets don't explicitly place taxa in ancestor-descendant relationships. However, we have shown for many cases here at the Genealogical World of Phylogenetic Networks how ancestors are often placed with respect to their descendants: they are often closer to the center of the graph, or the root when known, and thus they bridge the center or sister lineages and their descendants. We can thus see why Neighbor-nets might be useful in practice.

In this context, the evolution of sign language manual alphabets, ie. the hand-shapes used to represent letters of a written alphabet, should be relatively easy to reconstruct. Once an alphabet is established in a sign language school / community, the ancestor, it will be passed on to other "generations" within the community and other schools / communities, the descendants. However, this is not necessarily a dichotomous process, as depicted in the first figure.

A scheme depicting how manual alphabets may evolve and disperse.

There are a few complications here: for example, hand-shapes may change in course of being used (the hand-shape evolves); contact may lead to exchange or appropriation of hand-shapes (called "borrowing" in linguistics); and, in some cases, entire alphabets will need to be adapted to a particular use. The latter case occurs when changing from one script (Latin, say) to another (Cyrillic or Arabic) — the first formal school for the deaf was established in Paris, for example. As a teacher, I need to decide: Do I take a hand-shape from the morphologically similar letter, or the phonetically similar one? As a scientist, I need to assess the homologies among such hand-shapes without inflicting systematic bias.

Standardization will wipe out local customs and replace them with a multinational standard. For instance, Country 2 in the scheme above, drops its original B-type manual alphabet (red) for an A-type (blue); and in Country 7 both traditions are fused. Over time, originally distinct sign languages may converge due to geographic proximity, or even just feasibility.

The evolution of spoken languages has been studied intensively using phylogenetic methods, and in particular networks are much more commonly found in the linguistic literature than in the biological one. For sign languages we have made a first step in a recently published pre-print:
Justin M. Power, Guido W. Grimm, and Johann-Mattis List (2019) Evolutionary dynamics in the dispersal of sign languages. Humanities Commons. http://dx.doi.org/10.17613/0smt-j414
What excites me about our study is that it combines historical manual alphabets (going back to 1593), which are potential ancestors, with a set of modern-day alphabets, which are their likely descendants. The data set is thus an evolutionary paleontologist's dream (and, possibly, a cladist's nightmare, if we expect a simple tree-like set of relationships rather than a network). As a scientist, I simple love to boldly go where no-one has gone before.

The next figure shows the all-inclusive network from our paper, but focusing on the age of the manual alphabets.

For more linguistic details see the pre-print.
* Historical version(s) of these lineages are not included in our data set

Obviously, there has been quite a lot of evolutionary changes, as well as standardization, going on, although some parts, like the Swedish SL (sign language), have stuck to its unique original. Historical and contemporary Spanish / Catalan are still most similar to the oldest manual alphabets that Justin dug out for our study. On the other hand, the contemporary Norwegian SL is placed far apart from his historical counterparts, and lacks any obvious affinity. Austrian, Danish, and German look back on a long and diverse history, the green "Austrian-origin Group", but the contemporaries have been homogenized by standardization (note the closeness to the International Sign manual alphabet). If we use an analogy with common biological and biogeographical processes (such as range expansion, competition, extinction, etc), then the Austrian-origin Group only survived in a remote island population, where we still find a sort of living fossil, the Icelandic SL.

In contrast to biological data, the old, putatively ancestral, manual alphabets are not closer to the graph's center, or the oldest manual alphabets in our data set. The reason for this seems to lie in the data itself and how manual alphabets evolve, and this will be the topic of the next post(s).

Still, we can isolate some evolutionary pathways, especially when we make time-wise taxon-filtered networks and stack them (see this introduction to stacking and this application using Osmundaceae, a data set including an even larger ratio of fossil taxa to modern taxa).

Fig. 4 from Power et al. Coloring same as above: pink – Spanish; turquoise – French-origin; green – Austrian-origin; orange – Polish; red – Russian; light blue – Swedish Group. The English-origin and Afghan-Jordanian groups are not included, since not represented by historical manual alphabets in our data set

Each of the three networks includes manual alphabets from a certain time period, starting with pre-1840 at the bottom, historical 19th-/20th-century manual alphabets in the middle, and post-1950 manual alphabets in the top network. The dotted links between the networks connect manual alphabets that are included in two of the networks.

Even from these graphs alone, we can say a lot about how ancestors (original manual alphabets in a country) relate to descendants (later and contemporary manual alphabets) and their evolutionary pathways. Here are some examples.

Shortly after the time when the first schools for the deaf were established in continental Europe (late 18th, early 19th centuries), manual alphabets showed quite a diversity, and were very different from their potential Spanish sources, such as Yebra 1593 and Bonet 1620, with the French and Austrian teachers and communities going different ways. The oldest Cyrillic alphabet, Russian 1835, is more closely related to (ancient) Austrian than it is to (ancient) French.

The Swedish manual alphabet of 1866 is a fresh invention. Some hand-shapes may have been borrowed from one or another alphabet in use on the continent, but, as we will see in the next post of the series, includes genuinely new forms.

The French tradition was dispersed into the new World (American SL appears to be a direct derivation from the French, while the Brazilian SL is an adaptation) but remained a relatively homogeneous group. On the other hand, the Austrian-origin languages diversified, in particular within the Danish influence zone. Politically, the Danish king ceded Norway to Sweden in the Treaty of Kiel 1814 (note the distance between Norwegian and Danish languages in the late 19th century), while Iceland was a Danish dependency until 1918, when the Danish-Icelandic Act of Union was signed. Furthermore, the German manual alphabets subsequently diverged from the Austrian source.

The Polish manual alphabet, originally an adaptation of the Austrian-Danish manual alphabets (see the graph in the middle), became closer to the Russian group, with the Latvian sign language taking up an intermediate position. The Cyrillic alphabets evolved further away, too (top graph).

In the following post(s) of this miniseries, we will explain what we learned from simple character mapping on the time-taxon-filtered networks, and how to score manual alphabets in the first place.


Follow-up posts in this miniseries