Abstract
Keywords
Introduction
Molecular motor proteins are mechanoenzymes associated with the cytoskeleton that powers much of the movement performed by living organisms. There are different motor proteins that coexist in every eukaryotic cell and they differ in the kind of filament they bind to, the direction in which they move along the filament, and their “cargo”. 1 There are three major groups of motor proteins: kinesins and dyneins (that move along microtubules) and myosins (that move along actin filaments). The myosins are actin-based molecular motors and the amino acid sequences of myosin protein families are quite divergent in eukaryotes. The sequence identity of the motor domain could be lower than 20% among some members of the myosin family. In spite of the low sequence identity, the core tertiary structures of myosins are strongly preserved. 2 They are characterized by a motor domain that binds to actin in an ATP-dependent manner, a neck domain consisting of varying numbers of IQ motifs, and an amino-terminal and carboxy-terminal domain of various lengths and functions. The known conserved sequence motifs, called structurally conserved regions (SCRs), were used as a prerequisite for a protein to be considered as a myosin. 3
Abundant information is available about the structural, biochemical, and functional aspects of the motor domain of myosins. 4 However, fewer studies have been performed on the phylogenetic relationships and evolutionary history of the myosin family.5,6 Some of these phylogenetic studies have been carried out by choosing exemplars of the functionally different classes of myosins, while others have focused on the phylogenetic relationships of restricted myosin clades. The structurally variable features of the myosins, such as sequence divergence, gene duplications, and length and domain architectures, might account for the continuous need for a comprehensive phylogenetic analysis of such a prolific family. 7
Given the range of functions, structures, and interactions mediated by myosins, we report a survey of the abundance, protein architecture, conservation, and structure of gene products that contain sequence signatures of myosin domains across the entire proteome and myosin classes. An earlier study is consistent with many results presented here, and we now contribute an expanded structure and motor domain–centric analysis of myosins encountering new myosin families, with an emphasis on the scope of comparative analysis in these protein families. 8 Our results should be useful for the structural and functional prediction by analogy for less well-characterized myosin classes.
Myosins have undergone marked diversification and domain rearrangements; the comparative study of these molecular motors offers great potential to untwine early eukaryote evolution. 9 Other studies have attempted to infer inter-specific relationships specifically within the myosin family, though often with limited taxon sampling, and such analyses of few taxa can be subjected to strong systematic biases, which in turn produce high measures of repeatability (such as bootstrap proportions). 10 In this article, however, we report a large-scale study in the entire non-redundant sequence database to present an unprecedented number of 4,064 putative myosins from different phyla, after certain filters such as the presence of SCRs and motor domain length, in agreement with known classical myosins. In addition, the myosin families are examined in the context of domain architectures and report discovery of ten novel and distinct classes of myosins, mainly from Oomycetes, Cellular slime molds, Choanoflagellates, Pelagophytes, Eustigmatales, Fonticula, and Eucoccidiorida. Due to the availability of large datasets, most of the discussion will focus on the fungal and novel myosin classes.
Results
Identification of myosin genes
To sketch the evolutionary history of the myosin gene family, we surveyed in those eukaryotic genomes for which complete or near-complete genomic data were publicly available. These organisms represent a wide taxonomic diversity of eukaryotes and encompass the major eukaryotic “supergroups.” To survey for myosins, we employed a Hidden Markov Model (HMM)-based sequence search strategy, starting from the head domain of classical myosins (please see Methods for details), to identify myosins using an E-value cutoff of 1e-5 7 . This approach identified 6,044 encoded myosin-like protein sequences, which included sequence fragments (Supplementary File 1). To improve phylogenetic resolution, we considered only 4,064 of them that retain conserved myosin motifs in ATPase domain and the length of the domain was in agreement with the previously known myosin ATPase (head) domains.
We aligned the 4064 head domains of the myosins and calculated a maximum likelihood phylogeny using RAxML to construct a global tree (Supplementary Fig. 1). We considered local bootstrap probability (LBP) and literature information of different functional groups of myosins to divide the global tree into different subtrees, and further, the subtrees were phylogenetically evaluated. The phylogeny for the myosin repertoire thereby includes members from 340 diverse eukaryote species. Members from each of the 35 myosin classes, defined previously by Odronitz and Kollmar (ie, myosin 1 to myosin 35), were also retrieved and included in phylogeny with strong bootstrap support (>50% BS value). A simplified phylogeny showing representative members from each class, along with their domain architecture, has been provided (Fig. 1).
The maximum-likelihood trees were constructed based on the representative motor domain sequences selected from the global tree (Supplementary Fig. 1); bootstrap values (BS) above 50 are shown near the tree nodes with shapes and color.
Nomenclature and population of myosin classes
The previous study for the myosin superfamily reported 35 class, named myosin 1 to myosin 35.
8
The phylogeny presented here confirms the monophyly of most of the previously known myosins to a large extent. Few myosin classes, like myosin 3, 5, and 6, have been split into multiple clades, with adjacent clusters consisting of members of other classes. Our inclusion of newly available sequences, from a more diverse range of organisms, undoubtedly also expands several preexisting families (Fig. 2). For example, the cluster corresponding to myosin 7 class, has now expanded to include previously unclassified sequences. More importantly, greater taxon sampling identifies new myosin groups, including ten novel myosin classes. In the following, by extension to the standard myosin nomenclature, we refer to these new families as myosin 36 to myosin 45. The sequence similarity between myosins of different classes was also plotted in the form of a heatmap (Fig. 3). These observations support the previously known details, like the similarity between classes 15 and 35, 8 and 11 and 7, 10, and 15. We also observe similar but new mega clusters involving classes 3b-9-16-28-43-43-45 and 22a-22b-34-41-42 (Supp. Fig. 1).
Taxonomic distribution of myosin classes in eukaryotic genomes.
Conservation of catalytically important residues in the motor domains of novel myosins
Sequence analysis within core motor domains of myosins classes shows class-specific insertions at nucleotide-binding regions analogous to other ATPases -the P-loop or Walker A and a Switch-II motif, which undergo large conformational changes in the contractile cycle and are highly conserved across all myosin classes and a Switch-I region, which are further variable.
11
The phosphate-binding loop (P-loop) motif GESGAGKT is well conserved across the novel myosins, except for myosin Class-40 where it is replaced by GESGSGKT, similar to myosin classes 9, 12, 15, 16, 17, 20, and 21 (Supplementary Fig. 2). The switch-I region, which has a consensus LEAF-GNAKTIRNNNSSRFGK in conventional myosins, is more variable. In the switch-II region, LDIYGFE, is replaced with LDIFGFE (which occurs in most of the unconventional myosins, 4, 7, 9, 10, 14, 16, 21, 22, 26, 27, 28, 30, 31, 33, and 34) in almost all of the novel classes, except classes 41 and 45, which show a consensus of VDIFGFE and LDLS-GFE, respectively (Supplementary Fig. 3). Nucleotide state is communicated to the actin-binding region, and the lever arm and differences in the consensus sequences could indicate subtle variations that have evolved in nucleotide binding and dissociation affecting the kinetics of myosin motor movement across different classes.
11
A heat map showing the sequence identity between the motor domains of the myosin classes.
Diversification of myosins are achieved by diverse domain architecture of the genes that contain them
Motor proteins are generally composed of a motor head domain that converts chemical energy to force, and a range of additional domains that bind cargo, filaments or accessory proteins.
12
Since regions outside of the motor head domain direct many interactions, considerable functional diversification might be achieved through the evolution of the protein domain combinations. To further investigate the diversification of the myosin family, we identified putative domain architectures for the initial dataset, of all 4064 identified, myosin proteins using Pfam,
13
SMART,
14
CDD,
15
and Paircoil2
16
database searches. In total, we found 218 different myosin protein domain architectures for all identified myosins (individual domain architectures of each of the genes under analysis are available in (Supplementary Fig. 4).
(A) Diverse domain architecture observed in myosin Class-5a. The four-letter code refers to the following organisms – Mmul (
Surprisingly, most such coexisting domains were specific only to particular taxa in our analysis. For instance, Membrane Occupation and Recognition Nexus (MORN) repeats in Haptophytes, actin-binding calponin homology domains were observed only in Oomycetes, and ubiquitin-associated domain (UBA) in Kinetoplastids, indicating that these domain combinations were relatively recent acquisitions (Supplementary Fig. 5). It is also noteworthy that few myosins in our analysis, like in myosin classes 14a, 33, and 26, possess no identifiable protein domains outside of the motor domain. This implies that the great majority of the interactions between these motors and other proteins is controlled either by poorly conserved stretches of peptide or protein domains that are not yet described in protein domain databases, for example, in myosin 14.
17
It is possible to identify myosin-containing genes of unusual architectural forms by annotating the motor domain phylogeny with the protein domain architectures. The unusual domain architectures could be due to the result of secondary loss and gain of domains (Supplementary Figure 6). Accounting for these secondary loss events, 218 protein domain architectures, which were observed in multiple genomes, were specific to a paralogue or family on the myosin phylogeny, suggesting that they represent synapomorphic character state.
A phylogram of representative fungal myosins created by the maximum-likelihood method based on 1,000 replicates. Bootstrap values are given at the nodes. Workflow of sequence search and validation. In all, 378 myosins from 35 classes (Odronitz and Kollmar et al., 2007) were taken as queries for a Jackhmmer search against the NR database. The hits were filtered for the presence of functional (ATP-binding) motifs to obtain 4,064 sequences. The hits were aligned using the indicated parameters in Clustal-W and a phylogenetic tree was constructed using RAxML.

Myosin functional diversity
Our data show that there are robust phylogenetic patterns in terms of the abundance of members and varied domain architecture across myosin classes (Supplementary Fig. 4). Even within the same class, there is a diversity in coexisting domains, which might have major evolutionary significance and influence the topology of protein-protein interaction network (Fig. 4A). Some of the unusual domain architectures in the previously known myosin classes have been discussed in Supplementary Data.18–23 The former classes, 3, 5, 6, 7, 14, 15, and 22 members, have split into two clades and have distinct conserved motifs from each other. Myosin 3a is more similar in sequence to myosins 17, 33, 22, and 20 and myosin 3b to classes 9, 43, 44, 45, 28, and 16. Class-5b is closer to 6b than 5a. Class-7a and 7b are as closer to each other as they are to classes 10, 35, and 15. Class-14a is more similar in sequence to Class-25, and Class-14b is more similar to Class-12. Class-15a is closer to Class-35 than Class-15b. Classes 22a and 22b cluster with classes 27, 41, 34, and 42. Alignments of motor domains of such split myosin classes are provided in the link http://caps.ncbs.res.in/download/myosins/split_myosin_alignments.
Fungal myosins
A total of 1,365 myosins belonging to the four myosin families 1, 2, 5, and 17 were identified in the 244 fungal genomes for which complete or near-complete genomic data are available. Figure 4B shows a cartoon representation of these fungal myosins showing their domains and dimerization status.
Microsporidia are obligate, spore-forming unicellular parasites with a wide host range and they are most closely related to Fungi, despite obvious differences in existence and biology.
19
Although other fungal myosins belong to classes 1, 2, 5a, and 17, the distantly related Microsporidia taxa reveal only two types of myosins: Class-2 and the subclass 5b. The subclass was named as 5b because of co-clustering of few members of the previous myosin Class-5 and similar domain architecture (Mysc, IQ repeats, coiled-coil, and C-terminal DIL domain). However, we also observe less diverse domain architecture in fungal myosins; even though the core domain content is conserved, the data clearly show progressive changes and expansions of individual myosin class domain architectures (Supplementary Fig. 7). Foth et al had identified orphan myosin sequences in
Apicomplexan myosins
Myosin 14 has a single-motor domain. Class-14a is found in Apicomplexans and has a GESGAGKT motif, while myosin 14b is found in
Yusuf et al, 2015 studied the PfMYOB isoform of Class-14 myosin in
Classes 23 to 27 were represented only by Apicomplexan species. Myosin 23 has a MyTH4 domain. Apart from Apicomplexa, myosin 24 sequences are found in Oligohymenophorea, Spirotrichea, and
Identification of novel myosin classes and their putative function
The novel class myosin 36 form a separate cluster with a bootstrap value of 75 next to myosin classes 14, 25, and 11 in the phylogenetic tree (Supplementary Fig. 1). While the Class-25 cluster has Apicomplexan myosins, Class-21 is specific to metazoans and excavates; this new class (myosin 36) is found in members of Amoebozoal phyla Dictosteliida, and the ciliates Oligohymenophorea (
The members in the newly found class, myosins 37, cluster with myosin Class-4 (seen in lower eukaryotes) and Class-31 (specific to Oomycete, Pelagophyte, and Phaeophycae) with a bootstrap value of 55. They are specific to the Cryptophytes
The myosins of Class-38 cluster next to myosin Classes 6a, 30, and another new myosin class, (Class-39) with a significant bootstrap value of 100. They are found in Oomycetes, Bacillariophyta, Pelagophyceae, and Phaeophyceae, sharing a similar phylogenetic distribution as Class-30 myosins, whereas Class-6a myosins are seen only in Apicomplexans. Myosin 6a has the motor domain only, while some sequences have N-terminal SH3 or IQ domains. Myosin 30 sequences have either of the Phosphatidylinositol-binding domains PH (Pleckstrin homology) domain or PX (phagocytic oxidase) domain, sometimes with IQ repeats. One sequence from the alga (
Myosin Classes 39 and 40 are species-specific myosins. Class-39 myosins occur in the Isochrysidale
Myosins in 41 and 42 classes are phylogenetically closer to myosin 22 and 34 (with bootstrap value 85 and 92, respectively). While myosin 23 is found in Phaeophyceae and Bacillariophyta, myosins of classes 22, 41, and 42 occur in Oomycetes. Myosin 34 contains IQ and Ank repeats followed by FYVE as coexisting domains and myosin 22b has IQ and other associated domains like Ank, Pkinase, and adh short C2 domains. The novel class domain architectures are similar to those of Class-34 myosins. Class-41 has two distinct domain architectures – Myosin head followed by FYVE and GAF domain and the other with the myosin head followed by two to four IQ repeats, with or without PDZ domain. Five sequences have GESGAGKT, four have GESGSGKT and one has GESGTGKT in the P-loop ATP-binding site. Class-42 contains the myosin motor domain, followed by FYVE and sometimes interspersed with IQ motifs. Since all the associated domains are known to interact with cell-signaling components, these myosins could be localized in the cell membrane and they have a role in cell signaling. All sequences in Class-42 have a GESGAGKT motif, except two
The myosin classes 43, 44, and 45 cluster with classes 3b, 28 and 16 with bootstrap values of 58, 60, and 87 respectively. Some of the sequences are among the Choanoflagellate orphans reported by Odronitz and Kollmar.
8
Myosins belonging to classes 3b, 9, 16, and 28 are seen only in metazoans. Myosins of classes 43 and 44 are unique to Choanoflagellates and the former has SH2 domain, which is similar to myosin 28 domain architecture (IQ and SH2), whereas the latter has PH, SH2, or Mcp5_PH domains co-occurring with the motor domain. Since PH is involved in cell signaling and SH occurs in membrane kinases, these myosins could also be localized to cell membrane and be involved in cell signaling. The other myosins in the cluster have different domain architectures than the ones seen in the two novel classes: N-terminal Pkinase-IQ (Myosin 3); RA-IQ- C1-RhoGAP (Myosin 9); N-terminal Ank -NYAP (Myosin 16) and IQ-SH2 (Myosin 28). All the sequences in both these classes have a consensus P-loop motif of GESGAGKT. Class-45 myosins are found in molluscs and annelids and are associated with Glucosyl transferase and chitin synthase domains like the fungal myosins. These animals have either a chitinous exoskeleton (Mollusca) or chitin bristles in the exterior surface (
Orphan myosins
The orphan myosins could not be classified since they were less in number (from single to five sequences) and clustered separately in the phylogeny with high bootstrap values. There are 38 myosin sequences that are not affiliated to any of the 45 classes. They are 17 from Choanoflagellates (
The motor domains of Choanoflagellate and Haptophyte orphan myosins have distinct sequences from the other myosins. This illustrates the sequence diversity and evolutionary divergence of myosins from these phyla. As lower extant genomes are analyzed, further divergent myosins can be expected. Some orphan myosins have unusual domains like Ribonuc_l-PSP (
Conclusions
Recently, genome sequencing projects have allowed the identification of the precise number of myosin genes in major eukaryotic species and have remarkably expanded the taxonomic range of known myosins. With more than 4,000 proteins analyzed and a reasonable representation of the different myosin paralogues in the key groups of living organisms, the recovered phylogeny confirms and generalizes previous analyses that were carried out with decreased and taxonomically more limited data sets. In the present study, we reconstructed a myosin family phylogeny taking into account the new genomic information. Starting from 378 classical myosin sequences and their ATPase domains, sensitive computational searches using HMMs against the entire nonredundant database of sequences, gave rise to a broader understanding of putative myosin types in the whole taxa of life. The hits obtained from the initial search, followed by computational validations, permitted us to accept 4,064 myosin-containing gene products for our starting analysis. In our analysis, the most primitive myosins are from
Various analyses such as taxanomic distribution, expansions of existing myosin families, and the realization of new myosin families were possible. We can reiterate about the new myosin families here. Unlike previous analyses reported by other groups, our searches were not taxon-restricted. Our analysis of domain architectures of myosin-containing genes also did not assume one domain architecture for each myosin family. Indeed, this provides a perception of gain or loss of domain architectures across the eukaryotic kingdoms. Certain domain architectures are highly preferred. The classical domain architecture of a myosin containing a head-domain, few IQ repeats, followed by a coiled-coil and cargo-binding domain is too idealistic and examples are slowly emerging in the literature of exceptions to such a domain architecture. Our data show considerable diversity of domain architecture among the classes, for example, myosin 5a reinforcing the notion that individual myosin classes have undergone dynamic evolution. This also indicates that domain-level classification, which secludes the information encoded in the accessory domains, might not be the appropriate approach to understand evolutionary relationship for such a diverse motor family and one should take into account the full-length sequence, consolidating the complete information to comprehensively understand the evolutionary forces that shape the existence of these protein functional domains and domain combinations in each species. For example, myosin 6 does not contain a long-length, coiled-coil domain. 24 It is likely that the presence of a small fraction of coexisting domains and motifs has remained unnoticed, since the large analysis and automatic searches require stringent thresholds. Our report is attempting to aim a broad bioinformatics overview of all possible putative myosins in the entire universe so that several new, hitherto uncharacterized interesting variations of myosin families and myosin family members can be adopted for detailed structure-function characterization. This work also provides a framework and insights into the distribution and likely function of myosins including newly identified myosins supported by phylogenetic inferences and available experimental data, for revealing the relationships and functions.
Materials and Methods
Sequence searches and alignments
Sequence search for myosins was performed using the standalone JACKHMMER 25 program at an E-value of 10−10 against the complete non-redundant protein database (NCBI) as of March 13, 2013, release 58. We started our search with 378 myosin queries that belong to functionally different groups (to find similar sequences in the database). 8 The sequence corresponding to only the ATPase domain was considered for sequence searches, as per previous work. 19 Hits obtained from such sequence searches were included in the myosin family for further analysis if the sequence identities with the query were above 20%, key conserved myosin motifs in ATPase domain could be observed, and the length of the head domain of the hit was in agreement with previously known myosins (Fig. 6). 26 Only those sequences that had a motor domain as verified using standalone PfamScan at E-value of 10−2 were considered.
Protein sequences were aligned using CLUSTALW version 1.7, BLOSUM series scoring matrix and a gap penalty mask based on the aligned secondary structures of the myosins whose crystal structures were known. 27 Where required, minor manual corrections were performed on the alignment based on the knowledge of myosin motifs. The same strategy was employed for each separate alignment of the different myosin groups (see below).
Phylogenetic analyses
Phylogenetic analyses based on protein sequences were carried out using the maximum-likelihood method with the RAxML software (Version 7.4.2) with 1,000-bootstrap replicates. 28 The number of sequences is the most important limiting factor when an exhaustive phylogenetic analysis is attempted under the maximum-likelihood principles. Since the data size of the aligned set of sequences was large, we first reconstructed a global tree based on the protein sequences with the kinesin motor domain as the outgroup. The steps for obtaining the global tree were (1) selection of best-fit models of amino acid replacement for the data (2) calculation of a maximum-likelihood distance matrix with the RAxML program under the BLOSUM model, and (3) the use of the resulting tree topology as a seed to search for a topology with a higher likelihood value using the same amino acid substitution model.
We then divided the global tree into subtrees following two criteria: an LBP greater than 50% and literature information on different functional groups of myosins. The sequences composing each group were aligned using MUSCLE (with iterative refinement). 29 We selected the myosin 1 sequence, which is widely distributed across eukaryotes as an outgroup for the purpose of tree representation in other myosin class-specific trees. In groups of more than five sequences, we followed the same strategy as that used in estimating the global tree. A majority rule of LBP 50% was established for each node in every subtree; unsupported nodes were excluded and their branches forced to yield polytomies.
Domain architecture
All the gene products, containing at least one putative myosin head domain, were then searched against the protein conserved database CDD, 15 SMART, 14 and Pfam database 13 at an E-value of 10−2, to identify and classify coexisting domains. The prediction of coiled-coils was inferred separately using the PAIRCOIL2 program with P-score cutoff of 0.025. 16 Protein domain identification is limited by the sensitivity of the search and the diversity of protein domains in the database. CDD, SMART, and Pfam were used in combination to increase both the sensitivity and the protein diversity.
Gene ontology (GO) annotation
Sensitive sequence searches revealed 134 novel myosins with signatures of functional motifs. These sequences were further validated for the presence of functional motifs and GO annotations using Blast2GO 30 with default parameters. The details are described in Supplementary Table 2.
Author Contributions
Conceived the study: RS. Constructed the workflow: SNP. Performed the sequence searches: SNP. Performed analysis: SNP, IM. Contributed to the writing of the manuscript: SNP, IM, RS. Made critical revisions and approved the final version: RS. All authors reviewed and approved of the final manuscript.
Supplementary Material
The consensus domain architecture, taxonomic distribution and expansion, and unusual domain architecture of the classical myosins have been discussed in detail.
Supplementary Figure 1
ML tree of myosin head domains. The tree is collapsed at key nodes and rooted using the midpoint-rooted tree option. Myosin classes are indicated. Nodal support was obtained using RAxML with 1,000 bootstrap replicates. Different colors have been used to indicate different myosin classes. The arrows indicate the direction in which the phylogenetic tree has to be read.
Supplementary Figure 2
Alignments of the ATP-binding conserved regions in the different myosin classes. The residues that are completely conserved are shaded in black. The green box in the alignment indicates the motif region. The consensus sequence is based on MultAlin 31 : uppercase is identity, lowercase is consensus level >0.5, ! is any one of the amino acids I or V, $ is anyone of L/M, % is anyone of F/Y, and # is anyone of NDQEBZ. Lowercase is consensus level > SimilarityGlobalScore if S, M, or E are used as Similarity Type. Please see Supplementary Fig. 3 for alignment of all the novel myosin classes.
Supplementary Figure 3
Alignments of the ATP-binding conserved regions within the novel myosin classes. The residues that are completely conserved are shaded in black. The consensus sequence is based on MultAlin: uppercase is identity, lowercase is consensus level >0.5, !is anyone of IV, $ is anyone of LM, % is anyone of FY, and # is anyone of NDQEBZ. Lowercase is consensus level >SimilarityGlobalScore if S, M, or E are used as Similarity Type.
Supplementary Figure 4
Diversity in domain architectures observed among each myosin classes. Pfam, CDD, and SMART searches were used to identify putative gene architectures. The gene identifier and four-letter code are elaborated at the end.
Supplementary Figure 5
Diagram of protein domain composition of myosins in different eukaryotic groups. The lineage-specific Pfam domains are illustrated. Darker circles denote a frequency of occurrence in more than five sequences and lighter circles indicate a lesser frequency. Detailed information is provided in Supplementary Figure 6.
Supplementary Figure 6
Detailed diagram of protein domain composition of myosins in different eukaryotic groups. The lineage-specific Pfam domains are illustrated. Darker circles denote a frequency of occurrence in more than five sequences and lighter circles indicate a lesser frequency.
Supplementary Figure 7
The domain-tree view of diverse domain architectures observed in different fungal myosin classes. Schematic representation of domain architecture for each myosin is mapped onto the tree. The tree is based on the distance matrix computed by Jaccard similarity coefficient using the DoMosaic tool. Values marked above the branches refer to Jaccard index.
Supplementary Figure 8
Domain Architecture of the orphan myosins identified in our study. The four-letter code refers to the following organisms: Nvec (
Supplementary File 1
The fasta file of myosin head domains used to compute the phylogenetic tree shown in Supplementary Figure 1.
Supplementary Table 1
The gene identifier, accession Id, organism name, taxa Id, and P-loop motif for all the novel myosin sequences are provided. The sequences having differences in the GESGAGKT motif are highlighted.
Supplementary Table 2
Mapping of Gene Ontology terms (cellular component, molecular function, and biological process) for all the novel myosin domains.
