Drosophila flight muscle thick filaments have several non-myosin proteins: flightin [
54], myofilin [
45], stretchin-klp [42, 55], paramyosin and miniparamyosin [
56], kettin [
41], projectin [
39], and obscurin [
37]. Kettin, projectin and obscurin are either too low in abundance relative to myosin for them to be visible in a helically averaged 3-D reconstruction, or they are located in areas, like the M-band and the filament ends, that are avoided by segment selection [
33]. Proteins, like paramyosin and miniparamyosin, follow a different symmetry than the myosin molecules and are in low abundance in
Drosophila compared to
Lethocerus or
Bombus. Consequently, they are averaged out in the reconstruction. Using Chimera segmentation, we could separate three non-myosin densities, which we attribute to myofilin, stretchin-klp and flightin (
Figure 6).
Myofilin (always shown in yellow) was tentatively identified as the second non-myosin density in
Lethocerus [
31]. The
Drosophila myofilin amino acid sequence is the smallest of the three species so-far reconstructed [
36]. Myofilin is predominately found on the inside surface of the myosin tail annulus (
Figure 6a-d). The myofilin density in
Drosophila has a significant contact with a single myosin tail from a single myosin layer. The small domain found in
Drosophila appears to correspond to the N-terminal domain identified in
Bombus based on its shape and superposition with the similar density. Both the
Bombus and
Drosophila myofilin have segments that pass between two myosin tails within a myosin layer (
Figure 6c). The folded domain of myofilin is close to but does not contact flightin (
Figure 6d).
Stretchin-klp (always shown in purple) was first visualized in the 7Å reconstruction as a chain of three densities repeating along a left-handed helical track on the backbone surface [
33]. Two of the densities were well defined, the third much less so. Connectivity was suggested. Here we see the same three density repeat with better defined connections and more density for the third, more heterogeneous density (
Figure 6a). The stretchin-klp density is completely on the outside of the filament backbone but is nestled into valleys between myosin tails (
Figure 6b). Stretchin-klp makes no visible contact with myofilin (
Figure 6a-d). Although stretchin-klp passes over flightin, it makes no visible contact (
Figure 6b). Stretchin-klp interacts primarily with a single myosin layer although its continuity over the five quasi repeats in a single molecule gives it interactions with multiple myosin layers.
Flightin (always shown in red) was first visualized in
Lethocerus as a small, folded domain, since identified as the WYR domain [
36], followed by an extended peptide chain that reached outside of the myosin tail annulus to contact the proximal S2 [
31]. Recently, a flight muscle thick filament reconstruction from
Bombus ignitus showed that a previously unidentified density, the so-called “blue protein” [
31], was connected to the WYR domain by an extended density [
36]. An atomic model was built that indicated the flightin C-terminus was located on the inside surface of the myosin tail annulus, with a connection to the WYR domain after which the polypeptide chain was extended to reach outside the backbone. The flightin N-terminus, lacking any stabilizing interaction was not visible in the reconstruction. Non-myosin densities attributable to flightin and myofilin have now been found in
Lethocerus, Bombus and
Drosophila.
2.4.2. Stretchin-klp
Three additional densities were observed on the outside surface of the
Drosophila thick filament backbone that were not observed in
Lethocerus or
Bombus (
Figure 7d,e). Proteomic data indicates that these additional densities correspond to the Ig-like domains and linkers of stretchin-klp [
33]. An analysis of the stretchin-klp sequence in UniProt suggested a repeating sequence of Ig-like domains and linkers of various lengths (
Table 1). The basic pattern is one of Ig-like (86-92 residues), short linker (12-27 residues), Ig-like (86-90 residues), long linker (61-174 residues). With respect to size, the Ig-like domains and short linkers are more uniform than the long linkers accounting for the better definition of these densities. The long linkers are more heterogeneous than the short linkers and are generally large enough to form small, folded domains of their own. The structural pattern observed is that of two equally sized domains with a well-defined short linker and a large domain that is relatively poorly ordered representing the five highly variable long linkers.
The filament segments used in the reconstruction are not necessarily equally distributed with respect to the five repeats. It is therefore possible that the size distribution contributing to the third folded domain corresponding to the long linker is not necessarily random and could conceivably be biased with respect to size in favor of the larger long linkers. Nevertheless, the density corresponding to the long linker is poorly defined.
The atomic structure of an I-set domain of myosin binding protein C (PDB 2YXM;
Figure 7d,e) fit satisfactorily according to size into the two globular densities supporting interpretation of these densities. We also modeled one repeat of the “Ig-short linker-Ig-long linker” sequence using Alphafold (
Supplemental Figure S2b). The Ig-like domains of this model fit well. The short linker has relatively the same length as shown in the density, while the long linker, which was modeled as a long α-helix, was a poor match to the density. The five “long linker” predicted structures were all different. Mostly they formed folded domain but, in this case, the predicted structure was a single long α-helix. The long linker sequence in the five repeats is highly diverse, which suggests that their structures are highly diverse. All three stretchin-klp densities are positioned near the Skip 1 region of one myosin tail which lies on the backbone surface.
2.4.3. Flightin
The flightin densities from the three species have a similar shape with three distinctive features (
Figure 7f-h). The most prominent is a folded domain approximately in the middle of the segmented density (
Figure 7f). Studies of the flightin amino acid sequences from many species identified a highly conserved domain in the middle of the sequence, residues W85-R131 in
Drosophila, dubbed the “WYR” domain due to its enrichment in tryptophan, tyrosine, and arginine [
57]. Extending from this middle domain toward the outside of the myosin tail annulus is a “V” shaped extension. The vertex of the “V”, H84, is highly conserved and occurs just before the first residue of the WYR domain. The V-shaped extension ends at the outside surface of the thick filament backbone but the density continues for a variable number of residues. The atomic model begins at residue P69 leaving 68 N-terminal residues disordered. In
Lethocerus, the visible part is longer because it is stabilized by contacts with the proximal S2, but here in
Drosophila as well as in
Bombus it does not and is consequently shorter. Running along the inside of the myosin tail annulus from the WYR domain is an inner, extended density that connects to the former “blue” density at the C-terminus (
Figure 7f,
Supplemental Figure S2c). The inner extension is found only in
Drosophila and
Bombus; this feature has so-far not been seen in
Lethocerus although its “blue” density is present.
Application of the structure prediction program AlphaFold [
58] to the
Drosophila flightin sequence produced a structure for the WYR domain that was an excellent fit to the central density. The WYR domain predicted structure consists of a pair of α-helices connected by a loop. The first helix has a distinct kink in its middle which allows the two halves to incorporate a bend of ~45° (
Figure 7g). The orientation of the predicted WYR domain structure identifies the V-shaped density as coming from the sequence on its N-terminal side and the inner extended density as coming from the C-terminal side of the flightin sequence. To obtain the final model, we removed the N-terminal residues P69-K95 from the AlphaFold model and rebuilt them
de novo in COOT. In addition, the residues G140-L182 were also rebuilt de novo to fit the density (
Figure 7g). The final flightin atomic model was refined against the reconstruction using the Real Space Refinement utility in Phenix [
59] with acceptable validation. There are clashes in the model, specially between the two helical regions but the resolution proved insufficient to define a correction. However, the model in its current state has some predictive power.
For purposes of description, we define the following elements (
Figure 7i): External Domain (residues A2-G74)[
60], Extended-1 (residues Y75-R83), Extended-2 (residues H84-Y93), Helix-1 (residues K94-Q115), Loop-1 (residues T116-T127), Helix-2 (residues W128-D144), Connector (residues S145-Y166), C-terminal (residues N167-L182). The External Domain refers to those observed residues that are outside of the thick filament backbone a major part of which is not visible and is presumably disordered.
Differences in the flightin External Domain, which primarily reflect differences in mobility, are seen among the three species.
Lethocerus flightin has the longest visible External Domain, probably stabilized by an interaction with the proximal S2. The visible part of the
Drosophila External Domain is short and contacts nothing. The visible
Bombus flightin External Domain has approximately the same length as that in
Drosophila but extends outward almost perpendicular to the thick filament surface (
Figure 7h).
The Extended-1 and -2 elements define the V-shaped loop, which is a structure very similar in shape for all three species we have so-far examined. The vertex of the loop in Drosophila, residue H84, is highly conserved and occurs just before the beginning of the WYR domain, residue W85, suggesting that it may be the structural beginning of the WYR domain. The beginning of the V-shaped loop, Extended-1, is a region of poor sequence conservation among the three species for which structures have been revealed.
Extended-1, Helix-1, Loop-1 and part of Helix-2 are the elements of the WYR domain. The WYR domain technically ends at residue R131 after just one turn of Helix-2. However, sequence conservation in Helix-2 is good up to residue T135. In comparison to the atomic model for Bombus flightin, structure conservation is good up to residue I138. The helix-loop-helix structure of the WYR motif appears very similar among the three flightin structures that have been observed at subnanometer resolution. The C-terminal end of Helix-1 and the beginning of Loop-2 may comprise a paramyosin binding site because they extend furthest into the hollow core of the filament.
The flightin Connector runs along on the inner side of the myosin tail annulus but makes no stabilizing contacts with the underlying myosin tails. This element, reported first for
Bombus [
36] is absent in
Lethocerus [
31] and was not seen in the previous reconstructions from
Drosophila [
33]. Although the density here is not well defined, given the 22 residues comprising this element and the distance to be spanned, it is likely that most of the
Drosophila Connector structure must be α-helical. The Connector comprises a poorly conserved region of sequence and may have comparatively divergent structures. AlphaFold predicted a helical structure for the Connector. The 22 residues in the
Drosophila Connector have only a single site of 100% conservation with the other two species and seven sites with 2/3 conservation [
36].
Lethocerus has the longest Connector sequence.
Bombus has a 5-residue deletion in the sequence and
Drosophila has a 3-residue deletion. These deletions occur in non-overlapping regions of the Connector sequence. Both the poor conservation and shorter amino acid sequence may explain why the connecting densities of
Bombus and
Drosophila do not overlap. Thus, the connector may play an important role in defining the structure of the thick filament.
The density at the end of the flightin structure in
Drosophila and
Bombus highly overlap with the previously unidentified “blue protein” found in
Lethocerus, which is strong evidence of its correspondence to part of the flightin molecule even though it failed to connect to the WYR domain. The 16 residues at the end of the flightin C-terminus are highly conserved with four sites of identical sequence, and ten sites with two out of three residues conserved [
36]. Sometimes the 3
rd residue is a conservative substitution.
A genetic study of
Drosophila flightin showed that truncation of the 44 residues at the C-terminus (residues 139-182) disrupts flight, although normal sarcomeres were formed at eclosion [61, 62]. This region includes all the Connector and C-terminal elements. The structures of the putative WYR domain and the C-terminal domain were predicted with higher confidence by AlphaFold than the region at its N-terminal side, which appears to be intrinsically disordered. The Connector, which was absent in the
Lethocerus reconstruction [
31], is resolved in
Drosophila albeit at a lower resolution compared to the rest of the reconstruction.
The atomic models of flightin and myosin showed three sites that appear to be important for incorporation of flightin within the myosin tail annulus (
Figure 8a). The myosin mutation E1554K prevents incorporation of flightin into
Drosophila flight muscle thick filaments [
44]. The mutation occurs on the
e position of the heptad repeat [
63] which makes it a candidate for interaction with other polyptide chains. The
Drosophila flightin atomic model places flightin residue R87 in position to form a salt bridge with E1554 (
Figure 8b).
The second region of interaction (
Figure 8c) contains a salt bridge between myosin residue E1557 with flightin Loop-1 residue R124. Another salt bridge is found between myosin residue R1857 and flightin Helix-2 residue D144 (
Figure 8c). These have not previously been reported. An additional region of interaction involves mostly hydrophobic interactions between tyrosines of flightin Helix-1 and the myosin tail (
Figure 8d). The predicted interactions between the myosin tail and flightin are summarized in
Supplemental Table S2.