1. Introduction
The genus
Gossypium emerged almost 12.5 million years ago (mya), and about 5-10 mya diverged into diploid A and D sub-genome species [
1]. Subsequently, around 1-2 mya, the two-diploid species underwent trans-oceanic hybridization and chromosome doubling, resulting in an allopolyploid AD clade that radiated from the Americas and diversified into multiple species [
1]. The allopolyploid cotton,
Gossypium hirsutum L. (Upland cotton), and
Gossypium barbadense L. (Egyptian cotton) have originated from two independent domestication events [
2]. Today,
G. hirsutum dominates the world’s cotton fiber production (90%). Whereas
G. barbadense,
G. arboretum, and
G. herbaceum together contribute only 10% to it [
3]. Cotton production generates an annual economic impact of around 600 billion US dollars globally, making it a major contributor to the world’s economy [
4].
The plant architecture plays a crucial role in determining crop production and its management. It is determined by the meristematic cells and could be indeterminate, leading to continuous vegetative growth, to determinate, where meristematic cells are consumed in producing a terminal structure. Primary meristems are dynamically added in cotton-producing monopodial or sympodial branches, inflorescences, and terminal shoots, i.e., flowers. Similarly, the secondary meristems (cambium) add continuously and contribute to wood production. The position of organ-specific meristems called primordia contributes to the final plant architecture [
5]. Flowering in cotton depends on the differentiation of flower primordia that depends on axillary bud primordia, which in turn depends on the shoot apical meristem (SAM) differentiation. Axillary bud primordia initiate the transition from vegetative to the reproductive phase in cotton by differentiating from leaf primordia to flower primordia at a specific time point during plant development.
Transcription factors (TF) regulate tissue-specific gene expression by binding to the promoter region of the target genes. Subsequently, the DNA-TF complex promotes the recruitment of the RNA polymerase onto the target sequence and promotes gene expression. The GATA TFs (one of the important family of transcription factors) in plants express at different developmental stages, especially the light-induced processes, such as embryo development, flowering, petal differentiation and expansion, and maturation [
6,
7,
8]. The
NTL1 was the first GATA TF gene identified in
Nicotiana tabacum [
9]. So far, over 30 GATA TFs have been identified and characterized in
Arabidopsis thaliana [
10]. Additionally, 64 GATA TFs have been identified in
Glycine max, 30 GATA TFs in
Gossypium spp., and 28 GATA TFs in
Oryza sativa [
10,
11,
12]. These studies highlight the importance of GATA TFs in plant growth and development.
Another diverse family of transcription factors, SQUMOSA PROMOTER BINDING PROTEIN-LIKE (SPL), plays a crucial role in plant growth and development [
13,
14]. The SPL transcription factors play various roles in the growth and development of plants, such as controlling vegetative to reproductive phase change in
A. thaliana [
15,
16], ovary and fruit development in tomato [
17], yield-related traits in wheat [
18], and floral organ size and ovule production in cotton [
19].
Virus-induced gene silencing (VIGS) is an important reverse genetics tool for in vivo gene function study in plants [
20]. VIGS depends on posttranscriptional gene silencing (PTGS) machinery in a sequence-specific manner. Briefly, in this machinery, double-stranded RNAs (dsRNAs), such as the replication intermediates of plant RNA viruses and highly structured genomic RNA, are processed by Dicer-like proteins (DCLs) to form small interfering RNAs (siRNAs). The guide strand of siRNA gets loaded onto the RNA-induced silencing complex (RISC) and cleaves complementary messenger RNA (mRNA) [
21]. For VIGS, fragments of the gene of interest are cloned into a viral vector and applied to plants where the endogenous RNA silencing machinery in the host has degraded the viral RNA and, in this process, produced complementary siRNAs that reduce the target gene expression. VIGS has been demonstrated using numerous plant-virus combinations [
22,
23]. In the past decade, several viral genomes have been modified as a powerful reverse genetic tool for the functional characterization of plant genes. Some examples of VIGS constructs are the Tobacco rattle virus (TRV) [
24], Apple latent spherical virus (ALSV) [
25,
26], African cassava mosaic virus (ASMV) [
27], Cucumber mosaic virus (CMV) [
28], and Barley streak mosaic virus (BSMV) [
29].
TRV consists of a bipartite genome, TRV1, and TRV2 [
30], where TRV1 determines viral movement [
31] and TRV2 codes for the coat protein and other nonstructural proteins. Also, it shows a great deal of variability among different viral isolates [
30]. These nonstructural proteins are involved in the nematode transmission of this virus [
32], but these proteins are not essential for plant infection. Therefore, TRV2’s nonstructural protein-coding genes were replaced by multiple cloning site where fragments of the target gene could be introduced [
33]. TRV1 and TRV2 were later cloned into a binary vector for
Agrobacterium-mediated plant delivery [
24,
33].
TRV-VIGS has been effectively used to manipulate gene expression in different plant species such as
A. thaliana,
Solanum lycopersicum,
Nicotiana spp.,
G. arboreum, and
Petunia hybrida [
34]. TRV-based VIGS is one of the most effective methods to study gene function in plants due to several advantages, such as mild symptoms, spread to large cell patches, ability to migrate to the growing meristems, and systemic spread to new tissues and all plant parts [
35,
36,
37].
Given the desirable attributes of TRV and its successful implementation in VIGS in different plant species, in this study, we undertook the TRV-based VIGS of two candidate genes, Gohir.D05G103700 and Gohir.D12G153600. Using upland cotton mini-core collection, we identified these genes earlier by expression QTL (eQTL) analysis for five floral induction and meristem identity genes (FT, SOC1, AP1, FUL, and LFY) using the upland cotton mini-core collection (Naveed et al. unpublished data). In this analysis, Gohir.D05G103700 showed an association with the APETALA 1 (AP1) gene expression trait and annotated as GATA transcription factor 11-like, and Gohir.D12G153600 showed an association with FRUITFUL (FUL) and SUPPRESSOR OF OVEREXPRESSION OF CONSTANS 1 (SOC1) gene expression traits and annotated as SQUAMOSA promoter binding-like transcription factor. This work is focused on characterizing the molecular function of these eQTLs and determining their role in cotton architecture development. Unveiling the molecular functions of these genes might have implications in breeding cotton for reduced regrowth after defoliation or annual growth habit with improved lint yield.
4. Discussion
Upland cotton an allopolyploid has a large and complex genome. Genetic transformation in this crop was successful and was the source of several herbicide tolerant and insect pest resistant genotypes available commercially [
41]. However, genetic transformation is still not commonplace in cotton and is highly genotype dependent and time consuming [
42]; hence cannot be applied to test gene functions. Virus-induced gene silencing offers a solution to this problem, as the experiments are relatively less time consuming. However, the gene-silencing effects are broadly transient, which has both pros and cons [
43]. There are two viral systems available for gene silencing and gene delivery in upland cotton, Cotton leaf crumple virus (CLCrV) and Tobacco rattle virus (TRV) [
43]. Both systems were shown to work well; however, TRV-based system is more commonly used and hence was adopted in the present study for functional characterization of
Gohir.D05G103700 and
Gohir.D12G153600.
Despite the fact that we used an upland cotton cultivar ‘Coker 201’ that earlier showed susceptibility to Agroinfiltration [
38], hence consistent delivery and systemic spread of the recombinant virus, we observed plant-to-plant variation in the silencing efficiency in our experiments. For instance, five of the six plants showed some level of suppression; however, the level of suppression ranged from about 0.4-fold to 1.4-fold for
Gohir.D05G103700 and from about 0.8-fold to a little over 2.2-fold for
Gohir.D12G153600 in two and a single plant, respectively. A review of the literature suggested that variation in the level of gene suppression is not uncommon in VIGS experiments, as was documented in several earlier studies performed in different crop plants [
44]. Several reasons were proposed to contribute to the observed variation in the level of gene suppression, from the effect of abiotic factors to genetic factors [
45,
46]. Further, in this study, we observed incomplete gene silencing for either of the genes. This incomplete silencing could be attributed to multiple factors, including the plant genotype, ploidy level, the plant development stages, temperature, and inoculation method [
47,
48,
49]. Further, it was demonstrated that TRV-based gene silencing was more efficient in the diploid (
G. arboreum and
G. herbaceum) than in tetraploid cotton (
G. hirsutum) [
49]. Furthermore, the inoculation method is a critical factor for the successful delivery of the VIGS construct and subsequent steps involving systemic spread and gene silencing [
29].
In addition, the two genes
Gohir.D05G103700 and
Gohir.D12G153600, differed in the level of suppression. There could be several possible reasons, including VIGS is a random process and may not lead to a similar level of suppression in two different attempts, as the long antisense RNA after complementation with the target RNAs process is somewhat random fashion. Also, the two homoeologous copies of a gene do not carry perfect similarity; hence, they do not exhibit similar levels of suppression. Unfortunately, it is quite difficult to develop homoeologue-specific primers to study the level of suppression of individual homoeologues. Hence, a cumulative level of gene suppression was studied in most of the studies [
50,
51]. In some cases, the antisense was able to silence both copies equally well, and some only one of the two copies. Additionally, often times a compensatory increase in the expression level of one of the homoeologues was observed upon silencing the other homoeologue [
45].
Phenotypic analysis revealed that Coker 201 tends to produce a supernumerary bract (fourth bract) in each flower (visible in the mock control), which was absent from GATA-VIGS inoculated plants (
Figure 4A). The earlier research suggested that cotton tends to produce a fourth bract in certain environmental conditions when the meristem in the axil of the true leaves is not consumed fully in flower production see
Figure 4B [
40]. It was proposed that the three bracts in the cotton bud are modifications of one true leaf and two stipules, and the fourth bract represents the second leaf (indicative of indeterminate growth). These results suggested that the GATA transcription factor identified as an eQTL for the cotton
AP1 gene has a target site in the
AP1 promoter, where it binds and occupies a site not letting the
FT/FD assembly bind and induce/promote its expression, which triggers flowering and hence consumption of the meristem in floral development [
12,
52,
53,
54]. We hypothesized that VIGS of the GATA transcription factor released the site from suppression, allowing
FT/FD-induced activation of the
AP1 gene. The qRT-PCR analysis, however, did not exhibit
AP1 over accumulation. Furthermore, the fourth bract in cotton is an undesirable trait [
40], as its enlargement creates a gap allowing thrips and other insects to enter and feed on the developing fruit. Functional characterization of the
Gohir.D05G103700 gene identified as the cause of the undesirable fourth floral bract and made available the target to select for this trait. However, we did not see any observable phenotypic changes in the TRV2-SPL-TF plants, which suggested the functional redundancy of this transcription factor in cotton or a need to completely silence the gene to have a notable phenotypic effect. Quantitative RT-PCR analysis of the
AP1 and the
SOC1/FUL genes established a connection between these genes and eQTLs,
Gohir.D05G103700 and
Gohir.D12G153600. In sum, this study reconfirmed the utility of TRV-based VIGS in cotton by functional characterization of an eQTL target for the
AP1 expression trait. Also, the study reflected on the transcriptional regulation of the cotton
AP1 gene and provided a target to breed for thrip resistance in cotton cultivars with a supernumerary bract (fourth bract).