2.1. Characterization of Ty1-copia sequences in the J. curcas genome
Three degenerate primer pairs related to the conserved sequence of the Ty1-
copia reverse transcriptase genes in higher plants [
13,
14,
15] were used by PCR-based method to increase the amount of DNA fragments of about 251-280 bp from six samples of
J. curcas collected from Thailand and three samples of
J. curcas obtained from Madagascar, Indonesia and Taiwan (
Table 1). All of the obtained PCR products were cloned and selected for at least 40 clones from the individual of the nine
J. curcas samples. The inserted clones were selected by preliminary grouping based on restriction patterns after cutting the inserted PCR products by restriction enzymes
AluI,
EcoRI,
HaeIII,
BclI and
NcoI before sequencing. All three degenerate primer pairs could amplify various Ty1-
copia elements. Altogether, 164 resulting sequences were used for the phylogenetic tree analysis. This study indicates the high diversity of the Ty1-
copia elements in the genome of
Jatropha curcas.
Comparison of all 164 obtained sequences to the GenBank-NCBI database using the BLAST network service at
http://www.ncbi.nlm.nih.gov/BLAST/ was performed and the analysis using SIB ExPASy Bioinformatic Resources portal at
https://web.expasy.org/translate/ was done. Amino acid sequences of approximately 83-93 amino acids in length derived from all isolated Ty1-
copia fragments, together with retrotransposons from other species,
Phaeodactylum tricornutum (GenBank: EU432487.1) as an outgroup, were aligned and then used to construct a phylogenetic tree based on maximum likelihood (ML) method (
Figure 1). Consistent with the high heterogeneity level reported among the reverse transcriptase (RT) sequences of Ty1-
copia in plants [
16], most of our isolated RT sequences were found different from each other and were found distributed into 12 clusters, which corresponds to Ty1-
copia families of
Jatropha curcas. The identified Ty1-
copia sequences were classified in to four distinct groups (TAR, Angela, Ale and Bianca) with high bootstrap values, which indicated that they belong to at least four distinct Ty1-
copia families in the
Jatropha genome. Family TAR were found in a large number of isolated Ty1-
copia clones (46.01%) that further classified into five subfamilies, followed by family Angela (34.97%) and family Ale (19.02%) which both could be further divided into 3 subfamilies, while Bianca possessed the smallest number of clone (0.61%) as shown in
Figure 1. The isolated retrotransposon clone numbers in the individual family were counted and the patterns of retrotransposon family distribution were identified among the nine
Jatropha curcas samples. We found that distribution patterns of Ty1-
copia families were different among the nine
Jatropha samples (
Table 1).
The consensus reverse transcriptase gene sequence of individual family was analysed and used to estimate relative copy number of the individual family in the
Jatropha genome by the hit number of BLAST search in the database of
Jatropha Genome at
http://www.kazusa.or.jp/jatropha/ [
17]. Average of Hit numbers of the TAR (75 clones), Angela (57 clones), Ale (31 clones) and Bianca (only 1 clone) were 61, 69, 64 and 11 respectively.
Molecular markers, for example SSR, AFLP and RAPD, have been widely used to evaluate genetic relationship among plants. Using these markers in
J. curcas, almost very low genetic divergence was detected among the
J. curcas varieties currently cultivated in Asia and America [
18,
19]. Therefore, other types of DNA markers, such as retrotransposon-based markers are of importance in order to use to identify and classify genetic characteristics of plants with narrow genetic base. This will increase the breeding efficiency in the
Jatropha population.
Retrotransposons have been known to be one of the important components of the eukaryotic genome. This should be related to the variation and also evolution of the species. The differences within the inactive elements can contribute to plant genome evolution, gene duplication events and new properties in the retrotransposition mechanisms [
20]. Detection of the retrotransposon presence or absence in an appropriate method will enhance characterization a population with low genetic diversity.
Sizes of plant genome are reported to be positively related to both differences and copy numbers of retrotransposon families [
21]. The Ty1-
copia in the plant kingdom have been characterized into six major families: lvana, Maximus, Ale, TAR, Angela and Bianca [
22]. Ty1-
copia families are reported to be more evolutionarily scattered and smaller in size than the Ty3-
gypsy [
23]. Different plant species carry different number of retrotransposon elements. Arabidopsis, having a small genome size of about 157 Mb, contains LTR retrotransposons in a very limited number of 5.60% [
24], whereas rice with genome size of about 389 Mb is composed of 22% LTR retrotransposon sequences [
25]. Maize, having large-sized genome of about 2045 Mb, has 74.6% of the LTR retrotransposons elements [
26], while
J. curcas with genome size of about 370 Mb contains 36% of transposable elements, comprising of Ty1-
copia and Ty3-
gypsy by 8.0% and 19.6%, respectively [
27]. In this study on Ty1-
copia characterization in
J. curcas, we found four families of Ty1-
copia, including TAR, Angela, Ale and Bianca, and distribution patterns of the four families were different among the nine
Jatropha samples studied. TAR was found in a large number (46.01%), followed by Angela and Ale, whereas Bianca was found in a small number (0.61%), with only 1 sequence found from
J. curcas from Indonesia. The result is consistent with previous reports on soybeans in which Ivana has the largest number of Ty1-
copia families and Bianca was not found [
28]. Tuntipaiboontana et al. (2018) [
29] reported Ty1-
copia in waterlilies and found Ale to be abundant and diverse, whereas Angela and TAR were found to be conserved.
Among LTR retrotransposons in plant species, the Ty1-
copia superfamily has been reported as a multitude component of both angiosperm and gymnosperm genomes [
30]. In our study, a degenerated primer method was used to evaluate the existence of Ty1-
copia retrotransposons in
J. curcas. The results indicated the existence of
copia-related sequences in all genomes of nine
J. curcas samples. This suggested that all of the nine
J. curcas samples share this repetitive elements in their genomes. In addition, the deduced amino acid sequences support the presence of
J. curcas Ty1-
copia sequences, where the consensus domain SLYGLKQA (
Figure 2), which is the Ty1-
copia characteristic of plant genome, was found to be highly conserved.
2.2. Molecular cytogenetic evaluation of the Ty1-copia in Jatropha based on FISH
DAPI-stained chromosomes in primary root of J. curcas at metaphase did not show clear constriction or distinguishing features among chromosomes in this study (
Figure 3A). However, Kikuchi et al. (2011) [
31] performed cytogenetic approach and reported that chromosomes of J. curcas are mostly metacentric or submetacentric and of similar size. In addition, in our study the relaxed prometaphase chromosomes of J. curcas showed obvious of centromere (primary constriction), secondary constriction and satellite (
Figure 3B). The condensation patterns observed in chromosomes at the prometaphase stage of J. curcas were similar to those in the chromosomes of coffee, quinoa and lemon, in which the centromeric and telomeric regions mostly consist of the highly repetitive elements [
32,
33,
34].
The chromosome distribution of our two identified families of Ty1-copia, Ale and Angela, was evaluated using FISH technique on mitotic chromosomes of both J. curcas and J. integerrima, Results suggested that Ale and Angela Ty1-copia families are dominantly clustered around the centromeres or at the chromosome terminal regions of Jatropha chromosomes (
Figure 4A,B). The FISH result confirms the existence of Ty1-copia, Ale and Angela, in genomes of both J. curcas and J. integerrima in this study. Patterns of Ty1-copia distribution in the chromosomes of Jatropha were the same as those in slash pine, lemon and oat, in that telomeric regions contained mostly repetitive retrotransposons [
34,
35]. In addition, the results were supported by the analysis of flanking regions in the EMBL database under AJ269530 and AJ269531 accession numbers [
36].
During normal development, transposable elements have been known to be silenced, but under abiotic and biotic stress, they will be motivated. The stimulators of stress, for example wounding, pathogen attack, allopolyploidization tissue culture conditions and unfavourable environmental conditions, including suboptimal temperature or water and nutrient availability, can cause transposon activation [
37,
38]. Under stress, transcripts of plant retrotransposons have been reported to be found in many plants, for example pine, Orobanche and Phelipanche [
27,
39]. This supports the genomic reorganization role predicted by McClintock for transposable elements in stress response [
40]. Stress-activated transposable elements can provide plants more variability, making them possible to survive in stressful environments. This may be due to various genes can be amplified and combined by the activity of transposable elements. New genetic variability induced by the action of transposable elements can facilitate quick adaptation [
41]. Therefore, transposable elements are the potential source of genetic variation that can contribute to differences in both genome structure and gene expression. They are an important factor for genome plasticity and genome evolution in plant [
42].