Genome-Wide Identification and Expression Analysis of Homeodomain Leucine Zipper Subfamily IV (HD-ZIP IV) Gene Family in Cannabis sativa L.
Abstract
The plant-specific homeodomain leucine zipper (HD-ZIP) family of transcription factors plays central roles in regulating plant development and environmental resistance. HD-ZIP subfamily IV (HDZ IV) transcription factors have been implicated primarily in the regulation of epidermal structure development, such as stomata and trichomes. In this study, we identified nine HDZ IV-encoding genes in Cannabis sativa L. by conducting a computational analysis of cannabis genome resources. Our analysis suggests that these genes putatively encode proteins that have all the conserved domains of HDZ IV transcription factors. The phylogenetic analysis of HDZ IV gene family members of cannabis, rice (Oryza sativa), and Arabidopsis further implies that they might have followed distinct evolutionary paths after divergence from a common ancestor. All the identified cannabis HDZ IV gene promoter sequences have multiple regulatory motifs, such as light- and hormone-responsive elements. Furthermore, experimental evidence shows that different HDZ IV genes have different expression patterns in root, stem, leaf, and flower tissues. Four genes were primarily expressed in flowers, and the expression of CsHDG5 (XP_030501222.1) was also correlated with flower maturity. Fifty-nine genes were predicted as targets of HDZ IV transcription factors. Some of these genes play central roles in pathogen response, flower development, and brassinosteroid signaling. A subcellular localization assay indicated that the protein encoded by one gene of this family localizes to the nucleus in Arabidopsis protoplasts. Taken together, our work lays the groundwork for elucidating the function of cannabis HDZ IV genes and their possible future use in enhancing cannabis trichome morphogenesis and secondary metabolite production.
Article type: Research Article
Keywords: homeodomain zipper IV transcription factors, bioinformatics, gene regulation, expression pattern, flower maturity, trichomes
Affiliations: Agricultural Biotechnology Laboratory, Department of Plant Science and Landscape Architecture, University of Connecticut, Storrs, CT 06269, USA; gang.ma@uconn.edu (G.M.); ceilamanzelcanth@gmail.com (A.K.Z.); papicella@mydecineinc.com (P.V.A.); Mydecine Innovations Group Inc., Denver, CO 80231, USA
License: © 2022 by the authors. CC BY 4.0 Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Article links: DOI: 10.3390/plants11101307 | PubMed: 35631732 | PMC: PMC9144208
1. Introduction
As seeds grow into mature plants, precise gene transcription and translation regulation are crucial in every biological process, including growth, development, and metabolism [ref. 1]. Transcription factors play crucial roles in the transcriptional regulation of gene expression [ref. 2,ref. 3]. They recognize and bind to specific regulatory elements in the promoter regions of target genes [ref. 4]. The expression of these target genes could be modulated at the transcriptional level based on the need for growth or resistance to environmental stresses [ref. 5]. Thus, the identification and functional analysis of transcription factor genes are desired for understanding fundamental knowledge about the molecular mechanisms of various biological processes [ref. 6]. Transcription factors have different binding domains, which could diversify into multiple families [ref. 7,ref. 8]. Transcription factors with homeodomain-zipper (HDZ) domains are specific to plant species [ref. 9,ref. 10]. The HDZ gene family has been classified into four subfamilies from I to IV based on the presence and absence of additional conserved domains [ref. 11]. Specific domains only occur in specific HDZ subfamilies; for example, the MEKHLA domain only appears in the members of the HDZ III subfamily [ref. 12].
As whole genome sequencing technology develops, increasing numbers of plant species are being sequenced [ref. 13,ref. 14]. An analysis of the HDZ IV gene family has been carried out in different plants including Arabidopsis, rice, maize, soybean, and cucumber [ref. 15,ref. 16,ref. 17,ref. 18,ref. 19]. These genes are mainly expressed in epidermal cells of plant organs and in the outermost layer of the shoot apical meristem [ref. 15,ref. 16,ref. 17,ref. 18,ref. 19]. Recent works used RNA sequencing to show the functional characterization and expression patterns of HDZ IV transcription factors and how they regulate the initiation of epidermal structures, including stomata, trichomes, and root hairs as well as cuticle development [ref. 10]. Abundant mutant resources enabled the utilization of a deep functional analysis of members of the Arabidopsis HDZ IV gene family. Arabidopsis HDZ IV transcription factor GLABRA2 (GL2) plays important roles in trichome development and root hair patterning [ref. 20,ref. 21,ref. 22]. The roles of Arabidopsis PROTODERMAL FACTOR 2 (AtPDF2) and MERISTEM LAYER 1 (AtML1), two functionally redundant HDZ IV genes, have been clarified in terms of the regulation of epidermis and flower development [ref. 23]. ANTHOCYANINLESS 2 (AtANL2) is involved in sub-epidermal cells’ anthocyanin deposition, epidermal cell proliferation, and root growth [ref. 24,ref. 25,ref. 26]. One of the Arabidopsis HDZ IV transcription factors, AtHDG11, also plays a starring role in plant resistance to drought stress by improving root development and reducing stomatal density [ref. 27,ref. 28,ref. 29]. Functional analysis of some of the HDZ IV genes has also been carried out in other plant species such as Artemisia annua L. (sweet wormwood) and Solanum lycopersicum (tomato). In sweet wormwood, homeodomain protein 1 (AaHD1) regulates glandular trichome initiation. 
Interestingly, this gene is involved in the phytohormone jasmonate (JA)-mediated glandular trichome initiation [ref. 30]. Tomato homeodomain protein SlPDF2 induces trichome formation and embryo lethality. The overexpression of PDF2 in tomato leads to the generation of more trichomes in the shoot, leaf, and floral tissues [ref. 31]. From all these studies, it can be concluded that HDZ IV transcription factors are vital to plant-specific epidermal structures’ development. The investigation and functional analysis of this gene family in other crop plants is urgently needed to solve important application problems.
Cannabis sativa L., one of the first domesticated plants, is separated into fiber-based hemp and marijuana, from which both medicinal and recreational drugs are obtained [ref. 32]. Cannabis generates abundant secondary metabolites, for which their therapeutic potential, particularly in the cases of cannabidiolic acid (CBDA) and Δ9-tetrahydrocannabinolic acid, has renewed global attention [ref. 33]. One type of epidermal structure, glandular trichomes, produces known and unknown secondary metabolites [ref. 34]. To increase the content of these secondary metabolites, farmers and plant researchers have tried to optimize growth conditions by, for example, increasing light or improving nutrition [ref. 35]. Using various additives including phytohormones can increase secondary metabolites content in some specific conditions [ref. 36]. However, these methods cannot fundamentally change the situation. How then can we increase the number of glandular trichomes, especially in leaves? Plant molecular biology and functional analysis of specific genes could provide an answer. In this study, we used bioinformatics strategies to carry out genome-wide identification and functional analysis of HDZ IV transcription factors in cannabis. Furthermore, we also experimentally studied the expression patterns of individual genes of this family during flower development. From these two studies, we identify the HDZ IV transcription factor gene family in cannabis, which could be used as a possible solution to increase the number of cannabis trichomes and improve secondary metabolite content in future work.
2. Results
2.1. Identification of HDZ Subfamily IV Genes in Cannabis
The Arabidopsis (16 genes) and Oryza sativa (11 genes) HDZ subfamily IV transcription factor genes were used as queries in BLASTp searches against a cannabis genome database [ref. 37]. Redundant sequences, sequences without a conserved homeodomain, and sequences with a MEKHLA domain (a domain specific to HDZ subfamily III) were removed; three proteins were found to contain a MEKHLA domain according to the Pfam database and the Conserved Domain Database (Supplementary Figure S1). A total of nine genes encoding putative HDZ IV proteins in cannabis were identified. These cannabis HDZ IV proteins contained characteristic domains, namely homeobox and Steroidogenic Acute Regulatory (StAR)-related lipid Transfer (START). Various features of the putatively identified cannabis HDZ IV genes, such as length, predicted protein molecular weight, isoelectric point (pI), grand average of hydropathicity (GRAVY), instability index, aliphatic index, and subcellular location, are summarized in Table 1. The molecular mass of cannabis HDZ IV proteins ranged from 79,965.59 to 91,400.22 Da (approximately 80 to 91 kDa), with an average molecular mass of 85,846.23 Da; the protein length ranged from 734 to 841 residues, with an average length of 781 aa; the isoelectric point of the HDZ IV proteins ranged from 5.62 to 6.41, with an average of 5.84. The grand average of hydropathicity (GRAVY) measures a protein's overall hydrophobicity or hydrophilicity. The GRAVY of cannabis HDZ IV proteins ranged from −0.458 to −0.282; negative values indicate that the proteins are hydrophilic overall. The hydrophobic amino acids may affect the stabilization and function of cannabis HDZ IV proteins. The instability index estimates whether a protein will be stable in a test tube. The instability index of cannabis HDZ IV proteins ranged from 40.27 to 54.15; because values above 40 predict instability, none of the cannabis HDZ IV proteins are predicted to be stable in vitro. The aliphatic index of a protein is defined as the relative volume occupied by aliphatic side chains (alanine, valine, isoleucine, and leucine), which may be regarded as a positive indicator of thermostability of globular proteins. The predicted cannabis HDZ IV proteins’ aliphatic indices ranged from 74.75 to 87.14. Therefore, cannabis HDZ IV proteins may be stable under higher temperatures in vivo. Subcellular localization prediction indicated that cannabis HDZ IV proteins are primarily located in the nucleus. Interestingly, two cannabis HDZ IV proteins, CsHDG5 (XP_030501222.1) and CsHDG5-like (XP_030501651.1), had nearly identical properties (Table 1) and sequences, and they may represent a recent gene duplication, although these sequences should be confirmed as distinct genes.
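As an illustration of two of the indices discussed above, the following minimal sketch computes GRAVY and the aliphatic index from a protein sequence. It assumes the standard Kyte-Doolittle hydropathy scale and the usual aliphatic index coefficients (2.9 for valine, 3.9 for isoleucine and leucine); the function names are hypothetical, and this is not the ExPASy implementation used in this study.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(seq):
    """Grand average of hydropathicity: mean hydropathy value per residue."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


def aliphatic_index(seq):
    """Aliphatic index from the mole percent of Ala, Val, Ile, and Leu."""
    seq = seq.upper()
    n = len(seq)

    def mole_percent(aa):
        return 100.0 * seq.count(aa) / n

    return (mole_percent("A")
            + 2.9 * mole_percent("V")
            + 3.9 * (mole_percent("I") + mole_percent("L")))
```

A negative `gravy` value corresponds to an overall hydrophilic protein, which is the case for all nine proteins in Table 1.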
Table 1: Structural features of CsHDZIV genes in cannabis.
| Gene | Gene ID | mRNA Locus | Length (aa) | MW (Da) | pI | Grand Average of Hydropathicity (GRAVY) | Instability Index | Aliphatic Index | Subcellular Location |
|---|---|---|---|---|---|---|---|---|---|
| LOC115706571 (ANTHOCYANINLESS 2 isoform X1) | XP_030490119.1 | XM_030634259.1 | 841 | 91,400.22 | 5.69 | −0.345 | 52.84 | 77.35 | nucl: 13, pero: 1 |
| LOC115707829 (GLABRA 2) | XP_030491770.1 | XM_030635910.1 | 755 | 83,575.46 | 5.64 | −0.457 | 48.05 | 74.78 | nucl: 12, extr: 2 |
| LOC115708255 (PROTODERMAL FACTOR 2) | XP_030492336.1 | XM_030636476.1 | 772 | 84,561.23 | 5.81 | −0.364 | 40.27 | 81.09 | nucl: 14 |
| LOC115714770 (HDG11) | XP_030499384.1 | XM_030643524.1 | 734 | 79,965.59 | 6.33 | −0.328 | 54.15 | 77.18 | nucl: 14 |
| LOC115716550 (HDG5) | XP_030501222.1 | XM_030645362.1 | 814 | 90,647.37 | 5.62 | −0.458 | 52.88 | 74.75 | nucl: 12, extr: 2 |
| LOC115716687 (ANTHOCYANINLESS 2 isoform X2) | XP_030501400.1 | XM_030645540.1 | 776 | 84,873.60 | 5.68 | −0.282 | 47.12 | 80.55 | nucl: 13, pero: 1 |
| LOC115716879 (HDG5-like) | XP_030501651.1 | XM_030645791.1 | 814 | 90,670.40 | 5.66 | −0.458 | 52.17 | 74.75 | nucl: 12, extr: 2 |
| LOC115699242 (HDG2) | XP_030482406.1 | XM_030626546.1 | 784 | 85,192.34 | 5.75 | −0.296 | 41.12 | 78.72 | nucl: 12, extr: 2 |
| LOC115723870 (ROC8-like) | XP_030509195.1 | XM_030653335.1 | 737 | 81,729.85 | 6.41 | −0.329 | 49.62 | 87.14 | nucl: 11, cyto: 2, vacu: 1 |
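The summary statistics quoted in Section 2.1 can be checked directly against Table 1. The short sketch below recomputes the ranges and means from the length, molecular weight, and pI columns as transcribed from the table; the variable names are illustrative only.

```python
# Values transcribed from Table 1, in row order.
lengths_aa = [841, 755, 772, 734, 814, 776, 814, 784, 737]
mw_da = [91400.22, 83575.46, 84561.23, 79965.59, 90647.37,
         84873.60, 90670.40, 85192.34, 81729.85]
pi_values = [5.69, 5.64, 5.81, 6.33, 5.62, 5.68, 5.66, 5.75, 6.41]


def mean(values):
    """Arithmetic mean of a non-empty list."""
    return sum(values) / len(values)


summary = {
    "length_range": (min(lengths_aa), max(lengths_aa)),
    "length_mean": round(mean(lengths_aa)),
    "mw_range": (min(mw_da), max(mw_da)),
    "mw_mean": round(mean(mw_da), 2),
    "pi_range": (min(pi_values), max(pi_values)),
    "pi_mean": round(mean(pi_values), 2),
}
```

With the table values above, the shortest protein is the 734-residue HDG11, and the means agree with the averages reported in the text (781 aa, 85,846.23 Da, pI 5.84).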
2.2. Phylogenetic Analysis of Cannabis HDZ IV Proteins
A phylogenetic tree was constructed using the Maximum Likelihood method (500 bootstrap replicates) to understand the evolutionary relationships between cannabis HDZ IV proteins and model plant HDZ IV proteins (Arabidopsis and Oryza sativa) [ref. 38]. Most HDZ IV proteins from cannabis and the dicot model plant Arabidopsis clustered together (Figure 1).

2.3. Conserved Motifs in Cannabis HDZ IV Proteins
To understand the possible functions of cannabis HDZ IV proteins, the MEME tool was used to find conserved motifs [ref. 41]. A total of 10 conserved motifs were queried in the nine cannabis HDZ IV proteins. Figure 2 and Figure S2 show the details of the conserved motifs in cannabis HDZ IV proteins. Motif 1 corresponded to the conserved Homeodomain (HD), and motifs 2, 6, 7, and 9 belonged to the START domain (Steroidogenic Acute Regulatory (StAR)-related lipid Transfer) [ref. 42].

2.4. Gene Structure and Chromosomal Localization of Cannabis HDZ IV Genes
Untranslated regions (UTRs) and introns play crucial roles in post-transcriptional regulation and alternative splicing, respectively [ref. 43]. It is therefore important to characterize the structure of cannabis HDZ IV genes, which could affect gene expression and translation. Figure 3 shows that the numbers of exons and introns among cannabis HDZ IV genes ranged from 8 to 18 and from 7 to 17, respectively. The UTR lengths differed among cannabis HDZ IV genes, which suggests that their post-transcriptional regulation may also differ. Intriguingly, CsROC8-like (XM_030653335.1) did not have predicted 5′ and 3′ UTRs. The regulation pattern of this gene may be different from that of other HDZ IV genes.

Figure 4 shows the locations of the cannabis HDZ IV genes mapped to the cannabis chromosomes. No HDZ IV gene was present on chromosome 2, 3, 6, 7, or X. Chromosomes 1 and 5 each contained three HDZ IV genes, and chromosomes 4, 8, and 9 each contained one.

2.5. Prediction and Analysis of Cannabis HDZ IV Genes Promoter
Promoter analysis is an effective strategy for understanding transcriptional regulation of specific genes. Two-thousand base pair upstream DNA sequences were collected using TBtools [ref. 44]. In Figure 5, we can see that the most frequently occurring regulatory domain is related to light. Light is the most important environmental factor in the regulation of the plant flowering stage [ref. 45,ref. 46]. Another interesting finding is that cannabis HDZ IV genes contain multiple hormone-responsive elements, suggesting that these genes can be regulated by multiple hormones [ref. 47].

2.6. Expression Pattern of Specific Cannabis HDZ IV Genes in Different Tissues and during Flower Maturation
To understand the expression pattern of this gene family, we isolated total RNA from root, stem, leaf, and flower tissues of the cannabis Space Candy (SC) variety. Figure 6 shows that the expression of eight out of nine genes was detected in different tissues; the exception is ROC8-like (XP_030509195.1), which may be regulated by a specific stimulus [ref. 15]. The eight detected genes of the HDZ IV family have different expression patterns in different tissues. Only PROTODERMAL FACTOR 2 (XP_030492336.1) was expressed in the root. Therefore, we used expression in the leaf as our control. Four genes (HDG5-like, GLABRA 2, HDG5, and ANTHOCYANINLESS 2 isoform X2) were primarily expressed in the flower tissues and may be involved in trichome morphogenesis and development in cannabis [ref. 19]. Interestingly, the expression of GLABRA 2 (XP_030491770.1) was 30-fold higher in the flower than in the leaf control. The GLABRA 2 homolog in Arabidopsis is involved in trichome development and root hair patterning [ref. 20,ref. 21,ref. 22]. However, we could not detect GLABRA 2 (XP_030491770.1) expression in roots. Cannabis GLABRA 2 may therefore have a specific function in trichome morphogenesis. HDG11 (XP_030499384.1), HDG2 (XP_030482406.1) and PROTODERMAL FACTOR 2 (XP_030492336.1) had low expression in these tissues.

Artemisia annua L. (sweet wormwood) has glandular trichomes that can generate the secondary metabolite artemisinin [ref. 48]. These glandular trichomes are similar to the glandular trichomes found in cannabis. The AaHD1 protein is important in glandular trichome initiation [ref. 30]. Cannabis homeobox-leucine zipper protein HDG5 (XP_030501222.1) is an ortholog of AaHD1. We wanted to know the function and expression pattern of CsHDG5. Figure 7 shows the expression of CsHDG5 during flower maturity (from week 1 to week 7). CsHDG5 was expressed at the highest level in week 3 and week 4; after that, its expression decreased. Furthermore, the functional analysis of CsHDG5 showed that CsHDG5 may initiate trichome development (unpublished data).

2.7. Putative Targets of HDZ IV Transcription Factors
It is important to know which target genes HDZ IV transcription factors regulate in cannabis. A previous study in Arabidopsis indicated that some HDZ IV transcription factors could bind to 5′-GCATTAAATGC-3′ consensus sequences [ref. 19]. We identified 59 genes whose promoter sequences contain this motif. Figure 8 and Supplementary Table S2 show that these 59 genes include 16 unknown/uncharacterized genes, 11 genes encoding enzymes, and 32 functional genes. XM_030645201.1 was predicted as an uncharacterized protein similar to At3g49140, which belongs to the pentatricopeptide repeat (PPR) superfamily of proteins [ref. 49]. The promoter sequence of XM_030645201.1 has three 5′-GCATTAAATGC-3′ consensus sequences. This gene may be one of the targets of HDZ IV transcription factors. Our unpublished data also indicate that one of the HDZ IV transcription factors was induced by Arabidopsis pathogen elicitor peptide 3 (AtPep3) in trichomes [ref. 50]. HDZ IV transcription factors may play crucial roles in plant responses to biotic stresses by regulating specific target genes [ref. 51]. We also found that XM_030630414.1, XM_030634601.1 and XM_030649628.1 were predicted to be involved in flower development [ref. 52,ref. 53]. Interestingly, a putative transcription factor, bHLH63, may be regulated by a specific HDZ IV transcription factor. Transcription factor bHLH63/CIB1 in Arabidopsis could bind a G-box or E-box to promote FT gene expression and, thus, trigger flowering in response to blue light [ref. 54].
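The motif scan described above can be sketched as a simple exact-match search. The example below counts occurrences of the 5′-GCATTAAATGC-3′ consensus in a promoter sequence; scanning the reverse strand as well is an assumption made here for illustration, since the text does not state which strands were searched, and the function names are hypothetical.

```python
MOTIF = "GCATTAAATGC"
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def count_overlapping(seq, pattern):
    """Count occurrences of pattern in seq, allowing overlaps."""
    count = 0
    start = seq.find(pattern)
    while start != -1:
        count += 1
        start = seq.find(pattern, start + 1)
    return count


def count_motif_sites(promoter, motif=MOTIF):
    """Count exact motif matches on the forward and reverse strands."""
    promoter = promoter.upper()
    forward = count_overlapping(promoter, motif)
    rc = reverse_complement(motif)
    # Avoid double counting if the motif equals its own reverse complement.
    reverse = 0 if rc == motif else count_overlapping(promoter, rc)
    return forward + reverse
```

A promoter such as that of XM_030645201.1, reported above to carry three consensus sequences, would be flagged by a count of 3 under this kind of scan if all three are exact matches.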

Flower yield is vital in the cannabis industry [ref. 55]. Environmental cues and endogenous signals affect and determine the expression of genes associated with the initiation and development of floral organs [ref. 56]. One of the well-known classes of plant hormones, the brassinosteroids (BRs), also regulates multiple aspects of plant development, and recent evidence suggests that BRs stimulate flowering by reducing transcript levels of a potent floral repressor [ref. 57,ref. 58]. XM_030637534.1 encodes putative transcription factor BIM1 and is involved in brassinosteroid signaling [ref. 59]. Identifying these substrates and learning their regulation mechanisms by HDZ IV transcription factors would be a promising endeavor.
2.8. Subcellular Localization of Specific HDZ IV Gene PROTODERMAL FACTOR 2 (XP_030492336.1) in Arabidopsis Protoplast
Transcription factors are mainly localized in the nucleus. To study the subcellular localization of cannabis HDZ IV proteins, we selected PROTODERMAL FACTOR 2 (XP_030492336.1) as our candidate. The tomato homolog of cannabis PROTODERMAL FACTOR 2 (XP_030492336.1), PDF2, is localized in the nucleus and plasma membrane [ref. 31]. We inserted cannabis PROTODERMAL FACTOR 2 (XP_030492336.1) into plasmid pK7YWG2 and transiently transformed Arabidopsis protoplasts with the resulting plasmid. Figure 9A shows that cannabis PROTODERMAL FACTOR 2 (XP_030492336.1) is localized in the nucleus. Similar studies will examine the other members of the cannabis HDZ IV gene family.

3. Discussion
The study of the functions of plant-specific HDZ IV gene family members in model plants has mainly focused on their roles in the initiation and developmental regulation of epidermal structures, such as cuticle, stomata, root hairs, and especially trichomes, a well-known “bio-factory” generating multiple secondary metabolites in cannabis [ref. 60]. These epidermal structures participate in plant development and responses to biotic and abiotic stresses. Stomata play central roles in gas exchange and water evaporation [ref. 61]. The cuticle and trichomes affect plants’ responses to biotic stresses and ultraviolet (UV) exposure. The HDZ IV gene family is considered one of the most important regulation mechanisms for sessile plants [ref. 10]. In this study, we carried out a bioinformatics survey relative to this gene family in the important crop cannabis. For past cannabis studies, most attention has been focused on the identification of novel secondary metabolites and the functional analysis of secondary metabolites (such as CBD or THC) in human health [ref. 62]. There are several methods to improve the content of secondary metabolites during the maturation period [ref. 63]. These in vitro strategies include the usage of different light conditions and plant growth effectors. Plant hormones could help cannabis flowers generate more CBD at the late flower period (unpublished data). However, drawbacks may occur when these strategies are used. It is better to induce plants to generate more secondary metabolites. Molecular biology and associated experimental strategies could help to more deeply understand the mechanisms of trichome initiation and secondary metabolites synthesis and transportation.
Our genome-wide analysis identified nine HDZ IV genes in cannabis (Table 1). The number of HDZ IV genes in cannabis is lower than that reported in other plants. Conserved domain analysis revealed that cannabis HDZ IV proteins contain all the conserved motifs, including the homeodomain and START domains, which are characteristic of HDZ transcription factors. The homeodomain and the START domain may help cannabis HDZ IV proteins regulate the expression of their target genes and thereby affect specific phenotypes. To build on these findings, future work may identify the targets of specific cannabis HDZ IV proteins using ChIP-seq and other strategies. Phylogenetic and gene structure analysis could enable us to infer their function based on homology to genes in better-characterized plants. Cannabis is diploid (2n = 20), and the genome size is about 830 Mb [ref. 32], which is about six times larger than that of the model plant Arabidopsis and two times larger than that of the model plant rice. Interestingly, the number of HDZ IV genes in cannabis is smaller than in these two model plants. One possibility is that cannabis HDZ IV genes are multifunctional in regulating the development of epidermal structures. Cannabis HDZ IV genes share similar exon–intron architecture, with the exception of CsROC8-like (XP_030509195.1), which does not have predicted 5′ and 3′ UTRs. Alternative splicing is a normal phenomenon in eukaryotes and greatly increases the diversity of proteins that can be encoded by the genome. More results based on high-throughput RNA sequencing need to be analyzed before we can conclude that cannabis HDZ IV genes produce different transcripts that generate multiple proteins.
Plants experience complex molecular regulation during juvenile and mature stages [ref. 64]. Transcription factors are pivotal in these stages, activating or inhibiting the expression of specific genes [ref. 65]. On the other hand, transcription factors themselves are also regulated by other regulatory proteins and/or plant hormones. Predicted promoter sequence analysis could help us to understand possible regulation mechanisms. Figure 5 shows that the promoter regions of all cannabis HDZ IV genes contain light-related regulatory elements. Light regulates plant growth and development, and myriad studies have demonstrated that light-related regulation touches nearly every area of plant science [ref. 66]. Trichomes are important tissues that protect plants from UV light. Meanwhile, the daily and seasonal variation in light exposure could trigger the transition from the vegetative stage to the reproductive stage [ref. 67,ref. 68]. We can postulate that light-associated regulators could bind specific cannabis HDZ IV gene promoter(s) and activate gene expression, which could initiate trichome development when the cannabis plants experience a short-day period [ref. 69]. Cannabis HDZ IV gene promoters also have hormone-associated binding elements for salicylic acid, methyl jasmonate, and gibberellin, which are involved in the induction of trichomes [ref. 70,ref. 71]. Different HDZ IV genes have different expression patterns in root, stem, leaf, and flower tissues. Four genes are primarily expressed in flowers and may play roles in the initiation and development of cannabis flower trichomes. Based on the expression results, we found that cannabis CsHDG5 (XP_030501222.1) is mainly expressed in flower tissue and has its highest expression level in week 3 and week 4 of flower maturation. Interestingly, CsHDG5 could be induced under specific stimuli in trichomes. Some putative targets regulated by HDZ IV transcription factors are involved in pathogen signaling. CsHDG5, one of the HDZ IV genes, may be important to both trichome development and pathogen response.
4. Materials and Methods
4.1. Computational Identification and Analysis of Cannabis HDZ IV Genes
The proteins corresponding to the HDZ IV genes of Oryza sativa and Arabidopsis were used as queries in BLASTp searches in the cannabis genome database (assembly number: GCA_900626175.2) [ref. 37]. The cannabis proteins resulting from each BLAST search (E-value < 10^−5) were pooled, and redundant sequences were removed. Protein sequences containing MEKHLA domain were also removed. Finally, the analysis of intron and exon composition was carried out by using TBtools [ref. 44]. The prediction of molecular weight and other physicochemical properties was performed by using online software ExPASy Proteomics (http://web.expasy.org/compute_pi accessed on 13 April 2021). The identification of conserved domains among cannabis HDZ IV proteins was performed using the MEME tool (http://meme-suite.org/tools/meme accessed on 7 May 2015) [ref. 41]. The identified motifs, as represented by logos, were manually inspected for the presence of elements representing conserved motifs of HDZ IV proteins.
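The pooling and de-duplication step above can be sketched as follows. This example assumes the BLASTp results were saved in the standard 12-column tabular format (subject ID in column 2, E-value in column 11); the output format actually used in this study is not stated, and the function name and example rows are hypothetical.

```python
def filter_blast_hits(lines, max_evalue=1e-5):
    """Pool tabular BLAST hits and return unique subject IDs below the cutoff.

    Each line is expected to hold 12 tab-separated columns:
    qseqid, sseqid, pident, length, mismatch, gapopen,
    qstart, qend, sstart, send, evalue, bitscore.
    """
    kept = set()
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        subject_id = fields[1]
        evalue = float(fields[10])
        if evalue < max_evalue:
            kept.add(subject_id)  # the set removes redundant hits
    return sorted(kept)


# Hypothetical rows for illustration only (not real search results).
example_rows = [
    "queryA\tsubject1\t78.5\t700\t120\t5\t1\t700\t10\t709\t0.0\t1100",
    "queryB\tsubject1\t75.0\t690\t140\t6\t1\t690\t12\t701\t1e-180\t980",
    "queryA\tsubject2\t30.1\t80\t50\t2\t5\t84\t3\t82\t0.002\t40.1",
]
candidates = filter_blast_hits(example_rows)
```

Domain-based filtering (for example, removing MEKHLA-containing proteins) would follow as a separate step on the resulting candidate list.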
4.2. Alignment, Phylogenetic Analysis and Chromosomal Localization
The multiple sequence alignment of HDZ IV sequences was performed by using MEGA11 software [ref. 72]. The phylogenetic tree was constructed by using the Maximum Likelihood method. The chromosomal mapping of individual HDZIV genes was carried out by TBtools.
4.3. Prediction of Cis-Regulatory Elements
In order to identify cis-regulatory domains in each cannabis HDZ IV gene’s promoter sequence, a 2 kb region upstream to the translation start codon was extracted using TBtools. These 2 kb sequences were uploaded into the Plant Cis-Acting Regulatory Element (PlantCARE) website (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/ accessed on 1 January 2002) [ref. 73]. The key regulatory elements were selected manually and constructed by TBtools.
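The promoter extraction step can be illustrated with a small strand-aware helper. This is a minimal sketch, not the TBtools procedure: it assumes the chromosome sequence is available as a string and that the position of the start codon is known in 0-based forward-strand coordinates; the function name and arguments are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def upstream_region(chrom_seq, atg_index, strand, length=2000):
    """Return up to `length` bases upstream of the translation start codon.

    atg_index is the 0-based forward-strand index of the base that
    corresponds to the A of the ATG start codon. For minus-strand genes
    the upstream region lies at higher coordinates and is returned as
    its reverse complement, so the result always reads 5' to 3' toward
    the start codon.
    """
    chrom_seq = chrom_seq.upper()
    if strand == "+":
        return chrom_seq[max(0, atg_index - length):atg_index]
    if strand == "-":
        region = chrom_seq[atg_index + 1:atg_index + 1 + length]
        return region.translate(_COMPLEMENT)[::-1]
    raise ValueError("strand must be '+' or '-'")
```

For genes close to a chromosome end, the returned sequence is simply shorter than the requested 2 kb.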
4.4. Plant Material and Various Treatments
“Stormy Daniels” (SD) and “Space Candy” (SC) varieties were used for gene expression analysis in the present study. The flower tissues from week 1 to week 7 were harvested from SD cannabis plants, and different tissues (root, stem, leaf, and flower) were collected from SC variety and stored at −80 °C following freezing in liquid nitrogen. In each case of flowering stages and different tissues, samples were collected in triplicate.
4.5. Total RNA Isolation, cDNA Synthesis, and Gene Expression Analysis
Total RNA was isolated from SD flower tissues and SC different tissues according to the previously described protocol [ref. 74]. Each RNA sample was treated with DNase to eliminate DNA contamination. The integrity and size distribution of total RNA was analyzed by agarose gel electrophoresis. A Nanodrop® (Waltham, MA, USA) instrument was used to measure RNA yield (ng/µL) and purity (260:280 wavelength ratios). DNA-free RNA (2 µg) was used for synthesis of first strand cDNA by using a Bio-Rad iScript™ (Hercules, CA, USA) cDNA synthesis kit as per manufacturer’s recommendations. Quantitative real-time PCR was performed with volumes of 20 µL per well with Bio-Rad™ SYBR green Supermix (Hercules, CA, USA). The amount of cDNA was normalized by using an amplification of housekeeping cannabis actin as an internal control. The data from real-time PCR amplification were estimated in terms of comparative fold expression following the 2^(−ΔΔCt) method [ref. 75]. The list of different primers used in the study is provided in Supplementary Table S1.
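The comparative fold-expression calculation can be written out explicitly. The sketch below implements the 2^(−ΔΔCt) formula with actin as the reference gene and a control tissue (for example, leaf) as the calibrator; it assumes approximately 100% amplification efficiency for both primer pairs, and the function name and Ct values are hypothetical.

```python
def fold_change(ct_target_sample, ct_ref_sample,
                ct_target_control, ct_ref_control):
    """Relative expression by the 2^(-ddCt) method.

    dCt  = Ct(target) - Ct(reference gene), computed per condition
    ddCt = dCt(sample) - dCt(control)
    """
    delta_ct_sample = ct_target_sample - ct_ref_sample
    delta_ct_control = ct_target_control - ct_ref_control
    delta_delta_ct = delta_ct_sample - delta_ct_control
    return 2.0 ** (-delta_delta_ct)


# Hypothetical Ct values: target gene vs. actin in flower (sample)
# and leaf (control).
example = fold_change(ct_target_sample=20.0, ct_ref_sample=15.0,
                      ct_target_control=25.0, ct_ref_control=15.0)
```

In this made-up example the target is detected five cycles earlier in the flower than in the leaf after normalization to actin, which corresponds to a 32-fold higher expression.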
4.6. Subcellular Localization of PROTODERMAL FACTOR 2 (XP_030492336.1)
PROTODERMAL FACTOR 2 (XP_030492336.1) was fused to EYFP using a 35S promoter-containing plasmid (pK7YWG2) with LR Clonase II (Invitrogen, Waltham, MA, USA), and the resulting construct was transiently transformed into Arabidopsis protoplast [ref. 76]. Protoplasts were viewed using a Nikon A1R confocal microscope through a 20× Plan Apo lens. Both channels were excited at 514 nm. Emissions were collected with an EYFP filter (514 nm). Z-stacks were collected at a step size of 27 microns. Composite channel/stack images/scale bars were produced in ImageJ [ref. 77].
5. Conclusions
This work used bioinformatics strategies to identify the HDZ IV gene family in Cannabis sativa for the first time. This gene family is central in regulating epidermal structures such as trichomes, which generate multiple secondary metabolites. Understanding the molecular mechanisms involved in these regulation networks will help researchers to construct genetically modified plants that generate more trichomes in every tissue. Indeed, transcription factors could be an ideal regulatory tool to accomplish this task. Thus, our results represent a useful foundation for developing an approach in cannabis research.
References
- Y. Zhu, S. Klasfeld, D. Wagner. Molecular regulation of plant developmental transitions and plant architecture via PEPB family proteins: An update on mechanism of action. J. Exp. Bot., 2021. [DOI | PubMed]
- S.H. Wani, S. Anand, B. Singh, A. Bohra, R. Joshi. WRKY transcription factors and plant defense responses: Latest discoveries and future prospects. Plant Cell Rep., 2021. [DOI | PubMed]
- Y. Qian, T. Zhang, Y. Yu, L. Gou, J. Yang, J. Xu, E. Pi. Regulatory Mechanisms of bHLH Transcription Factors in Plant Adaptive Responses to Various Abiotic Stresses. Front. Plant Sci., 2021. [DOI | PubMed]
- L. Liu, M.J. White, T.H. MacRae. Transcription factors and their genes in higher plants functional domains, evolution and regulation. Eur. J. Biochem., 1999. [DOI | PubMed]
- J.M. Franco-Zorrilla, I. López-Vidriero, J.L. Carrasco, M. Godoy, P. Vera, R. Solano. DNA-binding specificities of plant transcription factors and their potential to define target genes. Proc. Natl. Acad. Sci. USA, 2014. [DOI | PubMed]
- Z. Xie, T.M. Nolan, H. Jiang, Y. Yin. AP2/ERF Transcription Factor Regulatory Networks in Hormone and Abiotic Stress Responses in Arabidopsis. Front. Plant Sci., 2019. [DOI | PubMed]
- C. Yu, J. Lin, W. Li. Positional distribution of transcription factor binding sites in Arabidopsis thaliana. Sci. Rep., 2016. [DOI | PubMed]
- X. Lai, A. Stigliani, G. Vachon, C. Carles, C. Smaczniak, C. Zubieta, K. Kaufmann, F. Parcy. Building Transcription Factor Binding Site Models to Understand Gene Regulation in Plants. Mol. Plant, 2019. [DOI | PubMed]
- A. Pandey, P. Misra, A. Alok, N. Kaur, S. Sharma, D. Lakhwani, M.H. Asif, S. Tiwari, P.K. Trivedi. Genome-Wide Identification and Expression Analysis of Homeodomain Leucine Zipper Subfamily IV (HDZ IV) Gene Family from Musa accuminata. Front. Plant Sci., 2016. [DOI | PubMed]
- F. Romani, R. Reinheimer, S.N. Florent, J.L. Bowman, J.E. Moreno. Evolutionary history of HOMEODOMAIN LEUCINE ZIPPER transcription factors during plant transition to land. New Phytol., 2018. [DOI | PubMed]
- G. Sessa, C. Steindler, G. Morelli, I. Ruberti. The Arabidopsis Athb-8, -9 and -14 genes are members of a small gene family coding for highly related HD-ZIP proteins. Plant Mol. Biol., 1998. [DOI | PubMed]
- M.J. Prigge, S.E. Clark. Evolution of the class III HD-Zip gene family in land plants. Evol. Dev., 2006. [DOI | PubMed]
- F. Chen, Y. Song, X. Li, J. Chen, L. Mo, X. Zhang, Z. Lin, L. Zhang. Genome sequences of horticultural plants: Past, present, and future. Hortic. Res., 2019. [DOI | PubMed]
- P.J. Kersey. Plant genome sequences: Past, present, future. Curr. Opin. Plant Biol., 2019. [DOI | PubMed]
- V. Belamkar, N.T. Weeks, A.K. Bharti, A.D. Farmer, M.A. Graham, S.B. Cannon. Comprehensive characterization and RNA-Seq profiling of the HD-Zip transcription factor family in soybean (Glycine max) during dehydration and salt stress. BMC Genom., 2014. [DOI]
- R. Fu, W. Liu, Q. Li, J. Li, L. Wang, Z. Ren. Comprehensive analysis of the homeodomain-leucine zipper IV transcription factor family in Cucumis sativus. Genome, 2013. [DOI | PubMed]
- M. Jain, A.K. Tyagi, J.P. Khurana. Genome-wide identification, classification, evolutionary expansion and expression analyses of homeobox genes in rice. FEBS J., 2008. [DOI | PubMed]
- M. Javelle, C. Klein-Cosson, V. Vernoud, V. Boltz, C. Maher, M. Timmermans, N. Depège-Fargeix, P.M. Rogowsky. Genome-wide characterization of the HD-ZIP IV transcription factor family in maize: Preferential expression in the epidermis. Plant Physiol., 2011. [DOI | PubMed]
- M. Nakamura, H. Katsumata, M. Abe, N. Yabe, Y. Komeda, K.T. Yamamoto, T. Takahashi. Characterization of the class IV homeodomain-Leucine Zipper gene family in Arabidopsis. Plant Physiol., 2006. [DOI | PubMed]
- G. Han, X. Wei, X. Dong, C. Wang, N. Sui, J. Guo, F. Yuan, Z. Gong, X. Li, Y. Zhang. Arabidopsis ZINC FINGER PROTEIN1 Acts Downstream of GL2 to Repress Root Hair Initiation and Elongation by Directly Suppressing bHLH Genes. Plant Cell, 2020. [DOI | PubMed]
- S. Chen, S. Wang. GLABRA2, A Common Regulator for Epidermal Cell Fate Determination and Anthocyanin Biosynthesis in Arabidopsis. Int. J. Mol. Sci., 2019. [DOI | PubMed]
- X. Wang, S. Bi, L. Wang, H. Li, B. Gao, S. Huang, X. Qu, J. Cheng, S. Wang, C. Liu. GLABRA2 Regulates Actin Bundling Protein VILLIN1 in Root Hair Growth in Response to Osmotic Stress. Plant Physiol., 2020. [DOI | PubMed]
- E. Ogawa, Y. Yamada, N. Sezaki, S. Kosaka, H. Kondo, N. Kamata, M. Abe, Y. Komeda, T. Takahashi. ATML1 and PDF2 Play a Redundant and Essential Role in Arabidopsis Embryo Development. Plant Cell Physiol., 2015. [DOI | PubMed]
- A. Mabuchi, K. Soga, K. Wakabayashi, T. Hoson. Phenotypic screening of Arabidopsis T-DNA insertion lines for cell wall mechanical properties revealed ANTHOCYANINLESS2, a cell wall-related gene. J. Plant Physiol., 2016. [DOI | PubMed]
- H. Kubo, A.J. Peeters, M.G. Aarts, A. Pereira, M. Koornneef. ANTHOCYANINLESS2, a homeobox gene affecting anthocyanin distribution and root development in Arabidopsis. Plant Cell, 1999. [DOI | PubMed]
- H. Kubo, K. Hayashi. Characterization of root cells of anl2 mutant in Arabidopsis thaliana. Plant Sci., 2011. [DOI | PubMed]
- J.N. Banavath, T. Chakradhar, V. Pandit, S. Konduru, K.K. Guduru, C.S. Akila, S. Podha, C.O.R. Puli. Stress Inducible Overexpression of AtHDG11 Leads to Improved Drought and Salt Stress Tolerance in Peanut (Arachis hypogaea L.). Front. Chem., 2018. [DOI | PubMed]
- X. Cai, P. Xu, Y. Wang, C. Xiang. Activated expression of AtEDT1/HDG11 promotes lateral root formation in Arabidopsis mutant edt1 by upregulating jasmonate biosynthesis. J. Integr. Plant Biol., 2015. [DOI | PubMed]
- X. Guo, Y. Wang, P. Zhao, P. Xu, G. Yu, L. Zhang, Y. Xiong, C. Xiang. AtEDT1/HDG11 regulates stomatal density and water-use efficiency via ERECTA and E2Fa. New Phytol., 2019. [DOI | PubMed]
- T. Yan, M. Chen, Q. Shen, L. Li, X. Fu, Q. Pan, Y. Tang, P. Shi, Z. Lv, W. Jiang. HOMEODOMAIN PROTEIN 1 is required for jasmonate-mediated glandular trichome initiation in Artemisia annua. New Phytol., 2017. [DOI | PubMed]
- C. Yang, H. Li, J. Zhang, Z. Luo, P. Gong, C. Zhang, J. Li, T. Wang, Y. Zhang, Y. Lu. A regulatory gene induces trichome formation and embryo lethality in tomato. Proc. Natl. Acad. Sci. USA, 2011. [DOI | PubMed]
- G. Ren, X. Zhang, Y. Li, K. Ridout, M.L. Serrano-Serrano, Y. Yang, A. Liu, G. Ravikanth, M.A. Nawaz, A.S. Mumtaz. Large-scale whole-genome resequencing unravels the domestication history of Cannabis sativa. Sci. Adv., 2021. [DOI | PubMed]
- J. Gonçalves, T. Rosado, S. Soares, A.Y. Simão, D. Caramelo, Â. Luís, N. Fernández, M. Barroso, E. Gallardo, A.P. Duarte. Cannabis and Its Secondary Metabolites: Their Use as Therapeutic Drugs, Toxicological Aspects, and Analytical Determination. Medicines, 2019. [DOI | PubMed]
- C.A.S. Tanney, R. Backer, A. Geitmann, D.L. Smith. Cannabis Glandular Trichomes: A Cellular Metabolite Factory. Front. Plant Sci., 2021. [DOI | PubMed]
- R. Backer, T. Schwinghamer, P. Rosenbaum, V. McCarty, S.E. Bilodeau, D. Lyu, M.B. Ahmed, G. Robinson, M. Lefsrud, O. Wilkins. Closing the Yield Gap for Cannabis: A Meta-Analysis of Factors Determining Cannabis Yield. Front. Plant Sci., 2019. [DOI | PubMed]
- L. Burgel, J. Hartung, D. Schibano, S. Graeff-Hönninger. Impact of Different Phytohormones on Morphology, Yield and Cannabinoid Content of Cannabis sativa L. Plants, 2020. [DOI]
- K.U. Laverty, J.M. Stout, M.J. Sullivan, H. Shah, N. Gill, L. Holbrook, G. Deikus, R. Sebra, T.R. Hughes, J.E. Page. A physical and genetic map of Cannabis sativa identifies extensive rearrangements at the THC/CBD acid synthase loci. Genome Res., 2019. [DOI | PubMed]
- J. Truszkowski, N. Goldman. Maximum Likelihood Phylogenetic Inference is Consistent on Multiple Sequence Alignments, with or without Gaps. Syst. Biol., 2016. [DOI | PubMed]
- D.T. Jones, W.R. Taylor, J.M. Thornton. The rapid generation of mutation data matrices from protein sequences. Comput. Appl. Biosci., 1992. [DOI | PubMed]
- J. Felsenstein. Confidence Limits on Phylogenies: An Approach Using the Bootstrap. Evolution, 1985. [DOI | PubMed]
- T.L. Bailey, J. Johnson, C.E. Grant, W.S. Noble. The MEME Suite. Nucleic Acids Res., 2015. [DOI | PubMed]
- M. Elhiti, C. Stasolla. Structure and function of homodomain-leucine zipper (HD-Zip) proteins. Plant Signal Behav., 2009. [DOI | PubMed]
- A.K. Srivastava, Y. Lu, G. Zinta, Z. Lang, J.K. Zhu. UTR-Dependent Control of Gene Expression in Plants. Trends Plant Sci., 2018. [DOI | PubMed]
- C. Chen, H. Chen, Y. Zhang, H.R. Thomas, M.H. Frank, Y. He, R. Xia. TBtools: An Integrative Toolkit Developed for Interactive Analyses of Big Biological Data. Mol. Plant, 2020. [DOI | PubMed]
- B. Thomas. Light signals and flowering. J. Exp. Bot., 2006. [DOI | PubMed]
- L. Li, X. Li, Y. Liu, H. Liu. Flowering responses to light and temperature. Sci. China Life Sci., 2016. [DOI | PubMed]
- M. Wei, A. Liu, Y. Zhang, Y. Zhou, D. Li, K. Dossa, R. Zhou, X. Zhang, J. You. Genome-wide characterization and expression analysis of the HD-Zip gene family in response to drought and salinity stresses in sesame. BMC Genom., 2019. [DOI]
- M.E. Olsson, L.M. Olofsson, A.L. Lindahl, A. Lundgren, M. Brodelius, P.E. Brodelius. Localization of enzymes of artemisinin biosynthesis to the apical cells of glandular secretory trichomes of Artemisia annua L. Phytochemistry, 2009. [DOI | PubMed]
- J.T. Ascencio-Ibáñez, R. Sozzani, T.J. Lee, T.M. Chu, R.D. Wolfinger, R. Cella, L. Hanley-Bowdoin. Global analysis of Arabidopsis gene expression uncovers a complex array of changes impacting pathogen response and cell cycle during geminivirus infection. Plant Physiol., 2008. [DOI | PubMed]
- S. Bartels, M. Lori, M. Mbengue, M. van Verk, D. Klauser, T. Hander, R. Böni, S. Robatzek, T. Boller. The family of Peps and their precursors in Arabidopsis: Differential expression and localization but similar induction of pattern-triggered immune responses. J. Exp. Bot., 2013. [DOI | PubMed]
- R. Sharif, C. Xie, J. Wang, Z. Cao, H. Zhang, P. Chen, L. Yuhong. Genome wide identification, characterization and expression analysis of HD-ZIP gene family in Cucumis sativus L. under biotic and various abiotic stresses. Int. J. Biol. Macromol., 2020. [DOI | PubMed]
- C.C. Sheldon, D.T. Rouse, E.J. Finnegan, W.J. Peacock, E.S. Dennis. The molecular basis of vernalization: The central role of FLOWERING LOCUS C (FLC). Proc. Natl. Acad. Sci. USA, 2000. [DOI | PubMed]
- C.C. Hansen, D.R. Nelson, B.L. Møller, D. Werck-Reichhart. Plant cytochrome P450 plasticity and evolution. Mol. Plant, 2021. [DOI | PubMed]
- Y. Liu, X. Li, D. Ma, Z. Chen, J.W. Wang, H. Liu. CIB1 and CO interact to mediate CRY2-dependent regulation of flowering. EMBO Rep., 2018. [DOI | PubMed]
- S.J. Livingston, T.D. Quilichini, J.K. Booth, D. Wong, K.H. Rensing, J. Laflamme-Yonkman, S.D. Castellarin, J. Bohlmann, J.E. Page, A.L. Samuels. Cannabis glandular trichomes alter morphology and metabolite content during flower maturation. Plant J., 2020. [DOI | PubMed]
- S. Proietti, V. Scariot, S. De Pascale, R. Paradiso. Flowering Mechanisms and Environmental Stimuli for Flower Transition: Bases for Production Scheduling in Greenhouse Floriculture. Plants, 2022. [DOI | PubMed]
- Z. Li, Y. He. Roles of Brassinosteroids in Plant Reproduction. Int. J. Mol. Sci., 2020. [DOI]
- S.D. Clouse. The molecular intersection of brassinosteroid-regulated growth and flowering in Arabidopsis. Proc. Natl. Acad. Sci. USA, 2008. [DOI | PubMed]
- T. Liang, S. Mei, C. Shi, Y. Yang, Y. Peng, L. Ma, F. Wang, X. Li, X. Huang, Y. Yin. UVR8 Interacts with BES1 and BIM1 to Regulate Transcription and Photomorphogenesis in Arabidopsis. Dev. Cell, 2018. [DOI | PubMed]
- J.J. Zager, I. Lange, N. Srividya, A. Smith, B.M. Lange. Gene Networks Underlying Cannabinoid and Terpenoid Accumulation in Cannabis. Plant Physiol., 2019. [DOI | PubMed]
- T. Lawson, S. Vialet-Chabrand. Speedy stomata, photosynthesis and plant water use efficiency. New Phytol., 2019. [DOI | PubMed]
- G. Lafaye, L. Karila, L. Blecha, A. Benyamina. Cannabis, cannabinoids, and health. Dialogues Clin. Neurosci., 2017. [DOI | PubMed]
- V. Desaulniers Brousseau, B.S. Wu, S. MacPherson, V. Morello, M. Lefsrud. Cannabinoids and Terpenes: How Production of Photo-Protectants Can Be Manipulated to Enhance Cannabis sativa L. Phytochemistry. Front. Plant Sci., 2021. [DOI | PubMed]
- C. Lavedrine, E. Farcot, T. Vernoux. Modeling plant development: From signals to gene networks. Curr. Opin. Plant Biol., 2015. [DOI | PubMed]
- F. Romani, J.E. Moreno. Molecular mechanisms involved in functional macroevolution of plant transcription factors. New Phytol., 2021. [DOI | PubMed]
- C. Kami, S. Lorrain, P. Hornitschek, C. Fankhauser. Light-regulated plant growth and development. Curr. Top. Dev. Biol., 2010. [DOI | PubMed]
- M.T. Hauser. Molecular basis of natural variation and environmental control of trichome patterning. Front. Plant Sci., 2014. [DOI | PubMed]
- S. Braich, R.C. Baillie, L.S. Jewell, G.C. Spangenberg, N. Cogan. Generation of a Comprehensive Transcriptome Atlas and Transcriptome Dynamics in Medicinal Cannabis. Sci. Rep., 2019. [DOI | PubMed]
- J. Petit, E. Salentijn, M.J. Paulo, C. Denneboom, L.M. Trindade. Genetic Architecture of Flowering Time and Sex Determination in Hemp (Cannabis sativa L.): A Genome-Wide Association Study. Front. Plant Sci., 2020. [DOI | PubMed]
- M.B. Traw, J. Bergelson. Interactive Effects of Jasmonic Acid, Salicylic Acid, and Gibberellin on Induction of Trichomes in Arabidopsis. Plant Physiol., 2003. [DOI | PubMed]
- A.J. Boughton, K. Hoover, G.W. Felton. Methyl jasmonate application induces increased densities of glandular trichomes on tomato, Lycopersicon esculentum. J. Chem. Ecol., 2005. [DOI | PubMed]
- K. Tamura, G. Stecher, S. Kumar. MEGA11: Molecular Evolutionary Genetics Analysis Version 11. Mol. Biol. Evol., 2021. [DOI | PubMed]
- M. Lescot, P. Déhais, G. Thijs, K. Marchal, Y. Moreau, Y. Van de Peer, P. Rouzé, S. Rombauts. PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences. Nucleic Acids Res., 2002. [DOI | PubMed]
- A.M. Adal, K. Doshi, L. Holbrook, S.S. Mahmoud. Comparative RNA-Seq analysis reveals genes associated with masculinization in female Cannabis sativa. Planta, 2021. [DOI | PubMed]
- K.J. Livak, T.D. Schmittgen. Analysis of relative gene expression data using real-time quantitative PCR and the 2^(−ΔΔCT) Method. Methods, 2001. [DOI | PubMed]
- S.D. Yoo, Y.H. Cho, J. Sheen. Arabidopsis mesophyll protoplasts: A versatile cell system for transient gene expression analysis. Nat. Protoc., 2007. [DOI | PubMed]
- C.A. Schneider, W.S. Rasband, K.W. Eliceiri. NIH Image to ImageJ: 25 years of image analysis. Nat. Methods, 2012. [DOI | PubMed]
