Contrasting pattern of subtelomeric satellites in the Cannabaceae family
Abstract
Introduction:
Satellite DNA (satDNA) is a rapidly evolving component of plant genomes, typically found in (peri)centromeric, (sub)telomeric, and other heterochromatic regions. Due to their variability and species- or population-specific distribution, satDNA serves as valuable cytogenetic markers for studying chromosomal rearrangements and karyotype evolution among closely related species. Previous studies have identified species-specific subtelomeric repeats CS-1 in Cannabis sativa, HSR1 in Humulus lupulus, and HJSR in Humulus japonicus. These satellites have been used to differentiate sex chromosomes from autosomes, however, their evolutionary origins, sequence variation and conservation pattern across related species remain largely unexplored.
Methods:
In this study, we analyze sequence similarity among these satellites and assess their interspecific chromosomal localization using fluorescence in situ hybridization (FISH).
Results:
Our results reveal that the HSR1 and HJSR satellites are shared across all studied species, suggesting their common origin from a shared pool of satDNA in their common ancestor. In contrast, the CS-1 satellite exhibits higher sequence divergence.
Discussion:
Although all three satellites are predominantly localized in subtelomeric regions, we identified species-specific exceptions. These findings provide new insight into the evolutionary dynamics of satDNA within the Cannabaceae family and offer further support for the divergence of Humulus species.
Article type: Research Article
Keywords: subtelomeric repeats, satellite divergence, phylogenetics, metaphase chromosomes
Affiliations: Department of Plant Developmental Genetics, Institute of Biophysics of the Czech Academy of Sciences, Brno, Czechia; Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czechia; Hop Research Institute Co. Ltd., Žatec, Czechia
License: Copyright © 2025 Horáková, Bačovský, Čegan, Janoušek, Patzak and Hobza. CC BY 4.0 This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
Article links: DOI: 10.3389/fpls.2025.1631369 | PubMed: 40909903 | PMC: PMC12406709
Relevance: Moderate: mentioned 3+ times in text
Full text: PDF (2.1 MB)
Introduction
Plant genomes are predominantly composed of repetitive DNA, which can account for up to 85% of the total genome (ref. Novák et al., 2020). This repetitive fraction is broadly classified into two main categories: dispersed (transposable elements) and tandem repeats (satellite DNA = satDNA). SatDNA is composed of monomer sequences organized in long tandem arrays, which tend to accumulate primarily in chromosomal subdomains characterized by a condensed, heterochromatic organization – such as (peri)centromeres and (sub)telomeres (ref. Garrido-Ramos, 2015). SatDNA represents one of the most rapidly evolving components of eukaryotic genomes, varying among related species in (i) nucleotide sequence, (ii) copy number, (iii) length of monomer subunits, and (iv) chromosome localization (ref. Plohl et al., 2008; ref. Lower et al., 2018). SatDNA is further enriched in specialized chromosomes, such as B chromosomes or sex chromosomes. In the latter, enrichment is associated with low or suppressed recombination, which facilitates the expansion of satellite repeats and the accumulation of transposable elements (TEs) in the non-recombining regions of the Y or W chromosome (ref. Palacios-Gimenez et al., 2017; ref. Jesionek et al., 2021). In addition to their dynamic nature, large satDNA arrays have a role in chromosome evolution and segregation, highlighting their functional significance (ref. Plohl et al., 2012). Due to their repetitive structure, satDNA is a robust and powerful tool for cytogenetic studies. It enables detailed analysis of individual chromosomes (ref. Hobza et al., 2024), chromosomal rearrangements, and evolutionary dynamics among related species (ref. Jagannathan et al., 2017; ref. Schmidt et al., 2019, ref. 2024). According to the library hypothesis, related species share a set of satellite sequences that exhibit certain level of sequence divergence and variation in abundance. The differential amplification of specific satellite sequences leads to unique satDNA collections in related species, which may display species- or population-specific profiles (ref. Fry et al., 1973; ref. Mestrovic et al., 1998; ref. Plohl et al., 2012). This hypothesis has been extensively studied in various plants (ref. Koukalova et al., 2010; ref. Belyayev et al., 2019, ref. 2020), fish (ref. Utsunomia et al., 2017; ref. Goes et al., 2022), and other eukaryotic organisms (ref. Mestrovic et al., 1998; ref. Camacho et al., 2022). Furthermore, satDNA monomers are typically homogenized within a species as a consequence of concerted evolution, a process that maintains sequence uniformity within a species while promoting interspecies divergence (ref. Waye and Willard, 1989; ref. Durfy and Willard, 1990; ref. Wei et al., 2014).
The Cannabaceae family comprises the genera Cannabis and Humulus, along with eight additional genera widely distributed across tropical and temperate regions (ref. Yang et al., 2013; ref. Jin et al., 2020). The genus Cannabis, thought to have originated in East Asia, includes a diploid species, Cannabis sativa (ref. Lapierre et al., 2023), which is cultivated for fiber, oil, and as a source of therapeutic compounds (ref. Small, 2015). The sister genus Humulus consists of three species: Humulus lupulus L., H. japonicus Siebold & Zucc. (synonym H. scandens (Lour.) Merr.), and the Chinese endemic H. yunnanensis Hu (ref. Ling and Zhang, 2019). Humulus lupulus, commonly known as hop, is a key ingredient in beer brewing. Together with H. japonicus, it produces unique secondary metabolites with significant therapeutic potential (ref. Yu et al., 2007; ref. Zanoli and Zavatti, 2008; ref. Ryu et al., 2017; ref. Jiang et al., 2018). The divergence between C. sativa and H. lupulus is estimated to have occurred between 16 and 27.8 million years ago (mya; ref. McPartland, 2018; ref. Jin et al., 2020; ref. Prentout et al., 2021; ref. Padgitt-Cobb et al., 2023). In contrast, H. lupulus and H. japonicus diverged more recently, approximately 3.7 – 10.7 mya (ref. Murakami, 2000; ref. Murakami et al., 2006; ref. Jin et al., 2020). Members of both Cannabis and Humulus genera exhibit dioecy, but some individuals occasionally develop as monoecious. It is hypothesized that dioecy evolved prior to the divergence of these two genera, at least 21–25 mya (ref. Prentout et al., 2021; ref. Akagi et al., 2025).
The three species within the Cannabis and Humulus genera, namely C. sativa, H. lupulus, and H. japonicus, exhibit significant differences in genome size, ranging from 808 Mb in C. sativa (ref. Gao et al., 2020) to 1.8 Gb in H. japonicus (ref. Grabowska-Joachimiak et al., 2006) and 2.8 Gb in H. lupulus (ref. Padgitt-Cobb et al., 2023). A substantial proportion of these genomes is composed of repetitive DNA: 64-74.8% in C. sativa (ref. Gao et al., 2020; ref. Pisupati et al., 2018; ref. Lynch et al., 2025), 61.3-64.5% in H. lupulus (ref. Pisupati et al., 2018; ref. Padgitt-Cobb et al., 2023; ref. Horáková et al., 2025), and 66.8% in H. japonicus (ref. Zhang et al., 2023). Consistent with patterns observed in plant genomes (ref. Feschotte et al., 2002), long terminal repeat retrotransposons (LTR-RTs) represent the most abundant class of repetitive DNA across Cannabis and Humulus species (ref. Pisupati et al., 2018; ref. Zhang et al., 2023; ref. Lynch et al., 2025). A burst of LTR-RTs proliferation may have contributed to the enlargement and structural rearrangement of the X chromosomes in both Humulus species (ref. Akagi et al., 2025). Interestingly, the H. japonicus has undergone an X-autosome fusion involving autosome 3, leading to the formation of neo-X chromosome arm and neo-Y chromosomes (ref. Akagi et al., 2025). In contrast, the accumulation of Ty1/Copia solo-LTRs has predominantly shaped the Y chromosome in C. sativa (ref. Lynch et al., 2025). Meanwhile, satDNA constitutes only a small fraction of the repetitive DNA in Humulus species, accounting for 0.3-2% of the genome (ref. Zhang et al., 2023; ref. Horáková et al., 2025). Recent studies using long-read PacBio sequencing and advanced bioinformatics tools have characterized tandem repeat families in H. lupulus (ref. Easterling et al., 2020) and the most abundant DNA repeats in H. japonicus (ref. Zhang et al., 2023). To date, the most valuable cytogenetic markers for identifying autosomes and sex chromosome in Cannabis and Humulus species include three members: CS-1 (Cannabis sativa 1; ref. Divashuk et al., 2014), HSR1 (Humulus subtelomeric repeat 1; ref. Divashuk et al., 2011), and HJSR (Humulus japonicus subtelomeric repeat; ref. Alexandrov et al., 2012). Both subtelomeric HSR1 and HJSR satellites were originally characterized through restriction digestion using the KpnI endonuclease (ref. Divashuk et al., 2011; ref. Alexandrov et al., 2012), whereas CS-1 was later derived from HSR1 as a repeat with low homology to the HSR1 satellite (ref. Divashuk et al., 2014). This may suggest common origin for both repeats, classifying them into one superfamily, similar to patterns observed in other plats species such as Rumex acetosa (ref. Navajas-Pérez et al., 2006; ref. Cuñado et al., 2007; ref. Navajas-Pérez et al., 2009b; ref. Quesada Del Bosque et al., 2011). Despite these three satellites were identified independently in Humulus and Cannabis species, they exhibit remarkable similarities, including their subtelomeric localization. However, the origin and evolutionary dynamics of these large satellite arrays remain poorly understood, particularly compared to those described in other plant species. In this study, we isolated and compared major satellites CS-1 (C. sativa-specific), HSR1 (H. lupulus-specific), and HJSR (H. japonicus-specific). We identified their interspecific chromosomal localization using fluorescent in situ hybridization (FISH) in both sexes. To quantify their genome abundance, sequence similarity, and intraspecific variability, we combined short Illumina reads with bioinformatic analyses, and molecular cloning and phylogenic analysis within each species. Given the parallel origin of CS-1, HSR1, and HJSR, we aimed to compare (i) the genetic differentiation of the HSR1 and HJSR satellites in the context of H. japonicus speciation, and (ii) the origin and divergence of this subtelomeric satellite family. We also discussed the sequence similarity and chromosomal distribution of the HSR1 and CS-1 satellites to gain insight into their evolutionary relationship.
Materials and methods
Plant material
Female H. lupulus cv Saaz hop (Osvald’s clone 72) and male H. lupulus Lib male (15181) (2n = 18 + XX/XY) were provided by the Hop Research Institute Co. Ltd. in Žatec (Czech Republic). Female and male H. japonicus (2n = 14 + XX/XY1Y2) and C. sativa cv Kompolti (2n = 18 + XX/XY) plants were grown from seeds obtained from W. Legutko (Poland) and SEMO a.s. (Czech Republic), respectively. All plants were grown in a greenhouse under controlled conditions (16h daylight/8h dark photoperiod) at the Department of Plant Developmental Genetics in Brno (Czech Republic). The sex of H. japonicus plants was determined based on floral morphology and chromosome number. In Cannabis sativa and H. lupulus, sex was determined using PCR with male-specific molecular markers: MADC2 for C. sativa (ref. Mandolino et al., 1999) and OPJ9 for H. lupulus (ref. Polley et al., 1997). Both male-specific markers were amplified using Taq polymerase (Top Bio) according to the manufacturer’s instructions (for primers, see Supplementary Table S1). PCR cycling conditions followed protocols described by ref. Razumova et al. (2016) and ref. Patzak et al. (2002). Male and female plants of each species were selected for further analysis based on the PCR results (Supplementary Figure S1).
Isolation of DNA and Low-coverage genome sequencing
Genomic DNA was extracted from young leaves of male C. sativa, H. lupulus, and H. japonicus, using the NucleoSpin Plant II (Macherey-Nagel GmbH and Co. KG., Germany), following the manufacturer’s protocol. Libraries were prepared using the NEBNext® Ultra™ II DNA Library Prep Kit. Genomic DNA from male plants of C. sativa and H. japonicus was low-coverage sequenced using an Illumina MiSeq sequencer, generating 300 bp paired-end reads at the Centre of Plant Structural and Functional Genomics (Olomouc, Czech Republic). Additionally, we used the library of H. lupulus described in ref. Horáková et al. (2025).
Analysis of repeats using Repeat Explorer
The FastQC tool (available at http://www.bioinformatics.babraham.ac.uk/projects/fastqc) was used to assess the quality of sequencing reads. Furthermore, reads were pre-processed based on quality (Q30) with subsequent adaptor trimming, filtering out short or unpaired sequences, and all reads were trimmed to a uniform length of 200 bp using Trimmomatic 0.32 (ref. Bolger et al., 2014). To identify repetitive DNA composition and tandem repeats, these datasets were analyzed using RepeatExplorer2 (ref. Novák et al., 2013, ref. 2020) and Tandem Repeat Analyzer (TAREAN) pipelines (ref. Novák et al., 2017). The clustering of randomly selected 2 x 500–000 reads was performed by default setting.
Ligation of PCR products and cloning
The major satellites, including Cl5 (CS-1), Cl75 (HSR1), and Cl57 (HJSR) were amplified by PCR using specific primers (Supplementary Table S1). Primers were designed in Geneious Prime (version 2023.1.1) based on the consensus sequences generated by the TAREAN pipeline. PCR amplification was performed using Q5 High-Fidelity DNA polymerase (M0491S; NEB) according to the manufacturer’s instructions. The cycling conditions were as follows: 95°C for 4 min followed by 35 cycles of 94°C for 30 s, 52 – 57°C for 35 s, 72°C for 30 s, and a final extension step at 72°C for 10 min. The annealing temperature was optimized for each primer pair. Selected units for each satellite (Supplementary Figure S2) were extracted from agarose gel and purified using the QIAquick Gel Extraction Kit (QIAGEN GmbH – Hilden, Germany).
The purified fragments were individually cloned into the pJET1.2 plasmid vector using the CloneJET PCR Cloning Kit (K1231; ThermoFisher), following the manufacturer’s instructions. Ligation reactions were incubated overnight at 16°C and subsequently desalted. Competent E. coli cells were transformed by electroporation using a MicroPulser Electroporator (Bio-Rad). After 30-minute incubation, transformed cells were plated onto LB medium containing ampicillin (100 mg/L) and incubated overnight at 37°C. The next day, colony PCR was performed using pJET1.2-specific primers to identify positive clones. Individual bacterial colonies were used as templates. From positive PCR products, residual primers and PCR dNTPs were removed using ExoSAP reaction. PCR amplicons were sequenced in Macrogen (Amsterdam, Netherlands).
DNA probes preparation
Ribosomal DNA (5S and 45S rDNA) was amplified by PCR using specific primers (Supplementary Table S1). The primers were designed in Geneious Prime (version 2023.1.) based on the sequences from GenBank database (accession numbers: MN537579 (ref. Easterling et al., 2020) and AF223066.1). PCR amplification was performed in a 20 µl reaction mixture containing 1x PCR buffer, 0.0001 M dNTPs, 0.0001 M of each primer, 0.5 U Taq polymerase (Top Bio), and 10–15 ng of template DNA. The PCR cycling conditions were as follows: initial denaturation at 95°C for 4 min, 35 cycles at 94°C for 30 s, 55°C for 35 s, 72°C for 30 s, followed by a final extension at 72°C for 10 min. The telomeric repeat sequence (Arabidopsis-type, TTTAGGG) was amplified without template DNA according to ref. Ljdo et al. (1991). PCR cycling conditions were: first denaturation at 94°C for 1 min, 10 cycles of 94°C for 1 min, 55°C for 30 s, and 72°C for 1 min, followed by 30 cycles of 94°C for 1 min, 60°C for 30 s, 72°C for 90 s, and final extension step at 72°C for 5 min.
PCR products were separated on a 1% agarose gel with EtBr staining and purified using the QIAquick PCR Purification Kit (28104; QIAGEN) according to the manufacturer’s instructions. Purified PCR products (1µg) of each satellite and species and 5S, 45S rDNA, and telomere obtained as described above, were labeled using Nick Translation Labelling kits (Jena Bioscience, Germany) with Atto488 NT (PP-305L-488), Atto550 NT (PP-305L-550), and Cy5 (PP-305L-647N), following the manufacturer’s instructions. Labeling reactions were incubated at 15°C for 90 min. The labeled products were analyzed on a 1% agarose gel with EtBr staining. Reactions of well-labeled DNA probes were stopped by adding 0.5M EDTA and incubating at 85°C. The labeled DNA probes were then used directly in the hybridization mixture for FISH.
Mitotic chromosome preparation and Fluorescence in situ hybridization
Mitotic chromosomes were prepared from young leaves (2–5 mm in length) of C. sativa, H. lupulus, and H. japonicus as described in ref. Horáková et al. (2025). FISH was performed on mitotic metaphase chromosomes according to ref. Sacchi et al. (2024) under two stringency conditions (77% and 68%; Supplementary Table S2). FISH with 77% stringency was used to confirm the subtelomeric localization of CS-1 (Cl5) in C. sativa, HSR1 (Cl75) in H. lupulus, HJSR (Cl57) in H. japonicus, and 45S rDNA. Low-stringency FISH (68%) was used to determine the interspecific localization of satellites. Chromosomes were counterstained with 4´,6´-diamidino-2-phenylindole (DAPI) in Vectashield Antifade Mounting Medium. Images were captured using an Olympus AX70 epifluorescence microscope equipped with a CCD camera and processed using Adobe Photoshop. FISH experiments were performed in triplicate, with at least ten metaphases analyzed per experiment.
Phylogenetic analysis of subtelomeric satellites
The sequences were aligned in MAFFT in two rounds. At first, the adjusted direction option was used to ensure the correct orientation of sequences. The alignment was further improved in MAFFT using an iterative refinement method incorporating local pairwise alignment (mafft-linsi). The phylogenetic tree was constructed using maximum-likelihood approach using IQ-TREE. The substitution model (HKY+F+G4) was chosen in IQ-TREE based on the BIC criterion. The support values were obtained using the ultrafast bootstrap method (with 1000 replicates). To obtain sequence logos for each satellite-species combination, we mapped the reads from a given species to the reference, which was represented by a 60% consensus sequence prepared from sequenced clones of PCR products obtained in the species in which that satellite was originally described (for primers, see Supplementary Table S1). In several satellite-species combinations, low-coverage sequencing did not allow sufficient coverage of the reference, so the sequence logo was prepared based on the cloned PCR products. DNA reads were trimmed using Trimmomatic (ref. Bolger et al., 2014) and mapped to the reference using BWA-MEM (ref. Li, 2013). SAM files were converted to fasta alignments using a simple Python script (sam2fasta.py; https://sourceforge.net/projects/sam2fasta/). Sequence logos were generated from fasta alignments using a locally installed version of the Weblogo 3 program (ref. Crooks et al., 2004) with sequence-type set to ‘dna’.
Results
Analysis of repeat composition in Cannabis and Humulus species
Repeatome analysis revealed major satellites candidates in Cannabis and Humulus species. Specifically, Cl5 (CS-1) in C. sativa, Cl75 (HSR1) in H. lupulus, and Cl57 (HJSR) in H. japonicus. We extracted consensus sequence for each repeat and their abundance in the genomes based on sequencing data (Supplementary Table S3). These major satellites – Cl5 (CS-1), Cl75 (HSR1), and Cl57 (HJSR) show high sequence similarity to previously described sequences in GenBank: CS-1 (JX402748.2; ref. Divashuk et al., 2014), HSR1 (Humulus subtelomeric repeat 1, GU831574.1; ref. Divashuk et al., 2011), HJSR (Humulus japonicus subtelomeric repeat, GU831573.1; ref. Alexandrov et al., 2012). For clarity and consistency, we refer to these repeats throughout the text by their established GenBank names: CS-1, HSR1, and HJSR, respectively. The monomer unit lengths for these satellites range from 370 bp for CS-1, 380 bp for HJSR, to 383 bp for HSR1. Sequence comparison revealed relatively higher sequence similarity between HSR1 and CS-1 (58.82%) and HSR1 and HJSR (65.05%; Supplementary Figure S3).
Comparison of major satellite distribution among the related species
To analyze the interspecific distribution of major satellites, CS-1, HSR1, and HJSR, we performed FISH on metaphase chromosomes of C. sativa, H. lupulus, and H. japonicus (f1; T1). Additionally, Arabidopsis-type telomeric (TTTAGGG) and 45S rDNA probes were used to label chromosome ends and enable the identification of previously described chromosomes (f1). Telomeric signals are present at the terminal regions of all chromosomes, without any interstitial telomeric signals (f1; Supplementary Figure S4A). The distribution of 45S rDNA varies among studied species, being localized on chromosome 9 in C. sativa, chromosome 8 in H. lupulus, and on two pairs of autosomes, specifically on chromosomes 5 and 7 in H. japonicus (f1). Chromosome nomenclature in H. lupulus is based on the pseudomolecule assembly by ref. Akagi et al. (2025) with chromosomes numbered according to their size, from largest to smallest. Whereas in C. sativa and H. japonicus, it follows the systems established by ref. Kim et al. (2008); ref. Alexandrov et al. (2012), and ref. Divashuk et al. (2014).

Table 1: Analyzed tandem repeats in C. sativa, H. lupulus, H. japonicus, the monomer unit size and its chromosomal distribution.
| Satellite | Reference | Size (bp) | Chromosomal localization | ||
|---|---|---|---|---|---|
| C. sativa | H. lupulus | H. japonicus | |||
| CS-1 | PRJEB81858 | 370 | +/subtelomere | +/(peri)centromere | +/subtelomere |
| HSR1 | PRJEB81858 | 383 | +/(peri)centromere | +/subtelomere, pericentromere (X chromosome) | +/subtelomere |
| HJSR | PRJEB81858 | 380 | – | +/subtelomere | +/subtelomere |
+ Presence of satellite in species, – absence of satellite in species; Consensual sequences are listed in Supplementary Data sheet 1.
Bold values indicate the positions of major satellites in the species where they were originally identified
We observed chromosome-specific differences in the localization of all three studied satellites.
In Cannabis sativa, CS-1 is predominantly localized in the subtelomeric regions of both arms of all autosomes, except for the q-arm of chromosome 4 and the p-arm of chromosome 9, the latter of which possess 45S rDNA. Notably, the Y chromosome lacks CS-1 on the q-arm (f1). This distinct distribution of CS-1, combined with the Y chromosome size and DAPI banding pattern, enables its precise identification (f1). However, the identification of the X chromosome is still difficult, based solely on CS-1 distribution and factors mentioned above (f1). In Humulus lupulus, the CS-1 satellite is localized in the (peri)centromeric region of chromosome 2 (f1) while 5S rDNA is simultaneously detected in its subtelomeric region (Supplementary Figure S5). Although the CS-1 satellite was amplified with the designed primers in H. japonicus (Supplementary Figure S2), we did not observe any clear signals on metaphase chromosomes of either sex (f1 and Supplementary Figure S4B), presumably resulting from its low abundance on chromosomes and in the genome.
The Humulus lupulus-specific repeat, HSR1 satellite, is localized in the (peri)centromeric regions of ten chromosomes in males (f1) and eleven chromosomes in females C. sativa (Supplementary Figure S4C). This structural chromosomal heterozygosity may indicate a hybrid origin of the female C. sativa cv Kompolti or suggest the presence of HSR1 on one of the X chromosomes. Simultaneous hybridization of 45S rDNA, HSR1, and CS-1 satellites shows that HSR1 is absent from chromosomes 4, 9, and Y (Supplementary Figure S6). In contrast, the HSR1 satellite is localized in the subtelomeric regions of almost all autosomes of H. lupulus, except for the p-arm of chromosome 8, which carries 45S rDNA, the p-arm of X chromosome, and the q-arm of chromosome Y (f1). Notably, the X chromosome exhibits a strong HSR1 signal in the pericentromeric region of its p-arm. In Humulus japonicus, the HSR1 probe is distributed in the subtelomeric regions of almost all chromosomes (f1), showing similarities to the HJSR probe (f1). However, HSR loci on one chromosome pair and the chromosome Y2 are absent or underrepresented.
In Cannabis sativa, HJSR is found mainly in subtelomeric regions, although, some chromosomes lack the HJSR signal (f1). The distribution of the HJSR satellite in H. lupulus was identical to that of the HSR1 satellite, including its localization on the sex chromosomes (f1; Supplementary Figure S4D). In Humulus japonicus, the HJSR satellite is present in subtelomeric regions, except for one arm of X chromosome, p-arm of chromosome 5, and both arms of chromosome 7, which carry 45S rDNA. No HJSR signal was observed on the Y2 chromosome (f1). The distribution of 45S rDNA across all analyzed species aligns with previous studies. The distribution of CS-1, HSR1 and HJSR satellites in female plants are shown in Supplementary Figure S4. We observed no sex-specific differences in the chromosomal distribution of these satellites between studied accessions (f1; Supplementary Figure S4). However, the number of HSR1 signals varied in C. sativa (Supplementary Figures S4C, S6). The overall comparative distribution of CS-1, HSR1, and HJSR satellite further suggests intergenomic changes during the species divergence (f2).

Characterization and origin of major satellites in Cannabaceae
Using the primers sets designed for each major satellite across the three species, we obtained consistent amplification pattern for HSR1 and HJSR, with monomer unit length of approximately 380 bp. In contrast, the CS-1 satellite exhibited a species-specific ladder pattern in H. japonicus. The expected monomer band of 370 bp for CS-1 was obtained exclusively in C. sativa and H. lupulus (Supplementary Figure S2). Selected units for each satellite and species were cloned and sequenced. We analyzed a total of 35 CS-1 sequences (23 from C. sativa, 5 from H. lupulus, and 7 from H. japonicus), 46 HSR1 sequences (15 from C. sativa, 14 from H. lupulus, and 17 from H. japonicus), and 36 HJSR sequences (11 from C. sativa, 14 from H. lupulus, and 11 from H. japonicus). A comparison of satellite sequences cloned from each species supported low interspecific variability and their classification into one supercluster family, in line with physical localization on metaphase chromosomes (Supplementary Figure S7). To determine the relationships among these satellites in studied species, we conducted a phylogenetic analysis of individual sequence clusters (f3; Supplementary Figures S8, S9). The CS-1 amplified in C. sativa and H. japonicus grouped together, while those amplified in H. lupulus formed a separate clade (f3). Although the sequences from C. sativa and H. japonicus are closely related, no visible CS-1 signal was detected on the chromosomes of H. japonicus (f1), suggesting a low enrichment of these satellite units insufficient for detection by FISH. CS-1 was clearly localized to the subtelomeric regions of C. sativa chromosomes, in contrast to its (peri)centromeric localization on chromosome 2 in H. lupulus. The HSR1 and HJSR sequences tended to cluster together, with no unique species-specific clades identified between the species (Supplementary Figures S8, S9), indicating low divergence among individual subunits. This is consistent with their shared localization in subtelomeric regions. In Cannabis sativa, HSR1 was found in pericentromeric regions on five autosome pairs (f1). Among the three satellite arrays, only the HJSR satellite exhibited a conserved subtelomeric localization across all three species (f1), lacking clear species-specific clade and suggesting common origin for all three species.

Discussion
Satellite DNA plays a crucial role in the organization of plant chromosomes and has critical implications for evolutionary, genetic, and taxonomic research. Analyzing repetitive DNA divergence enables the exploration of evolutionary relationships among plant species (ref. Yoong Lim et al., 2006; ref. Koo et al., 2011). Despite the progress in characterizing the satDNA content within Humulus species, the organization, location and the extent to which these sequences are shared or diverged among the related species of the Cannabaceae family remain unknown. In this study, we examined the molecular and cytogenetic characteristics and phylogenetic relationships of major satellites CS-1, HSR1, and HJSR in C. sativa, H. lupulus, and H. japonicus. According to the library hypothesis, all three studied species should share a library of different conserved satellite DNA units (different satellite DNA families, monomer variants, and subfamilies within a satellite DNA family). The satDNA sequences may be differentially amplified in each taxon with the subsequent replacement of one sequence variant by another in different species or populations (ref. Plohl et al., 2012). Similarly, the evolution of the HSR1 and HJSR satellites is likely to be explained by such a process. The presence of both satellites in C. sativa suggests that they originated from a common ancestor. It is tempting to speculate that their presence predated the speciation and split of C. sativa and H. lupulus. Although the number of repeat sequences varies significantly between species, differences in subcluster abundance likely result from species-specific amplification processes (ref. Plohl et al., 2008, ref. 2012). In contrast, the CS-1 satellite appears more diverse than the HSR and HJSR satellites (f3). Correspondingly, the satellite RAE180 in Rumex acetosa has undergone distinct pattern of accumulation on autosomes and sex chromosomes. It is supposed that RAE180 originated before the split between Rumex species with XX/XY and XX/XY1Y2 sex chromosome systems (ref. Cuñado et al., 2007; ref. Navajas-Pérez et al., 2009b, ref. 2009a; ref. Quesada Del Bosque et al., 2011). Since Rumex species contain so far, the highest number of satellites on the sex chromosomes in plants (ref. Jesionek et al., 2021), satellite RAE180 was hypothesized to be present in an ancestral genome in various monomer variants from which novel tandem arrays could later be amplified (ref. Navajas-Pérez et al., 2009a). Such divergence was assessed in R. hastatulus Texas and North Carolina cytotypes, R. acetosella and R. acetosa (ref. Navajas-Pérez et al., 2009a; ref. Quesada Del Bosque et al., 2011), supporting again library hypothesis as shown for major satellites in this study, namely HSJR which shares similar chromosomal position and do not display specific sequence clustering (Supplementary Figures S9).
Although we confirmed the presence of CS-1 in the H. japonicus genome, this satellite was not detected on the metaphase chromosome using standard FISH approach, even with low stringency and higher sensitivity. We suggest that the absence of CS-1 positive loci on H. japonicus chromosomes rather indicate its low abundance in the genome and low tandem organization, similar to the RAE180 satellite in R. acetosella (ref. Cuñado et al., 2007). These findings are consistent with the large phylogenetic distance between Cannabis and Humulus genera and suggest distinct evolutionary trajectories for satellites in the lineages leading to H. japonicus. SatDNA evolves rapidly through unequal crossing-over, replication slippage or mutation (ref. Plohl et al., 2012). Amplification of satDNA may accompany karyotype rearrangements in plant species and chromosomal position of some satDNA family varies between related species (ref. Navajas-Pérez et al., 2009a). In our study, we observed that the subtelomeric localization of the three major satellites was not conserved and showed the distinct distribution pattern across the studied species, except HJSR. Nevertheless, the interspecific differences were evident in chromosomal positions and number of foci of all three satellites, with predominant subtelomeric localization. The Humulus lupulus HSR1 satellite was localized to the (peri)centromeric regions of C. sativa. Similarly, CS-1 was found exclusively in the (peri)centromeric region of two autosomes in H. lupulus (summarized in f2; T1). This satellite localization (f1) supports the hypothesis of large-scale genomic reorganization during the evolution in Humulus (ref. Akagi et al., 2025). Further, this pattern is comparable to Cl12 in XY and XYY cytotypes of R. hastatulus, which during karyotype rearrangement was relocated from the (peri)centromeric regions and accumulated in subtelomeric regions of some autosomes and Y1 chromosome (ref. Sacchi et al., 2024). Despite chromosomal rearrangements, including autosome and X-autosome fusions that occurred during the evolution in H. japonicus (ref. Akagi et al., 2025), leading to a reduced chromosome number and the formation of X and both Y chromosomes, telomeric signals were consistently observed only at the terminal regions of all chromosomes.
In summary, the analysis of the major satellites in the Cannabaceae family reveals a complex evolutionary pattern during the speciation of H. lupulus and H. japonicus. Despite differences in abundance, the sequence structure of CS-1, HSR1, and HJSR remains largely conserved, we propose that their chromosomal localization has contributed to species-specific sequence evolution. This divergence may have played a role in speciation promotion, as evidenced by the distinct positioning of CS-1 and HSR1 arrays in C. sativa and H. lupulus. With the availability of long-read sequencing data, a more comprehensive analysis of the full organizational structure of these major satellites will be possible, facilitating the identification of new sequence motifs even among various cultivars and varieties within the Cannabaceae family.
References
- T. Akagi, T. Segawa, R. Uchida, H. Tanaka, K. Shirasawa, N. Yamagishi. Evolution and functioning of an X–A balance sex-determining system in hops.. Nat. Plants., 2025. [DOI | PubMed]
- O. S. Alexandrov, M. G. Divashuk, N. A. Yakovin, G. I. Karlov. Sex chromosome differentiation in Humulus japonicus Siebold & Zuccarini 1846 (Cannabaceae) revealed by fluorescence in situ hybridization of subtelomeric repeat.. Comp. Cytogenet., 2012. [DOI | PubMed]
- A. Belyayev, M. Jandová, J. Josefiová, R. Kalendar, V. Mahelka, B. Mandák. The major satellite DNA families of the diploid Chenopodium album aggregate species: Arguments for and against the “library hypothesis.. PloS One, 2020. [DOI | PubMed]
- A. Belyayev, J. Josefiová, M. Jandová, R. Kalendar, K. Krak, B. Mandák. Natural history of a satellite DNA family: from the ancestral genome component to species-specific sequences, concerted and non-concerted evolution.. IJMS, 2019. [DOI | PubMed]
- A. M. Bolger, M. Lohse, B. Usadel. Trimmomatic: A flexible trimmer for Illumina sequence data.. Bioinformatics, 2014. [DOI | PubMed]
- J. P. M. Camacho, J. Cabrero, M. D. López-León, M. Martín-Peciña, F. Perfectti, M. A. Garrido-Ramos. Satellitome comparison of two oedipodine grasshoppers highlights the contingent nature of satellite DNA evolution.. BMC Biol., 2022. [DOI | PubMed]
- G. E. Crooks, G. Hon, J.-M. Chandonia, S. E. Brenner. WebLogo: A sequence logo generator: figure 1.. Genome Res., 2004. [DOI | PubMed]
- N. Cuñado, R. Navajas-Pérez, R. de la Herrán, C. Ruiz Rejón, M. Ruiz Rejón, J. L. Santos. The evolution of sex chromosomes in the genus Rumex (Polygonaceae): Identification of a new species with heteromorphic sex chromosomes.. Chromosome Res., 2007. [DOI | PubMed]
- M. G. Divashuk, O. S. Alexandrov, P. Y. Kroupin, G. I. Karlov. Molecular cytogenetic mapping of Humulus lupulus sex chromosomes.. Cytogenet. Genome Res., 2011. [DOI | PubMed]
- M. G. Divashuk, O. S. Alexandrov, O. V. Razumova, I. V. Kirov, G. I. Karlov. Molecular cytogenetic characterization of the dioecious Cannabis sativa with an XY chromosome sex determination system.. PloS One, 2014. [DOI | PubMed]
- S. J. Durfy, H. F. Willard. Concerted evolution of primate alpha satellite DNA.. J. Mol. Biol., 1990. [DOI | PubMed]
- K. A. Easterling, N. J. Pitra, T. B. Morcol, J. R. Aquino, L. G. Lopes, K. C. Bussey. Identification of tandem repeat families from long-read sequences of Humulus lupulus.. PloS One, 2020. [DOI | PubMed]
- C. Feschotte, N. Jiang, S. R. Wessler. Plant transposable elements: where genetics meets genomics.. Nat. Rev. Genet., 2002. [DOI | PubMed]
- K. Fry, R. Poon, P. Whitcome, J. Idriss, W. Salser, J. Mazrimas. Nucleotide sequence of HS-β Satellite DNA from kangaroo rat Dipodomys ordii .. Proc. Natl. Acad. Sci. U.S.A., 1973. [DOI | PubMed]
- S. Gao, B. Wang, S. Xie, X. Xu, J. Zhang, L. Pei. A high-quality reference genome of wild Cannabis sativa.. Hortic. Res., 2020. [DOI | PubMed]
- M. A. Garrido-Ramos. Satellite DNA in plants: more than just rubbish.. Cytogenet. Genome Res., 2015. [DOI | PubMed]
- C. A. G. Goes, R. Z. Dos Santos, W. R. C. Aguiar, D. C. V. Alves, D. M. Z. D. A. Silva, F. Foresti. Revealing the satellite DNA history in psalidodon and astyanax characid fish by comparative satellitomics.. Front. Genet., 2022. [DOI | PubMed]
- A. Grabowska-Joachimiak, E. Sliwinska, M. Pigula, U. Skomra, A. J. Joachimiak. Genome size in Humulus lupulus L. and H. japonicus Siebold & Zucc. (Cannabaceae).. Acta Societatis Botanicorum Poloniae,, 2006. [DOI]
- R. Hobza, V. Bačovský, R. Čegan, L. Horáková, M. Hubinský, T. Janíček. Sexy ways: approaches to studying plant sex chromosomes.. J. Exp. Bot., 2024. [DOI | PubMed]
- L. Horáková, P. Jedlička, R. Čegan, P. Navrátilová, H. Tanaka, A. Toyoda. Dynamic patterns of repeats and retrotransposons in the centromeres of Humulus lupulus L.. New Phytol. doi:, 2025. [DOI | PubMed]
- M. Jagannathan, N. Warsinger-Pepe, G. J. Watase, Y. M. Yamashita. Comparative analysis of satellite DNA in the drosophila melanogaster species complex.. G3 (Bethesda), 2017. [DOI | PubMed]
- W. Jesionek, M. Bodláková, Z. Kubát, R. Čegan, B. Vyskot, J. Vrána. Fundamentally different repetitive element composition of sex chromosomes in Rumex acetosa .. Ann. Bot., 2021. [DOI | PubMed]
- C.-H. Jiang, T.-L. Sun, D.-X. Xiang, S.-S. Wei, W.-Q. Li. Anticancer activity and mechanism of xanthohumol: A prenylated flavonoid from hops (Humulus lupulus L.).. Front. Pharmacol., 2018. [DOI | PubMed]
- J. Jin, M. Yang, P. W. Fritsch, R. Van Velzen, D. Li, T. Yi. Born migrators: Historical biogeography of the cosmopolitan family Cannabaceae.. J. Sytematics Evol., 2020. [DOI]
- S.-Y. Kim, C.-S. Kim, J. Lee, J.-W. Bang. Karyotype Analysis and Physical Mapping Using Two rRNA Genes in Dioecious Plant, Humulus japonicus Sieboid & Zucc.. Genes Genom., 2008
- D.-H. Koo, C. P. Hong, J. Batley, Y. S. Chung, D. Edwards, J.-W. Bang. Rapid divergence of repetitive DNAs in Brassica relatives.. Genomics, 2011. [DOI | PubMed]
- B. Koukalova, A. P. Moraes, S. Renny-Byfield, R. Matyasek, A. R. Leitch, A. Kovarik. Fall and rise of satellite repeats in allopolyploids of Nicotiana over c. 5 million years.. New Phytol., 2010. [DOI | PubMed]
- É. Lapierre, A. S. Monthony, D. Torkamaneh. Genomics-based taxonomy to clarify cannabis classification.. Genome, 2023. [DOI | PubMed]
- H. Li. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. doi:. 2013. [DOI]
- L.-Z. Ling, S.-D. Zhang. The complete chloroplast genome of Humulus yunnanensis and phylogenetic analysis of the genus Humulus .. Mitochondrial DNA Part B, 2019. [DOI | PubMed]
- J. W. Ljdo, R. A. Wells, A. Baldini, S. T. Reeders. Improved telomere detection using a telomere repeat probe (TTAGGG)n generated by PCR.. Nucleic acids research,, 1991. [PubMed]
- S. S. Lower, M. P. McGurk, A. G. Clark, D. A. Barbash. Satellite DNA evolution: old ideas, new approaches.. Curr. Opin. Genet. Dev., 2018. [DOI | PubMed]
- R. C. Lynch, L. K. Padgitt-Cobb, A. R. Garfinkel, B. J. Knaus, N. T. Hartwick, N. Allsing. Domesticated cannabinoid synthases amid a wild mosaic cannabis pangenome.. Nature., 2025. [DOI | PubMed]
- G. Mandolino, A. Carboni, S. Forapani, V. Faeti, P. Ranalli. Identification of DNA markers linked to the male sex in dioecious hemp (Cannabis sativa L.).. Theoretical and applied genetics,, 1999. [DOI]
- J. M. McPartland. Cannabis systematics at the levels of family, genus, and species.. Cannabis Cannabinoid Res., 2018. [DOI | PubMed]
- N. Mestrovic, M. Plohl, B. Mravinac, D. Ugarkovic. Evolution of satellite DNAs from the genus Palorus–experimental evidence for the “library” hypothesis.. Mol. Biol. Evol., 1998. [DOI | PubMed]
- A. Murakami. Comparison of Sequence of rbcL and Non-coding Regions of Chloroplast DNA and ITS2 Region of rDNA in Genus Humulus.. Breed. Sci., 2000. [DOI]
- A. Murakami, P. Darby, B. Javornik, M. S. S. Pais, E. Seigner, A. Lutz. Molecular phylogeny of wild Hops, Humulus lupulus L.. Heredity, 2006. [DOI | PubMed]
- R. Navajas-Pérez, M. E. Quesada Del Bosque, M. A. Garrido-Ramos. Effect of location, organization, and repeat-copy number in satellite-DNA evolution.. Mol. Genet. Genomics, 2009. [DOI | PubMed]
- R. Navajas-Pérez, T. Schwarzacher, M. R. Rejón, M. A. Garrido-Ramos. Molecular cytogenetic characterization of Rumex papillaris, a dioecious plant with an XX/XY1Y2 sex chromosome system.. Genetica, 2009. [DOI | PubMed]
- R. Navajas-Pérez, T. Schwarzacher, R. de la Herrán, C. Ruiz Rejón, M. Ruiz Rejón, M. A. Garrido-Ramos. The origin and evolution of the variability in a Y-specific satellite-DNA of Rumex acetosa and its relatives.. Gene, 2006. [DOI | PubMed]
- P. Novák, L. Ávila Robledillo, A. Koblížková, I. Vrbová, P. Neumann, J. Macas. TAREAN: a computational tool for identification and characterization of satellite DNA from unassembled short reads.. Nucleic Acids Res., 2017. [DOI | PubMed]
- P. Novák, M. S. Guignard, P. Neumann, L. J. Kelly, J. Mlinarec, A. Koblížková. Repeat-sequence turnover shifts fundamentally in species with large genomes.. Nat. Plants, 2020. [DOI | PubMed]
- P. Novák, P. Neumann, J. Pech, J. Steinhaisl, J. MacAs. RepeatExplorer: A Galaxy-based web server for genome-wide characterization of eukaryotic repetitive elements from next-generation sequence reads.. Bioinformatics, 2013. [DOI | PubMed]
- L. K. Padgitt-Cobb, N. J. Pitra, P. D. Matthews, J. A. Henning, D. A. Hendrix. An improved assembly of the “Cascade” hop (Humulus lupulus) genome uncovers signatures of molecular evolution and refines time of divergence estimates for the Cannabaceae family.. Hortic. Res., 2023. [DOI | PubMed]
- O. M. Palacios-Gimenez, G. B. Dias, L. G. De Lima, G. C. E. S. Kuhn, É. Ramos, C. Martins. High-throughput analysis of the satellitome revealed enormous diversity of satellite DNAs in the neo-Y chromosome of the cricket Eneoptera Surinamensis.. Sci. Rep., 2017. [DOI | PubMed]
- J. Patzak, P. Vejl, S. Skupinová, V. Nesvadba. Identification of sex in F 1 progenies of hop (Humulus lupulus L.) by molecular marker.. Plant, Soil and Environment,, 2002
- R. Pisupati, D. Vergara, N. C. Kane. Diversity and evolution of the repetitive genomic content in Cannabis sativa.. BMC Genomics, 2018. [DOI | PubMed]
- M. Plohl, A. Luchetti, N. Meštrović, B. Mantovani. Satellite DNAs between selfishness and functionality: Structure, genomics and evolution of tandem repeats in centromeric (hetero)chromatin.. Gene, 2008. [DOI | PubMed]
- M. Plohl, N. Meštrović, B. Mravinac. Satellite DNA evolution.. Genome Dyn., 2012. [DOI | PubMed]
- A. Polley, E. Seigner, M. W. Ganal. Identification of sex in hop (Humulus lupulus) using molecular markers.. Genome, 1997. [DOI | PubMed]
- D. Prentout, N. Stajner, A. Cerenak, T. Tricou, C. Brochier-Armanet, J. Jakse. Plant genera Cannabis and Humulus share the same pair of well-differentiated sex chromosomes.. New Phytol., 2021. [DOI | PubMed]
- M. E. Quesada Del Bosque, R. Navajas-Pérez, J. L. Panero, A. Fernández-González, M. A. Garrido-Ramos. A satellite DNA evolutionary analysis in the North American endemic dioecious plant Rumex hastatulus (Polygonaceae).. Genome, 2011. [DOI | PubMed]
- O. V. Razumova, O. S. Alexandrov, M. G. Divashuk, T. I. Sukhorada, G. I. Karlov. Molecular cytogenetic analysis of monoecious hemp (Cannabis sativa L.) cultivars reveals its karyotype variations and sex chromosomes constitution.. Protoplasma, 2016. [DOI | PubMed]
- Y.-K. Ryu, Y. Kang, J. Go, H.-Y. Park, J.-R. Noh, Y.-H. Kim. Humulus japonicus prevents dopaminergic neuron death in 6-hydroxydopamine-induced models of Parkinson’s disease.. J. Medicinal Food, 2017. [DOI | PubMed]
- B. Sacchi, Z. Humphries, J. Kružlicová, M. Bodláková, C. Pyne, B. I. Choudhury. Phased assembly of neo-sex chromosomes reveals extensive Y degeneration and rapid genome evolution in Rumex hastatulus .. Mol. Biol. Evol., 2024. [DOI | PubMed]
- N. Schmidt, K. Sielemann, S. Breitenbach, J. Fuchs, B. Pucker, B. Weisshaar. Repeat turnover meets stable chromosomes: repetitive DNA sequences mark speciation and gene pool boundaries in sugar beet and wild beets.. Plant J., 2024. [DOI | PubMed]
- T. Schmidt, T. Heitkam, S. Liedtke, V. Schubert, G. Menzel. Adding color to a century-old enigma: multi-color chromosome identification unravels the autotriploid nature of saffron (Crocus sativus) as a hybrid of wild Crocus cartwrightianus cytotypes.. New Phytol., 2019. [DOI | PubMed]
- E. Small. Evolution and classification of cannabis sativa (Marijuana, hemp) in relation to human utilization.. Bot. Rev., 2015. [DOI]
- R. Utsunomia, F. J. Ruiz-Ruano, D. M. Z. A. Silva, É. A. Serrano, I. F. Rosa, P. E. S. Scudeler. A glimpse into the satellite DNA library in characidae fish (Teleostei, characiformes).. Front. Genet., 2017. [DOI | PubMed]
- J. S. Waye, H. F. Willard. Concerted evolution of alpha satellite DNA: Evidence for species specificity and a general lack of sequence conservation among alphoid sequences of higher primates.. Chromosoma, 1989. [DOI | PubMed]
- K. H.-C. Wei, J. K. Grenier, D. A. Barbash, A. G. Clark. Correlated variation and population differentiation in satellite DNA abundance among lines of Drosophila melanogaster .. Proc. Natl. Acad. Sci. U.S.A., 2014. [DOI | PubMed]
- M.-Q. Yang, R. Van Velzen, F. T. Bakker, A. Sattarian, D.-Z. Li, T.-S. Yi. Molecular phylogenetics and character evolution of Cannabaceae.. TAXON, 2013. [DOI]
- K. Yoong Lim, A. Kovarik, R. Matyasek, M. W. Chase, S. Knapp, E. McCarthy. Comparative genomics and repetitive sequence divergence in the species of diploid Nicotiana section Alatae.. Plant J., 2006. [DOI | PubMed]
- B. C. Yu, M. C. Yang, K. H. Lee, K. H. Kim, S. U. Choi, K. R. Lee. Two new phenolic constituents ofHumulus japonicus and their cytotoxicity testIn Vitro .. Arch. Pharm. Res., 2007. [DOI | PubMed]
- P. Zanoli, M. Zavatti. Pharmacognostic and pharmacological profile of Humulus lupulus L.. J. Ethnopharmacol., 2008. [DOI | PubMed]
- G. J. Zhang, K. L. Jia, J. Wang, W. J. Gao, S. F. Li. Genome-wide analysis of transposable elements and satellite DNA in Humulus scandens, a dioecious plant with XX/XY1Y2 chromosomes.. Front. Plant Sci., 2023. [DOI | PubMed]
