1. HOME»
  2. News»
  3. The latest news

First report of a near-complete cucumber reference genome and multi-omics database

2024-06-22
【字体:

Recently, Cucurbitaceae Vegetable Genetics and Breeding Group in the Institute of Vegetables and Flowers (IVF, CAAS) made significant progress in cucumber genomics. The researchers accomplished the first-time near-complete assembly and gene annotation of the cucumber reference genome, along with the establishment of the inaugural comprehensive cucumber multi-omics database. The paper was published in Molecular Plant (IF=17.1) with the title of “A near-complete cucumber reference genome assembly and Cucumber-DB, a multi-omics database”.

图.jpg

Cucumber (Cucumis sativus L.) is an important economic vegetable crop in the Cucurbitaceae family. Nearly 30% of the cucumber genome consists of complex repetitive sequences such as 45s rDNA and microsatellites, a proportion much higher than that of crops like rice, maize, and watermelon (< 5%). Due to limitations in sequencing technology and assembly methods at the time, the widely used reference genome of cucumber inbred line '9930' (CLv3.0 version) still has a large amount of unknown sequence (~130 Mb) and 72 gaps. Additionally, these repetitive sequences significantly impact the accuracy of gene annotation, highlighting the urgent need to improve the quality of the cucumber reference genome. Therefore, this study used the strategy of ONT+HiFi+HiC for the first time to obtain a near-complete assembly of the cucumber reference genome (CLv4.0) with only one remaining gap; based on large-scale PacBio full-length and Ilumina transcriptome data, a near-complete cucumber reference transcriptome dataset (CsRTD1) was constructed, with a BUSCO value of 99.19%. Integrating pan-genomic, population variation genomic, transcriptomic, and core germplasm information, the first cucumber multi-omics database, Cucumber-DB (http://www.cucumberdb.com/), was established, which can provide a comprehensive sharing platform for cucumber functional genomics and molecular breeding research.

This study was led by Prof. Zhang Shengping, as the co-corresponding author. Associate Prof. Guan Jiantao, Miao Han, Prof. Zhang Zhonghua, and Associate Prof. Dong Shaoyun contributed equally to this work as co-first authors. This work was funded by the National Key Research and Development Program, the Earmarked Fund for Modern Agro-industry Technology Research System, the Agricultural Science and Technology Innovation Program of the Chinese Academy of Agricultural Sciences, and the Central Public-interest Scientific Institution Basal Research Fund.

Link to this paper: https://www.cell.com/molecular-plant/fulltext/S1674-2052(24)00192-8


图 1.jpg

Figure 1 Comparison of Clv4.0 and Clv3.0 genome assemblies


图 2.jpg

Figure 2 Construction and evaluation of the reference transcript dataset (CsRTD1)


图 3.jpg

Figure 3 Overview of the cucumber multi-omics database (Cucumber-DB)