DCARv2 BGI Mobile Element Annotation JBrowse GFF3

Resource Type: 
File
File Type: 
GFF3
Download: 
Download File NameAvailable atSizeMD5
mobileelements.gff3.gzCarrotOmics9.43MB093e4d63344b6dfba259335f64454e30
Description: 

Mobile Element Annotation performed by BGI. This is the GFF3 file used for JBrowse. See the linked analysis below for further details.

References: 
The following records refer to this file:
AnalysisLoading content
OrganismLoading content
Analysis: 
NameDescription

Mobile elements in the genome assembly were identified at both the DNA and protein level. RepeatMasker v3.2.9 (http://www.repeatmasker.org/) was applied to screen the genome assembly for low complexity DNA sequences and interspersed repeated elements using a custom library (a combination of Repbase v16.02 and plant repeat database). RepeatProteinMask (an extension of RepeatMasker) was used to perform RMBlast against the ME protein database to find known repeat sequences at the protein level.

Ab initio prediction program RepeatModeler version 1.0.4 (http://www.repeatmasker.org/RepeatModeler/) was employed to build the de novo repeat library from the assembled genome, refined by removing the contaminated sequences possibly derived from bacterial and redundant duplicated sequences in the library. Using this library as a database, RepeatMasker was implemented to identify and classify homologous repeat elements in the genome. In addition, LTR_FINDER version 1.1.0.5 was used to search the whole genome for the characteristic structure of the full-length long terminal repeat (LTR) retrotransposons. Subsequently, a custom program was used to merge all the predictions and generate a combined repetitive sequence annotation to mask the carrot genome.

ME accounted for 44.9% (190 Mb) of the assembled carrot genome. This value is larger than those observed in other sequenced genomes of similar size, for example, grape (41.4%, for 487 Mb) and melon (20%, for 375 Mb). With 57.4 Mb, the fraction of class II transposable elements in the carrot genome is higher than in most other plant genomes including rice (48 Mb). A large fraction of MEs are of relatively recent origin, with a sequence divergence rate of less than 10%.

Data from this analysis can be viewed in JBrowse here.

Loading content
Organism: 
NameCommon NameComment
Carrot
For a general overview of carrot, see the Carrot Facts Page
Loading content
License: 
NameAttribution 4.0 International (CC BY 4.0)
License Summary

You are free to:

  • Share: copy and redistribute the material in any medium or format

  • Adapt: remix, transform and build upon the material for any purpose, even commercially.

The licensor cannot revoke these freedoms as long as you follow the following license terms:

  • Attribution You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.

  • No additional restrictions You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits

Notices:

  • You do not have to comply with the license for elements of the material in the public domain or where your use is permitted by an applicable exception or limitation.

  • No warranties are given. The license may not give you all of the permissions necessary for your intended use. For example, other rights such as publicity, privacy, or moral rights may limit how you use the material.

Full Legal Texthttps://creativecommons.org/licenses/by/4.0/legalcode