Abstract: DNA sequence file in FASTA format for chromosomal pseudomolecules of barley (Hordeum vulgare) cv. Morex. This is the third release (Morex V3) of the Morex genome sequence assembly. Primary contig assembly from PacBio Hifi reads was done with HiCanu [doi:10.1101/gr.263566.120]. Contigs were scaffolded with Bionano data and arranged into chromosomal pseudomolecules with Hi-C data using the TRITEX pipeline [doi:10.1186/s13059-019-1899-5]. An AGP file specifying the placement of sequence scaffolds in the pseudomolecules is provided. The folder 'gene_annotation' holds the structural gene annotation of the Morex V3 assembly: gene models in GFF3 format, their functional descriptions as well as coding and protein sequences of high- and low-confidence genes. The folder 'repeat annotation' contains GFF files specifying the positions of transposable elements and tandem repeats. A table with the approximate centromere positions is found in the folder 'centromere_positions'. Annotation and data management were supported by the de.NBI grant (www.denbi.de) of the German Federal Ministry of Education and Research (031A536).
License: CC BY 4.0 (Creative Commons Attribution)
DOI: 10.5447/ipk/2021/3
Content: 3 Directories 16 Files (4.9 GB)
repeat_classification_PGSB_REcat-v4.tab.txt | 9.9 KB |
ReadMe__Barley_Morex_v3_transposon_annotation_by_homology.txt | 1.8 KB |
TEanno-v1.0__200416_MorexV3_pseudomolecules.gff | 520.8 MB |
CONTRIBUTOR: |
Jerry Jenkins,
Thomas Lux,
Jennifer Ens,
Heidrun Gundlach,
Zuzana Tulpova,
Uwe Scholz,
Klaus Mayer,
Manuel Spannagl,
Curtis Pozniak,
Hana Simkova,
Matthew Moscou,
Jeremy Schmutz,
Nils Stein
[Show full information]
|
CREATOR: |
Martin Mascher
[Show full information]
|
PUBLISHER: | e!DAL - Plant Genomics and Phenomics Research Data Repository (PGP), IPK Gatersleben, Seeland OT Gatersleben, Corrensstraße 3, 06466, Germany |
SIZE: | 520.8 MB |
SUBJECT: | barley, Hordeum vulgare, genome sequence assembly, long read sequencing, reference genome, gene annotation, transposable elements |