Citation: M. Mascher (2015-01-23): Barley_HighConfidences_Genes_ORFs_near_complete.gff. Size: 21.4 MB
Abstract: To avoid annotation artifacts due to fragmented gene models, we generated a GFF file of barley genes (PMID: 23075845) whose protein sequences are nearly completely represented in the Morex WGS assembly. Protein sequences were aligned to the genomic contigs with exonerate (PMID: 15713233). Genes were considered near-complete if at least 98 % of their protein sequence could be aligned to the genomic sequence. A total of 18,039 (74 %) of the 24,243 high-confidence genes positioned on the Morex WGS assembly had near-complete ORFs.
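The near-completeness criterion described in the abstract can be illustrated with a minimal sketch. This is not the pipeline used to build the dataset: the function name, the dictionaries, and the toy gene IDs are hypothetical, the 98 % threshold is assumed to be a minimum, and parsing of exonerate output is left out. The last line only re-checks the reported 74 % from the counts given in the abstract.

```python
def near_complete_genes(protein_lengths, aligned_lengths, min_fraction=0.98):
    """Return IDs of genes whose aligned protein fraction meets the threshold.

    protein_lengths: gene ID -> full protein length (amino acids)
    aligned_lengths: gene ID -> number of residues aligned to the genome
    Both mappings are hypothetical inputs for illustration only.
    """
    selected = []
    for gene_id, length in protein_lengths.items():
        # Genes without any alignment count as zero aligned residues.
        aligned = aligned_lengths.get(gene_id, 0)
        if length > 0 and aligned / length >= min_fraction:
            selected.append(gene_id)
    return selected


# Toy data (invented, not taken from the dataset).
protein_lengths = {"gene_a": 200, "gene_b": 350, "gene_c": 120}
aligned_lengths = {"gene_a": 198, "gene_b": 300}

kept = near_complete_genes(protein_lengths, aligned_lengths)

# Share of near-complete genes, using the counts stated in the abstract.
share = round(100 * 18039 / 24243)
```

With the toy data only gene_a passes, since 198 of its 200 residues (99 %) are aligned, while gene_b reaches about 86 % and gene_c has no alignment.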
License: CC BY 4.0 (Creative Commons Attribution)
CONTRIBUTOR: | Matthias Pfeifer, Manuel Spannagl |
CREATOR: | Martin Mascher |
PUBLISHER: | e!DAL - Plant Genomics and Phenomics Research Data Repository (PGP), IPK Gatersleben, Seeland OT Gatersleben, Corrensstraße 3, 06466, Germany |
SIZE: | 21.4 MB |
SUBJECT: | barley, genome annotation, general feature format (GFF), open reading frames |