Citation: T. Schmutzer (2015-09-15): ScientificData_Functional_Annotation_SNPs_and_INDELs.

Abstract: This data resource contains the functional annotation of SNPs and INDELs for 52 Brassica napus lines. The analyzed sequence data was produced in the frame of the PreBreed-Yield project and was published (Snowdon et al. 2015, DOI: 10.1016/j.tplants.2015.04.013). The complete whole-genome shotgun resequencing data is archived at the European Nucleotide Archive (http://www.ebi.ac.uk/ena) under the project numbers PRJEB5974 and PRJEB6069. The discovery of INDELs is based on a gapped alignment that was constructed using Bowtie2. Subsequently the discovery of INDELs was performed using SAMtools/BCFtools using a minimal base quality of ‘-Q 30’ and a minimal read alignment quality of ‘-q 20’. BCFtools (version 1.2) was applied to screen for raw INDELs. A posterior filtering was performed subsequently using minimal (8) and maximal read depth (50), as well as a stringent IMF (0.9) and IDV (8) setting to identify high quality and homozygous sites. In total we detected 633,844 insertions and 469,860 deletions in the range between -20 and 20 bp sequence length. The discovery of SNPs was performed utilizing an un-gapped alignment that was constructed for each genotype individually using SOAP v2. SNP calling has been performed with multiple prediction methods using the tools FaSD, Freebayes and SAMtools. The approach assigned an additional confidence value to the predicted variant position (VP) by using the variant caller count (VCC) measurement. This measurement indicates how many variant calling methods predict a particular VP. All displayed VPs passed the following criteria: bi-allelic, SNP quality score >= 100, homozygous, read depth >= 4 and a VCC >= 2. This resource comprises in total ~16.5 million VPs that correspond to ~4.3 million unique positions in the Brassica napus Darmor-bzh reference genome (v4.2). All SNPs and INDELs subsequently were processed by the tool CooVar to construct a functional annotation using 101k predicted gene models of Brassica napus (Chalhoub et al. 2014, DOI: 10.1126/science.1253435).

License: CC BY 4.0 (Creative Commons Attribution)

Files:
Loading, please wait!
//schmutzr@IPK-GATERSLEBEN.DE/Functional annotation of SNPs and INDELs from 52 highly diverse accessions of the model allopolyploid plant Brassica napus./ScientificData_Functional_Annotation_SNPs_and_INDELs
PBY004_INDELs_categorized-gvs.gvf 5.3 MB
PBY004_SNPs_categorized-gvs.gvf 55.5 MB
PBY007_INDELs_categorized-gvs.gvf 2.1 MB
PBY010_INDELs_categorized-gvs.gvf 437.1 KB
PBY007_SNPs_categorized-gvs.gvf 34.3 MB
PBY011_INDELs_categorized-gvs.gvf 484.2 KB
PBY010_SNPs_categorized-gvs.gvf 21.1 MB
PBY011_SNPs_categorized-gvs.gvf 18.9 MB
PBY012_INDELs_categorized-gvs.gvf 3.4 MB
PBY012_SNPs_categorized-gvs.gvf 42 MB
PBY013_INDELs_categorized-gvs.gvf 2.1 MB
PBY013_SNPs_categorized-gvs.gvf 31.6 MB
PBY014_INDELs_categorized-gvs.gvf 2.1 MB
PBY014_SNPs_categorized-gvs.gvf 40.8 MB
PBY015_INDELs_categorized-gvs.gvf 3.1 MB
PBY017_INDELs_categorized-gvs.gvf 1006.3 KB
PBY015_SNPs_categorized-gvs.gvf 43.2 MB
PBY018_INDELs_categorized-gvs.gvf 693.7 KB
PBY017_SNPs_categorized-gvs.gvf 22.3 MB
PBY021_INDELs_categorized-gvs.gvf 611.1 KB
PBY018_SNPs_categorized-gvs.gvf 11.2 MB
PBY021_SNPs_categorized-gvs.gvf 21.2 MB
PBY022_INDELs_categorized-gvs.gvf 2.8 MB
PBY022_SNPs_categorized-gvs.gvf 36.7 MB
PBY023_INDELs_categorized-gvs.gvf 1.3 MB
PBY023_SNPs_categorized-gvs.gvf 29.4 MB
PBY024_INDELs_categorized-gvs.gvf 1.7 MB
PBY025_INDELs_categorized-gvs.gvf 385.9 KB
PBY024_SNPs_categorized-gvs.gvf 29.1 MB
PBY025_SNPs_categorized-gvs.gvf 16.5 MB
PBY026_INDELs_categorized-gvs.gvf 1.6 MB
PBY026_SNPs_categorized-gvs.gvf 31.5 MB
PBY027_INDELs_categorized-gvs.gvf 1.4 MB
PBY027_SNPs_categorized-gvs.gvf 26.5 MB
PBY029_INDELs_categorized-gvs.gvf 1.8 MB
PBY029_SNPs_categorized-gvs.gvf 28.4 MB
PBY031_SNPs_categorized-gvs.gvf 42.5 MB
PBY031_INDELs_categorized-gvs.gvf 2.2 MB
PBY032_INDELs_categorized-gvs.gvf 5.7 MB
PBY032_SNPs_categorized-gvs.gvf 90.9 MB
PBY033_INDELs_categorized-gvs.gvf 6.4 MB
PBY033_SNPs_categorized-gvs.gvf 84.8 MB
PBY034_INDELs_categorized-gvs.gvf 5 MB
PBY034_SNPs_categorized-gvs.gvf 80.1 MB
PBY035_INDELs_categorized-gvs.gvf 1.4 MB
PBY035_SNPs_categorized-gvs.gvf 39.8 MB
PBY036_INDELs_categorized-gvs.gvf 3.8 MB
PBY036_SNPs_categorized-gvs.gvf 67.9 MB
PBY037_INDELs_categorized-gvs.gvf 5.4 MB
PBY037_SNPs_categorized-gvs.gvf 89.6 MB
PBY038_INDELs_categorized-gvs.gvf 3.7 MB
PBY038_SNPs_categorized-gvs.gvf 77.3 MB
PBY039_INDELs_categorized-gvs.gvf 7.3 MB
PBY039_SNPs_categorized-gvs.gvf 115.6 MB
PBY040_INDELs_categorized-gvs.gvf 3.4 MB
PBY040_SNPs_categorized-gvs.gvf 72.5 MB
PBY041_INDELs_categorized-gvs.gvf 1.9 MB
PBY041_SNPs_categorized-gvs.gvf 35.9 MB
PBY043_INDELs_categorized-gvs.gvf 1.6 MB
PBY043_SNPs_categorized-gvs.gvf 27.6 MB
PBY044_INDELs_categorized-gvs.gvf 3 MB
PBY044_SNPs_categorized-gvs.gvf 45.6 MB
PBY045_INDELs_categorized-gvs.gvf 1.9 MB
PBY046_INDELs_categorized-gvs.gvf 754.9 KB
PBY045_SNPs_categorized-gvs.gvf 40.6 MB
PBY046_SNPs_categorized-gvs.gvf 23.3 MB
PBY047_INDELs_categorized-gvs.gvf 7.3 MB
PBY047_SNPs_categorized-gvs.gvf 101.2 MB
PBY048_INDELs_categorized-gvs.gvf 5.3 MB
PBY048_SNPs_categorized-gvs.gvf 62.8 MB
PBY049_INDELs_categorized-gvs.gvf 3.5 MB
PBY049_SNPs_categorized-gvs.gvf 54.8 MB
PBY050_INDELs_categorized-gvs.gvf 6.7 MB
PBY050_SNPs_categorized-gvs.gvf 114.3 MB
PBY051_INDELs_categorized-gvs.gvf 3.8 MB
PBY051_SNPs_categorized-gvs.gvf 82.4 MB
PBY052_INDELs_categorized-gvs.gvf 5.5 MB
PBY052_SNPs_categorized-gvs.gvf 91.6 MB
PBY053_INDELs_categorized-gvs.gvf 1.9 MB
PBY053_SNPs_categorized-gvs.gvf 37.9 MB
PBY054_INDELs_categorized-gvs.gvf 2 MB
PBY054_SNPs_categorized-gvs.gvf 33 MB
PBY055_INDELs_categorized-gvs.gvf 3.8 MB
PBY055_SNPs_categorized-gvs.gvf 56.8 MB
PBY056_INDELs_categorized-gvs.gvf 2.6 MB
PBY056_SNPs_categorized-gvs.gvf 36.7 MB
PBY057_INDELs_categorized-gvs.gvf 2.8 MB
PBY057_SNPs_categorized-gvs.gvf 35.7 MB
PBY058_INDELs_categorized-gvs.gvf 4.5 MB
PBY058_SNPs_categorized-gvs.gvf 68.4 MB
PBY059_INDELs_categorized-gvs.gvf 1.7 MB
PBY059_SNPs_categorized-gvs.gvf 29.2 MB
PBY060_INDELs_categorized-gvs.gvf 3.5 MB
PBY060_SNPs_categorized-gvs.gvf 33.1 MB
PBY061_INDELs_categorized-gvs.gvf 3.9 MB
PBY061_SNPs_categorized-gvs.gvf 43.4 MB
PBY062_INDELs_categorized-gvs.gvf 2.5 MB
PBY062_SNPs_categorized-gvs.gvf 34.8 MB
PBY001_INDELs_categorized-gvs.gvf 2 MB
PBY001_SNPs_categorized-gvs.gvf 33.1 MB
PBY002_SNPs_categorized-gvs.gvf 33.3 MB
PBY002_INDELs_categorized-gvs.gvf 2.2 MB
PBY003_SNPs_categorized-gvs.gvf 33.8 MB
PBY003_INDELs_categorized-gvs.gvf 1.6 MB
Download as ZIP (NOTE: ZIP Extraction using the native Windows Zip Client can fail due to file path length, please use third-party ZIP client instead)
Metadata
CONTRIBUTOR:
Uwe Scholz, Christian Colmsee, Chris Ulpinnis, Doreen Stengel, Birgit Samans, Rod Snowdon, Gunhild Leckband, Amine Abbadi, Frank Breuer, Peter Duchscherer, Stefan Abel, Zeljko Micic, Denis Lespinasse, Emmanuelle Dyrszka [Show full information]
CREATOR:
Thomas Schmutzer [Show full information]
PUBLISHER: e!DAL - Plant Genomics and Phenomics Research Data Repository (PGP), IPK Gatersleben, Seeland OT Gatersleben, Corrensstraße 3, 06466, Germany
SIZE: 0 B
SUBJECT: Brassica napus, rapeseed, functional annotation, diversity, 4.3 million single nucleotide polymorphisms (SNPs), PreBreed Yield
COVERAGE: none
DATE: Event: event
CREATED: TimePoint: Tue Sep 15 13:43:17 CEST 2015
UPDATED: TimePoint: Tue Sep 15 13:43:17 CEST 2015
LANGUAGE: en
RELATION: none
SOURCE: none
Revision: 0 - CreationDate: Tue Sep 15 13:43:17 CEST 2015 - RevisionDate: Tue Sep 15 13:43:17 CEST 2015