Citation: A. Bräutigam (2016-12-01): Big Data Analysis Training Course hosted at the Leibniz Institute of Plant Genetics and Crop Plant Research (IPK). DOI:10.5447/ipk/2016/59

Abstract: The Training Course (TC) covers an introduction to (i) linux, bash scripts, and R, (ii) read mapping for transcriptomics, (iii) genome assembly and annotation, and to (iv) biological data extraction. The TC is targeted towards biologist with little to no programming experience and thus requires no prior knowledge with regard to programming or linux. To proceed with the course, store all data in a folder and note its location. Within the course manual, file location is hard coded – please replace the file location in the documents with the one where you stored the data on your system. A linux operating system with at least 8Gb of RAM and at least 2 CPUs is recommended for execution of the programs in a timely manner. You will need root privileges (i.e. have administrator rights) on the system. Within the course documents, programs and methods are not attributed according to scientific standards as the course manual was meant for hands on execution and training, but not as a reference manual. Please cite original authors for all programs and tools if you use them in your work. The main document "BigDataTrainingCourse2016_manual.pdf" will guide you through the course material and the structure of the data.

License: CC BY 4.0 (Creative Commons Attribution)

DOI: 10.5447/ipk/2016/59

Content: 30 Directories 217 Files (44 GB)

Files:
Loading, please wait!
//lange@IPK-GATERSLEBEN.DE/Big Data Analysis Training Course hosted at the Leibniz Institute of Plant Genetics and Crop Plant Research (IPK)/GCBN_TrainingsKurs_BigData/dataset_RESULTS [2 Directories 83 Files]
Kmasker_plots_HvMrx5x 185.8 KB
GEMOMA_HVVMRXALLeA0221I08 118 KB
Hordeum_vulgare_HVVMRXALLeA0221I08_abyss_5kb_GENES_augustus.aa 19.1 KB
ecoli_DH10B.fasta.nsq 1.1 MB
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo.bubbleInScaff 0 B
CONTAMINATION_ecoli_HVVMRXALLeA0221I08_soap.cids 55 B
HVVMRXALLeA0221I08_R0_noadapter_QT.fastq 2.6 MB
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo_5kb.fasta 102.6 KB
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo.kmerFreq 681 B
trimmerr 0 B
BLASTN_DB_nt_QUERY_HVVMRXALLeA0221I08_abyss_5k_ID90_E10.outfmt.blastn 1013 B
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo.scaf 67.1 KB
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo.gapSeq 0 B
CONTAMINATION_ecoli_HVVMRXALLeA0221I08_abyss.cids 0 B
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo.contig 4.3 MB
HVVMRXALLeA0221I08_R2_noadapter_QT.fastq.stats 839 B
assembly.fasta.cids 135 B
KMASKER_HvMrx5x_RT5_N5_Hordeum_vulgare_HVVMRXALLeA0221I08_abyss_5kb.fasta 148.9 KB
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo.preGraphBasic 173 B
Hordeum_vulgare_HVVMRXALLeA0221I08_abyss_5kb_noEcoli.fasta 151.3 KB
Brachypodium_distachyon.v1.0.31.chr.gff3 120 MB
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo.newContigIndex 352.6 KB
Hordeum_vulgare_HVVMRXALLeA0221I08_abyss_5kb_noEcoli.fasta.cids 3 B
ecoli_DH10B.fasta 4.5 MB
small_Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo.scafSeq 4 MB
HVVMRXALLeA0221I08_cutadapt.stats 4 KB
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo_5kb_noEcoli.fasta.cids 35 B
Hordeum_vulgare_HVVMRXALLeA0221I08_abyss_1kb.fasta 153.9 KB
INPUT_kmasker_R1R2.fastq 291.7 MB
243 KB
HVVMRXALLeA0221I08_R2.fastq.stats 842 B
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo.readOnContig.gz 6.8 MB
HVVMRXALLeA0221I08_R2.fastq 152.4 MB
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo.scaf_gap 27.7 KB
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo.peGrads 41 B
Commands_GenomeAssembly.sh 38.7 KB
HVVMRXALLeA0221I08_R0_noadapter_QT.fastq.stats 819 B
Hordeum_vulgare_HVVMRXALLeA0221I08_abyss_5kb.fasta.cids 3 B
HVVMRXALLeA0221I08_R1_noadapter.fastq 150.8 MB
build_config.kmasker 263 B
HVVMRXALLeA0221I08_R1.fastq.stats 842 B
Hordeum_vulgare_HVVMRXALLeA0221I08_abyss.fasta 153.5 KB
Bdistachyon_314_v3.1.protein_RICEannotation.fa 24 MB
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo.scafStatistics 1.8 KB
ecoli_DH10B.fasta.nin 96 B
HVVMRXALLeA0221I08_R1_noadapter_QT.fastq.stats 839 B
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo_5kb.fasta.cids 90 B
BLASTN_DB_nt_QUERY_HVVMRXALLeA0221I08_soap_5k_ID90_E10.outfmt.blastn 5.8 KB
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo.ContigIndex 154.6 KB
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo.edge.gz 3.8 MB
Hordeum_vulgare_HVVMRXALLeA0221I08_abyss_5kb_NLB.fasta 148.9 KB
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo.contigPosInscaff 18.3 KB
HVVMRXALLeA0221I08_R1.fastq 151.3 MB
ecoli_DH10B.fasta.nhr 161 B
Bdistachyon_314_v3.1.protein_RICEannotation.fa.psq 21.3 MB
Brachypodium_distachyon.v1.0.31.chr.downsample.gff3 2.9 KB
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo.preArc 2.5 MB
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo.updated.edge 2.5 MB
HVVMRXALLeA0221I08_R2_noadapter_QT.fastq 146.7 MB
Bdistachyon_314_v3.1.protein_RICEannotation.fa.phr 5.5 MB
Brachypodium_distachyon.v1.0.31.dna.genome.fa 263.7 MB
LIST_of_sequences.cids 3 B
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo.vertex 1.8 MB
KMASKER_HvMrx5x_RT5_N5_MIN100_Hordeum_vulgare_HVVMRXALLeA0221I08_abyss_5kb.fasta 49.4 KB
KMASKER_HvMrx5x_N5_Hordeum_vulgare_HVVMRXALLeA0221I08_abyss_5kb.occ 376.6 KB
Bdistachyon_314_v3.1.protein_RICEannotation.fa.pin 414 KB
Hordeum_vulgare_HVVMRXALLeA0221I08_abyss_5kb_NLB.fasta.fai 27 B
BLASTP_DB_Bdist_annotation_QUERY_HVVMRXALLeA0221I08_abyss_5k_augustus_E10_T1.outfmt.blastp 532 B
Hordeum_vulgare_HVVMRXALLeA0221I08_abyss_5kb.fasta.fai 19 B
HVVMRXALLeA0221I08_soap_denovo_WGSassembly.config 289 B
Hordeum_vulgare_HVVMRXALLeA0221I08_abyss_5kb.fasta 151.3 KB
INPUT_kmasker_R1R2.fastq.stats 842 B
KMASKER_HvMrx5x_Hordeum_vulgare_HVVMRXALLeA0221I08_abyss_5kb.occ 421.2 KB
Hordeum_vulgare_HVVMRXALLeA0221I08_abyss_5kb_GENES_augustus.gff 71 KB
HVVMRXALLeA0221I08_R2_noadapter.fastq 152 MB
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo_5kb.fasta.stats 498 B
coverage.hist 3.1 KB
HVVMRXALLeA0221I08_R1_noadapter_QT.fastq 145.1 MB
small_Hordeum_vulgare_HVVMRXALLeA0221I08_abyss.fasta 4.7 KB
Hordeum_vulgare_HVVMRXALLeA0221I08_abyss_5kb_GENES_augustus.exons 66.7 KB
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo.readInGap.gz 2.5 MB
Hordeum_vulgare_HVVMRXALLeA0221I08_abyss_5kb_GENES_augustus.cds 56.4 KB
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo.Arc 18 KB
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo_5kb_noEcoli.fasta 41.7 KB
Hordeum_vulgare_HVVMRXALLeA0221I08_soap_denovo.scafSeq 4.1 MB
Download as ZIP (NOTE: ZIP Extraction using the native Windows Zip Client can fail due to file path length, please use third-party ZIP client instead)
Metadata
CONTRIBUTOR:
Alisandra Denton, Thomas Schmutzer [Show full information]
CREATOR:
Andrea Bräutigam [Show full information]
PUBLISHER: e!DAL - Plant Genomics and Phenomics Research Data Repository (PGP), IPK Gatersleben, Seeland OT Gatersleben, Corrensstraße 3, 06466, Germany
SIZE: 1.6 GB
SUBJECT: de.NBI, GCBN, training, NGS, big data
COVERAGE: none
DATE: Event: event
CREATED: TimePoint: Thu Dec 01 17:27:21 CET 2016
UPDATED: TimePoint: Thu Dec 01 17:27:21 CET 2016
LANGUAGE: en
RELATION: none
SOURCE: none
Revision: 0 - CreationDate: Thu Dec 01 17:27:21 CET 2016 - RevisionDate: Thu Dec 01 17:27:21 CET 2016