Methods
The team sampled rapeseed from a collection of approximately 8,000 oilseed accessions that were housed at the National Mid-term Gene Bank of Oil Crops Research Institute at the Chinese Academy of Agriculture Sciences. The researchers identified 488 core collections of rapeseeds which were chosen to represent the global range of geographic distribution and phenotypic diversity. The accessions were grown in the laboratory for several generations, using self-pollination techniques. The 410 rapeseed accessions used were divided into three ecotypes: spring, semi-winter, and winter type accessions.
Following this, the team collected young leaves from each rapeseed accession, and genomic DNA was extracted. Following this, sequencing was carried out by Novogene using the Illumina HiSeq 4000 platform, generating 150bp paired-end reads. In addition, 135 of the 418 rapeseed accessions were selected for resequencing with high coverage on the Illumina HiSeq X Ten platform. Sequencing yielded 5.6 Tb of data. To avoid artificial bias, the data was cleaned by removing adaptors and low-quality reads before the paired-end reads were mapped to the Brassica napus reference genome ‘Darmor-bzh’ using BWA software. After alignment, the researchers performed SNP and InDel calling on a population scale using GATK and only the high-quality SNPs were retained for subsequent analysis.
The research team carried out a variety of subsequent analyses to identify which possible genes and associated loci are responsible for specific agronomic traits. This included population genetic and LD analysis, introgression analysis, identification of selective sweep signals, GWAS, phylogenetic analysis, Hi-C experiments, expression analysis, gene cloning, and plant transformation.