Part 4: Illumina MiSeq & Result Interpretation
The paired reads were assembled into a whole genome sequencing using Unicycler V0.4.8.0 within Galaxy V19.09.rc1 while filtering out contigs smaller than 200bp. Quast V5.0.2 was used to generate statistics on the final assemblies. The assemblies were submitted as whole-genome shotgun sequences to Genbank for annotation using the NCBI Prokaryotic Genome Assembly Pipeline to identify Open Reading Frames (ORFs), rRNAs, and tRNAs.
The 16S regions of the sequences were used to identify the bacterial genus using the Ribosomal Database Project (RDP) V11.5. The whole genome sequence was also used to identify the bacterial genus using Type (Strain) Genome Server (TYGS) V182. The Orthologous Average Nucleotide Identity Tool (OAT) V0.93.1 was then used to determine the relatedness of the selected strains one such result is shown to the right.
A search for phage sequences was performed using Phaster was run to determine if the presence of phage genomes could account for differences in genomic sequence of those related by 97.39%. The result of which indicated that a 37Kb phage sequence in RIT702 could potentially account for the difference between the two genomes.
Part5: AntiSMASH Analysis
The full alignments were uploaded to the antiSMASH V5.1 online program to identify potential secondary metabolite production (Blin, 2019). The program was set to run KnownClusterBlast, ClusterBlast, SubClusterBlast and ActiveSiteFinder.