, 2008), TIGRFAM (Haft et al., 2003; Selengut et al., 2007) and COG (Tatusov et al., 1997, 2003) databases were also supplied. To visualize the annotation draft genome assembly of P. asymbiotica Kingscliff, we used the
‘gbrowse’ Generic Genome Browser. Despite the extensive redundancy in sequence coverage and the end-sequencing of large insert fosmid libraries for contig orientation and gap closure, none of the different sequencing technologies were sufficient to close the genome either alone or in combination. We will therefore discuss the different assembly programs, data and workflows and their resulting assembly statistics Selumetinib ic50 (Table S1). The VCAKE assembly, used in Workflow A, had the longest N50 length and the longest individual contig; however, it also had the largest number of contigs and the lowest mean and median contig lengths, which can be explained by an abundance of short unassembled reads. We found 18 contigs in the VCAKE assembly that were longer than 100 kb, and 88.6% of the genome (assuming a genome size of 5 Mb)
was contained within these contigs. Workflows B and C generated assemblies with similar statistics; however, Workflow C, which combined both Solexa and 454 data, performed better than Workflow B, which used only Illumina reads. It is clear that using sequences from both the 454 and the Illumina platforms in a hybrid assembly produces longer contigs. This confirms previous findings of Reinhardt and CP-690550 in vivo colleagues, who concluded that a hybrid assembly method was more successful and attributed this to the fact that combining the two sequencing technologies can compensate for the inherent
weaknesses SB-3CT of each individual technology. Paired read information from fosmid sequencing was used to verify the assemblies. Although Workflow A generated the longest contigs, it was also found to contain the most misassembled sequence. Workflow B generated the least misassemblies; however, this was the most fragmented assembly. There were 69 regions in the Kingscliff draft genome that were absent from the ATCC43949 strain, representing 10.6% of the new sequence and 91 regions of the ATCC43949 genome that were absent from the Kingscliff strain, representing 15.8% of the genome. The predicted proteome of the P. asymbiotica Kingscliff draft was compared with the complete genome sequences of related strains and species using a protein vs. a translated nucleotide sequence blast search. Figure 1 presents a visualization of the tblastn comparison using the BLASTatlas genome comparison tool (Hallin et al., 2008). Photorhabdus have a number of virulence mechanisms, such as the ability to adhere, invade and cause damage to host cells. The genomes of Photorhabdus species contain genes that express adhesins, toxins and invasins, enabling the bacteria to infect host cells. Both strains of P. asymbiotica contain many genes that are considered to be important virulence factors.