Proksee provides users with a powerful, easy-to-use, and feature-rich system for assembling, annotating, analysing, and visualizing bacterial genomes. Proksee accepts Illumina sequence reads as compressed FASTQ files or pre-assembled contigs in raw, FASTA, or GenBank format. Alternatively, users can supply a GenBank accession or a previously generated Proksee map in JSON format. Proksee then performs assembly (for raw sequence data), generates a graphical map, and provides an interface for customizing the map and launching further analysis jobs. Notable features of Proksee include unique and informative assembly metrics provided via a custom reference database of assemblies; a deeply integrated high-performance genome browser for viewing and comparing analysis results at individual base resolution (developed specifically for Proksee); an ever-growing list of embedded analysis tools whose results can be seamlessly added to the map or searched and explored in other formats; and the option to export graphical maps, analysis results, and log files for data sharing and research reproducibility. All these features are provided via a carefully designed multi-server cloud-based system that can easily scale to meet user demand and that ensures the web server is robust and responsive.


PHASTEST (PHAge Search Tool with Enhanced Sequence Translation) is a web server designed to support the rapid identification, annotation and visualization of prophage sequences within bacterial genomes and plasmids. PHASTEST also supports extensive annotation and interactive visualization of all other genes (protein coding regions, tRNA sequences and rRNA sequences) in those same genomes. PHASTEST can now process a typical bacterial genome in 3.2 minutes from the raw sequence alone or in 1.3 minutes when given a pre-annotated GenBank file.


PlasMapper 3.0 is a web server that allows users to generate, edit, annotate and interactively visualize publication quality plasmid maps. It is the successor to PlasMapper 2.0 and offers many features found only in commercial plasmid mapping/editing packages. PlasMapper 3.0 allows users to paste or upload plasmid sequences as input or to upload existing plasmid maps from its large database of > 2000 pre-annotated plasmids (PlasMapDB). This database can be searched by plasmid names, sequence features, restriction sites, preferred host organism, and sequence length. PlasMapper 3.0 also supports the annotation of new or never-before-seen plasmids using its own feature database (FeatureDB) that contains common promoters, terminators, regulatory sequences, replication origins, selectable markers and other features found in most cloning vectors and plasmids.


DrugBank is a comprehensive, free-to-access, online database containing information on drugs and drug targets. It combines detailed drug (i.e. chemical, pharmacological and pharmaceutical) data with comprehensive drug target (i.e. sequence, structure, and pathway) information. DrugBank is widely used by the drug industry, medicinal chemists, pharmacists, physicians, students and the general public. Because of its broad scope, comprehensive referencing, and detailed data descriptions, DrugBank is enabling major advancements across the data-driven medicine industry.


The Human Metabolome Database (HMDB) is a freely available electronic database containing detailed information about small molecule metabolites found in the human body. It is intended to be used for applications in metabolomics, clinical chemistry, biomarker discovery and general education. The database is designed to contain or link three kinds of data: 1) chemical data, 2) clinical data, and 3) molecular biology/biochemistry data.

The Sequence Manipulation Suite

A collection of simple programs for generating, formatting, and analyzing short DNA and protein sequences. The Sequence Manipulation Suite is commonly used by molecular biologists, for teaching purposes, and for program and algorithm testing.


A Java package for generating high quality, navigable maps of circular genomes. Its primary purpose is to serve as a component of sequence annotation pipelines. Feature information and rendering options are supplied to the program using an XML file or a tab-delimited file. CGView converts the input into a graphical map (PNG, JPG, or SVG format), complete with labels, a title, and legends. In addition to the default full view map, the program can generate a series of hyperlinked maps showing expanded views. The linked maps can be explored using any web browser, allowing rapid genome browsing, and facilitating data sharing. The feature labels in maps can be hyperlinked to external resources, allowing CGView maps to be integrated with existing web site content or databases.


CGView.js is a circular and linear map viewer for microbial and organellar genomes. Written in JavaScript and based on the original Java-based CGView, CGView.js creates high-quality, interactive maps that can be easily embedded in web pages. A comprehensive API provides actions to manipulate map components and hooks for integration with third-party tools.

CGView Comparison Tool (CCT)

The CGView Comparison Tool (CCT) is a package for visually comparing bacterial, plasmid, chloroplast, and mitochondrial sequences. The comparisons are conducted using BLAST, and the BLAST results are presented in the form of graphical maps that can also show sequence features, gene and protein names, COG category assignments, and sequence composition characteristics. CCT can generate maps in a variety of sizes, including 400 Megapixel maps suitable for posters. Comparisons can be conducted within a particular species or genus, or all available genomes can be used. The entire map creation process, from downloading sequences to redrawing zoomed maps, can be completed easily using scripts included with CCT. User-defined features or analysis results can be included on maps, and maps can be extensively customized.


A Perl script that uses NCBI's Entrez Programming Utilities to perform searches of NCBI databases. The script can be used to search any of the available databases using the Entrez query syntax, and records can be returned in a variety of formats.


Quickly make a PowerPoint presentation from a directory of URLs, images, PDFs, CSV files, and code snippets.


Command-line tools, commands, and code snippets for performing routine data processing and bioinformatics tasks.


Links to resources related to bioinformatics and data analysis.