Dr James Bray

Research Interests

I am a bioinformatics scientist working in the bacterial genomics field.

Bacterial species identification using ribosomal multilocus sequence typing (rMLST)

I curate the ribosomal multilocus sequence typing (rMLST) scheme and associated databases. The goal of rMLST is the accurate taxonomic classification of all bacterial isolates using the DNA sequences found within the ribosomal protein-encoding genes (pubmlst.org/species-id).

My research involves applying rMLST in three ways:

  1. Performing phylogenetic analysis for taxonomic investigations
  2. Defining ribosomal sequence types for the rapid species identification of bacterial isolates
  3. Developing and maintaining a 'one-click' species identification server

Large-scale bacterial genomics projects

I am responsible for assembling high-throughput DNA sequencing data for the large-scale genomic projects with the Maiden Group (with a focus on Campylobacter, Neisseria and Streptococcus). As a result, I manage the exchange of large amounts of data between sequencing centres, different labs and the ENA Sequence Read Archive (SRA).

Assembled genomes are accessible on the PubMLST website (pubmlst.org) for bacterial population studies, pathogen surveillance and gene-by-gene analysis.

My research interests include:

  • Pathogen strain identification
  • Bacterial genome annotation and protein structure/function prediction
  • Knowledge-based structural genomics target selection
Publications
The list was updated

Utilising large and diverse bacterial genome datasets to improve the detection and identification of Streptococcus pneumoniae via PCR-based diagnostics

Origin and Evolution of Bacterial Periplasmic Force Transducers.

Development of a core genome multilocus sequence typing (cgMLST) scheme and life identification number (LIN) code classification system forStaphylococcus aureus

Castanet: a pipeline for rapid analysis of targeted multi-pathogen genomic data

Development of the Pneumococcal Genome Library, a core genome multilocus sequence typing scheme, and a taxonomic life identification number barcoding system to investigate and define pneumococcal population structure

Targeted metagenomics reveals association between severity and pathogen co-detection in infants with respiratory syncytial virus

Castanet: a pipeline for rapid analysis of targeted multi-pathogen genomic data

Development of the Pneumococcal Genome Library, a core genome multilocus sequence typing scheme, and a taxonomic life identification number barcoding system to investigate and define pneumococcal population structure

More
Contact Details
E: james.bray@biology.ox.ac.uk   
T: 01865 281067  
  Maiden Lab profile