Data for: "Recombination in pe/ppe genes contributes to genetic variation in Mycobacterium tuberculosis lineages"
Approximately 10% of the Mycobacterium tuberculosis genome is made up of two families of genes that are poorly characterized due to their high GC content and highly repetitive nature. The PE and PPE families are typified by their highly conserved N-terminal domains that incorporate proline-glutamate (PE) and proline-proline-glutamate (PPE) signature motifs. They are hypothesised to be important virulence factors involved in host-pathogen interactions, but their high genetic variability and complexity of analysis mean they are typically disregarded in genome studies. To elucidate the structure of these genes, 518 genomes from a diverse international collection of clinical isolates were de novo assembled. A further 21 reference M. tuberculosis complex genomes and long read sequence data were used to validate the approach. SNP analysis revealed that variation in the majority of the 168 pe/ppe genes studied was consistent with lineage. Several recombination hotspots were identified, notably pe_pgrs3 and pe_pgrs17. Evidence of positive selection was revealed in 65 pe/ppe genes, including epitopes potentially binding to major histocompatibility complex molecules. This, the first comprehensive study of the pe and ppe genes, provides important insight into M. tuberculosis diversity and has significant implications for vaccine development.
Keywords
pe/ppe genes; Mycobacterium tuberculosis genome; Mycobacterium tuberculosis

| Item Type | Dataset |
|---|---|
| Resource Type | Dataset (Resource Description: UNSPECIFIED) |
| Capture method | Experiment |
| Date | February 2016 |
| Language(s) of written materials | English |
| Creator(s) | Phelan, JE; Coll, F; Bergval, I; Anthony, RM; Warren, R; Sampson, SL; Gey van Pittius, NC; Glynn, JR; Crampin, AC; Alves, A; Bessa, TB; Campino, S; Dheda, K; Grandjean, L; Hasan, R; Hasan, Z; Miranda, A; Moore, D; Panaiotov, S; Perdigao, J; Portugal, I; Sheen, P; de Oliveira Sousa, E; Streicher, EM; van Helden, PD; Viveiros, M; Hibberd, ML; Pain, A; Mcnerney, R and Clark, TG |
| LSHTM Faculty/Department | Faculty of Epidemiology and Population Health > Dept of Infectious Disease Epidemiology (-2023); Faculty of Infectious and Tropical Diseases > Dept of Clinical Research; Faculty of Infectious and Tropical Diseases > Dept of Pathogen Molecular Biology (-2019) |
| Participating Institutions | London School of Hygiene & Tropical Medicine, London, United Kingdom |
| Funders | Project funder, grant number, and funder URI: UNSPECIFIED |
| Date Deposited | 28 Apr 2017 10:21 |
| Last Modified | 28 Sep 2018 21:19 |
| Publisher | London School of Hygiene & Tropical Medicine |
Explore Further
- Phelan, Jody
- Coll, Francesc
- Glynn, Judith
- Crampin, Amelia C.
- Campino, Susana
- Dheda, Keertan
- Grandjean, Louis
- Hasan, Rumina
- Moore, David
- Hibberd, Martin
- Mcnerney, Ruth
- Clark, Taane
- Biotechnology & Biological Sciences Research Council
- Bloomsbury Research Fund
- Department of Science and Technology and National Research Foundation
- King Abdullah University of Science & Technology
- Medical Research Council
- Dept of Infectious Disease Epidemiology (-2023)
- Dept of Clinical Research
- Dept of Pathogen Molecular Biology (-2019)
No files available. Please consult associated links.
- PathogenSeq file store (Online Data Resource)
- PE/PPE gene diversity project (Project)