6 Finding unharmonized TOPMed study phenotypes on dbGaP
6.1 Ways to get TOPMed phenotype data
- Get DCC harmonized phenotypes from the Exchange Area
 - Get (harmonized or unharmonized) phenotypes directly from the studies (transfer via the Exchange Area)
 - Get unharmonized phenotypes from dbGaP
 
6.2 dbGaP accession lingo
- study accession: A unique identifier, phs, that specifies a study on dbGaP
 - parent accession: The phs that holds the subject, sample, and phenotype data for a study
 - child accession: The phs that holds the genotype or other omics data for a project within a parent study
 - TOPMed accession: The phs that will hold TOPMed sequence data
- Currently a separate parent phs
 - Will eventually be made into a child phs connected to the original parent phs
 
 - dataset accession: A unique identifier, pht, that specifies a dataset within a study
 - variable accession: A unique identifier, phv, that specifies a variable
 
6.3 dbGaP advanced search tools
6.3.1 Entrez advanced search
Search strings for Entrez
# All variables within four studies
(phs000007[Belongs To] OR phs000286[Belongs To] OR phs000284[Belongs To] OR phs000462[Belongs To])
# All variables with "bmi" in the variable name within four studies
(phs000007[Belongs To] OR phs000286[Belongs To] OR phs000284[Belongs To] OR phs000462[Belongs To]) AND bmi[Variable Name]
# All variables with "bmi" in the variable description within four studies
(phs000007[Belongs To] OR phs000286[Belongs To] OR phs000284[Belongs To] OR phs000462[Belongs To]) AND bmi[Variable Description]
# All variables with "bmi" in the variable name or description within four studies
(phs000007[Belongs To] OR phs000286[Belongs To] OR phs000284[Belongs To] OR phs000462[Belongs To]) AND (bmi[Variable Description] OR bmi[Variable Name])
6.3.2 Faceted advanced search
Saved URLs for Faceted search examples