When I wrote the TCGA Toolbox and the TCGA.rppa module, one of biggest problems was to match samples and patients.
Both sample and patient files can be accessed through the Open Access HTTP Directory. For glioblastoma multiforme (gbm
), the samples can be found in the gbm/cgcc/mdanderson.org/mda_rppa_core/protein_exp/mdanderson.org_GBM.MDA_RPPA_Core.Level_3.1.0.0/
directory, and the patients can be found [in the nationwidechildrens.org_clinical_patient_gbm.txt
file in