CellMinerCDB

Data Set Metadata

Provides background information about each data set. Check back for additional data sets as we make them available.    
Drug   DNA   RNA   Protein
Name Platform Information Principal Collaborators Data Description
        Drug
Compound activity: DTP NCI-60
[PMID:22802077]
Manufacturer: Developmental Therapeutics Program, NCI/NIH
Platform: Sulforhodamine assay
Molecular Target:
Platform Description: Negative log10 (GI50) values of sulforhodamine B assay for ~ 50K compounds, including more than 20,000 that passed quality control, 158 Food and Drug Administration approved and 79 clinical trial drugs. Higher values equate to higher sensitivity of cell lines.
[data repository] [Platform description]
LMP, CCR, NCI (K Kohn); DPT, NCI (J Morris); Genomics and Bioinf Gp, LMP, CCR, NCI Raw Data: Compound activity data. All available replicates for each NSC are shown without the quality control steps performed for normalized data.
Processing method: The quality control steps included in the NCI-60 Analysis Tools outputs for data reproducibility and minimum variability across cell lines are not included.
        DNA
DNA: Affy 500K SNP
[PMID:24670534]
Manufacturer: Affymetrix, Inc
Platform: GeneChip Human Mapping 500k Array Set
Molecular Target: DNA
Platform Description: This platform is used for whole-genome association studies. It is comprised of two arrays which enable genotyping of more than 500,000 single nucleotide polymorphisms (SNPs).
[data repository] [Platform description]
Genomics and Bioinf Gp, LMP, CCR, NCI (W Reinhold); Gene Logic (E Kaljian) Raw Data: Cel files
Processing method: For SNP data, the CRLMM algorithm was used to do allelic cross talk calibration, base position normalization and RMA. For Array CGH, normalization was done using the CRMAv2 normalization. Both analyses were done using the aroma.affymetrix package in R.
DNA: Fingerprinting
[PMID:19372543]
Manufacturer: Qiagen; Applied Biosystems
Platform: AmpFLSTR Identifiler PCR Amplification kit
Molecular Target: DNA
Platform Description: DNA was prepared from cells using the Qiagen Blood & Cell Culture DNA Maxi kit according to the manufacturer's protocol (Qiagen, Valencia, CA). DNA fingerprints were obtained for all cell lines using the AmpFLSTR Identifiler PCR Amplification kit (Applied Biosystems, Foster City, CA) according to the manufacturer's protocol
[data repository] [Platform description]
Genomics and Bioinf Gp, LMP, CCR, NCI (P Lorenzi, S Varma, W Reinhold) Raw Data: Number of Short Tandem Repeats (STR) at 13 combined DNA index system (CODIS) loci, the amelogenin gender-determining marker used in forensics, and two
Processing method: NA
DNA: E-cadherin Methylation
[PMID:17272646]
Manufacturer: NA
Platform: PCR amplification and sequencing of sodium bisulfite modified DNA.
Molecular Target: DNA methylation status of E-cadherin promoter.
Platform Description: Genomic DNA (5 micrograms) from each cell line was treated with sodium bisulfite at 50 degrees C for 17 hr. The DNA was then re-suspended in 125 microliters of 10-mM Tris with 1 mM EDTA pH 7.4. Nested PCR amplification and sequencing of the DNA were carried out using either converted or unconverted DNA as template for the PCR. Primers were based on the E-cad promoter DNA sequence (GenBank accession no. L34545). One-strand automated sequencing of the PCR products was performed. For primers and more detailed information see citation.
[data repository] [Platform description]
Gene Logic (D Dolginow), SAIC (D Munroe), Johns Hopkins Univ. (A Feinberg); Genomics and Bioinf Gp, LMP, CCR, NCI (W Reinhold) Raw Data: sequence tracings
Processing method: Multiple sequence tracings were grouped by bisulfite conversion and sequencing date, calculated as a group mean for each CpG within these groups, and then combined as a mean of these groups.
DNA: Exome Seq
[PMID:23856246]
Manufacturer: Illumina
Platform: Genome Analyzer
Molecular Target: DNA
Platform Description: High throughput sequencing of NCI-60 for SNPs/variants
[data repository] [Platform description]
Genetics Branch, NCI, NIH (O Abaan, S Davis, P Meltzer); Genomics and Bioinf Gp, LMP, CCR, NCI (W Reinhold, Y Pommier); Division of Cancer Treatment and Diagnosis, NCI, NIH (S Holbeck, R Simon, J Doroshow) Raw Data: FASTQ Sequence File
Processing method: Segmental duplications and those variants that map to more than one genomic location have been removed.
DNA: Illumina 1M SNP
[PMID:24670534]
Manufacturer: Illumina
Platform: Human 1M-Duo Beadchip
Molecular Target: DNA
Platform Description: BeadChip array based on Illumina's Infinium Assay with probes for 1072820 SNPs
[data repository] [Platform description]
Core Genotyping Facility, NIH (A Hutchinson); Genomics and Bioinf Gp, LMP, CCR, NCI Raw Data: Red and Green IDAT files for each cell line
Processing method:
DNA: Roche NimbleGen 385k aCGH
[PMID:20215517]
Manufacturer: Roche NimbleGen Systems, Inc
Platform: HG19 CGH 385K WG Tiling v2.0
Molecular Target: DNA
Platform Description: See link below.
[data repository] [Platform description]
Genomics and Bioinf Gp, LMP, CCR, NCI Raw Data: PAIR files
Processing method: Subtracted estimate of probe bias for each probe. A detailed explanation of the probe bias calculation is included with the normalized download.
DNA: Sanger sequencing
[PMID:17088437]
Manufacturer: ABI 3730 Sequencer
Platform: Sequencing
Molecular Target: DNA
Platform Description: Sequencing of the NCI-60 for mutations in known human cancer genes. The coding exons and immediate flanking intron sequences of selected genes from the Cancer Gene Census were PCR-amplified and sequenced.
[data repository] [Platform description]
Wellcome Trust Sanger Institute (M Stratton); Genomics and Bioinf Gp, LMP, CCR, NCI Raw Data: NA
Base-calling method: Mutation surveyor, Chromas, in-house Sanger software
DNA: aCGH Agilent 44K
[PMID:24670534]
Manufacturer: Agilent Technologies, Inc.
Platform: Human Genome CGH Microarray 44A and 44B
Molecular Target: DNA
Platform Description: A genome-wide DNA copy number variation profiling array containing 43,000 plus human sequences from both coding and non-coding regions.
[data repository] [Platform description]
Genomics and Bioinf Gp, LMP, CCR, NCI (W Reinhold) Raw Data: ratio of sample vs control of DNA copy number, normal female DNA 46,XX genomic DNA was obtained from Promega (Madison, WI)
Processing method: Agilent Feature Extraction done with software version 8.1 with default settings for CGH arrays.
DNA: Illumina 450K methylation
Manufacturer: Illumina
Platform: Infinium HumanMethylation450 BeadChip Kit
Molecular Target: DNA
Platform Description: Approximately 450,000 probes querying the methylation status of CpG sites within and outside of genes.
[data repository] [Platform description]
Manel Esteller (Cancer Epigenetics and Biology Program (PEBC), Bellvitge Biomedical Research Institute (IDIBELL)), Paul Meltzer (NCI) Raw Data:
Processing method: Beta values normalized to a value between 0 (unmethylated) and 1 (methylated) using the R methylumi package
DNA: Combined aCGH
[PMID:24670534]
Manufacturer: See individual platform information
Platform: Probe intensities combined from four platforms
Molecular Target: DNA
Platform Description: Probe intensities combined from four platforms: Agilent Human Genome CGH Microarray 44A, Nimblegen HG19 CGH 385K WG Tiling v2.0, Affymetrix GeneChip Human Mapping 500k Array Set and Illumina Human1Mv1_C Beadchip
[data repository] [Platform description]
See individual platform information Raw Data: See individual platform information
Base-calling method: NA
        RNA
RNA: 5 Platform Gene Transcript
[PMID:21159603]
Manufacturer: See individual platform information
Platform: Expression values combined from five platforms
Molecular Target: RNA
Platform Description: Expressions from five platforms (Affymetrix: HG-U95; HG-U133; HG-U133 Plus 2.0; HG Exon 1.0 ST. Agilent: Whole Human Genome Oligo array) were combined.
[data repository] [Platform description]
See individual platform information Raw Data: See individual platform information
Processing method: Expression values for each gene were converted to z-scores. Probes from the 5 platforms that correspond to the gene were selected based on a QC step. The average z-score using the selected probes was reported as the combined gene expression for the gene.
RNA: RNA-seq
[PMID:31113817]
Manufacturer: Illumina
Platform: HiSeq 2000
Molecular Target: RNA
Platform Description: High throughput sequencing of NCI-60 transcripts.
Genetics Branch, NCI, NIH (O Abaan, S Davis, P Meltzer); Genomics and Bioinf Gp, LMP, CCR, NCI (W Reinhold, Y Pommier); Division of Cancer Treatment and Diagnosis, NCI, NIH (S Holbeck, R Simon, J Doroshow) Raw Data: sequence tracings
Processing method: RNA was quantified and treated with DNAse according to the manufacturers protocol (Qiagen, Inc). RNA was used for generating sequencing libraries using the TotalScript™ RNA-Seq Kit (Epicentre). The libraries were sequenced at the Center for Cancer Research Sequencing facility using the HiSeq 2000 (Illumina) using the TruSeq Cluster Kit v3 (Illumina). Data was converted to fastq and aligned back to the human genome assembly 19 with the STAR split-read aligner. RNAseQC was used to analyze the data files to determine the data quality.
RNA: OSU Transporter Array
[PMID:15205344]
Manufacturer: Ohio State University
Platform: Spotted Array
Molecular Target: RNA
Platform Description: Spotted 70-mer microarray
[data repository] [Platform description]
OSU (W Sadee); Genomics and Bioinf Gp, LMP, CCR, NCI Raw Data:
Processing method: Normalization based on robust, locally linear fits (Loess), implemented in the SMA R package
RNA: microRNA OSU V3 chip
[PMID:17483436]
Manufacturer: Ohio State University Comprehensive Cancer Center
Platform: OSU-CCC_hsa-miRNA-chip_v3 For array design, see: V3 design protocol A-MEXP-620 published on ArrayExpress (www.ebi.ac.uk/arrayexpress/)
Molecular Target: RNA
Platform Description: See: Liu CG, Calin GA, Meloon B, Gamliel N, Sevignani C, Ferracin M, Dumitru CD, Shimizu M, Zupo S, Dono M, Alder H, Bullrich F, Negrini M, and Croce CM. An oligonucleotide microchip for genome-wide microRNA profiling in human and mouse tissues. Proc Natl Acad Sci U S A 2004;101:9740-4.
[data repository] [Platform description]
CCC, OSU (P Blower); Genomics and Bioinf Gp, LMP, CCR, NCI Raw Data: Two channel sample and background intensity measurements.
Processing method: Human probes were selected and expression was measured as the log of the median signal intensity minus background (thresholded to a minimum of 1) and duplicate spots were averaged. Each sample was quantile normalized and the replicates averaged for each cell line.
RNA: Agilent Human microRNA (V2)
[PMID:20442302]
Manufacturer: Agilent Technologies
Platform: Human miRNA Microarray (V2)
Molecular Target: RNA
Platform Description: 15,000 probes for 723 human and 76 human viral miRNA's. Each slide contains 8 arrays.
[data repository] [Platform description]
Agilent Technologies (S Fulmer-Smentek, P D'Andrade); Genomics and Bioinf Gp, LMP, CCR, NCI Raw Data: Data in tab delimited text files. Array quality information as PDF files
Processing method: GreenSpringGX
MicroRNA location based on HG18
RNA: Agilent mRNA
[PMID:20442302]
Manufacturer: Agilent Technologies
Platform: Whole Human Genome Microarray, 4 x 44K
Molecular Target: RNA
Platform Description: 44,000 Probes for approximately 41,000 genes, with 4 arrays spotted on each slide.
[data repository] [Platform description]
Agilent Technologies (S Fulmer-Smentek, P D'Andrade); Genomics and Bioinf Gp, LMP, CCR, NCI (W Reinhold) Raw Data: Text files with data and PDF files with quality information
Processing method: Processed using GeneSpring
RNA: Affy HuEx 1.0
[PMID:20215517]
Manufacturer: Affymetrix, Inc
Platform: GeneChip Human Exon 1.0 ST
Molecular Target: RNA
Platform Description: 1432155 probesets for all human gene exons
[data repository] [Platform description]
Bioinf Gp, LMP, CCR, NCI (W Reinhold); Affymetrix (T Gingeras); GeneLogic (E Kaldjian); MD Anderson (J Weinstein) Raw Data: cel files
Processing method: GCRMA
RNA: Affy HG-U133 Plus 2.0
[PMID:20215517]
Manufacturer: Affymetrix, Inc
Platform: Human Genome U133 Plus 2.0
Molecular Target: RNA
Platform Description: Aproximately 47,000 transcripts. For more detail see link below.
[data repository] [Platform description]
Genomics and Bioinf Gp, LMP, CCR, NCI (W Reinhold) Raw Data: cel files
Processing method: GCRMA, RMA
59 cell lines - ME:MDA_N is excluded
RNA: Affy HG-U133(A-B)
[PMID:17339364]
Manufacturer: Affymetrix, Inc
Platform: Human Genome U133
Molecular Target: RNA
Platform Description: 44,000 probeset 2-chip set
[data repository] [Platform description]
Gene Logic (U Scherf, E Kaljian); Genomics and Bioinf Gp, LMP, CCR, NCI (W Reinhold) Raw Data: cel files
Processing method: GCRMA, MAS5, RMA
59 cell lines - LC:NCI_H23 is excluded
RNA: Affy HG-U95(A-E)
[PMID:17339364]
Manufacturer: Affymetrix, Inc
Platform: Human Genome U95
Molecular Target: RNA
Platform Description: 65,000 probeset 5-chip set
[data repository] [Platform description]
Gene Logic (U Scherf, D Dolginow); Genomics and Bioinf Gp, LMP, CCR, NCI (W Reinhold) Raw Data: cel files (NCI60 only)
Processing method: GCRMA, MAS5, RMA
RNA: ABC Transporters (RT-PCR)
[PMID:15324696]
Manufacturer: NA
Platform: RT_PCR
Molecular Target: RNA
Platform Description: 47 specific oligonucleotide probes were designed for each of the ABC transporters using DNAStar Primer Select. Expression levels were measured by real-time quantitative RT-PCR using the LightCycler RNA Amplification SYBR Green kit and a LightCycler machine.
[data repository] [Platform description]
LCB, CCR, NCI (M Gottesman, G Szakacs, J Ludwig); Genomics and Bioinf Gp, LMP, CCR, NCI Raw Data: Crossing point values(log2)
Processing method: Mean-centered, log2
        Protein
Protein: SWATH (Mass spectrometry)
[PMID:25730263]
Manufacturer: SCIEX
Platform: SCIEX TripleTOF 5600 System mass spectrometer
Molecular Target: Protein
Platform Description: Cell pellets were processed using pressure cycling technology (PCT) using the barocycler NEP2320. Each digested samples was SWATHed using SCIEX TripleTOF 5600 System mass spectrometer in duplicates.
[data repository]
Institute of Molecular Systems Biology, ETH Zurich (Tiannan Guo, Ruedi Aebersold) Raw Data:
Processing method: log10 transformation of peak area for each peptide precursor. Only SwissProt proteins are included in this data set.
Protein: Lysate Array
[PMID:14623978]
Manufacturer: NCI LMP Genomics and Bioinformatics Group (Nishizuka)
Platform: Reverse-phase lysate arrays (RPLA) using antibodies described in AbMiner at http://discover.nci.nih.gov/abminer/.
Molecular Target: Protein
Platform Description: Reverse-phase lysate arrays (RPLA) for 162 antibodies for 94 genes. Each array included 64 lysates (60 cancer cells and 4 replicate control pools) in 10 serial two-fold dilutions.
[data repository] [Platform description]
LP, CCR, NCI (L Liotta); CBER, FDA (E Petricoin); Genomics and Bioinf Gp, LMP, CCR, NCI Raw Data: Proteins detected using the catalyzed signal amplification system as described previously (Nishizuka et al, PNAS, 2003).
Processing method: Dose interpolation analysis done using the DI25 algorithm. described previously (Nishizuka et al, PNAS, 2003).

Go to top