Introduction
CellMinerCDB is an interactive web application that simplifies access to and exploration of cancer cell line pharmacogenomic data from different sources. The current version is dedicated to Small Cell Lung Cancer (SCLC) cell lines (see the Metadata section for more details). Navigation uses the main menu tabs (see the figure below). There are 6 tabs: Univariate Analyses, Multivariate Analysis, Metadata, Search, Help and Video tutorial. Univariate Analyses is selected by default when entering the site. Each tab includes a sidebar menu for choosing inputs and an output area that displays the results. Analysis options are available at the top of both the Univariate Analyses and Multivariate Analysis (regression model) tabs (see the sub-menu in the figure). The result of the first sub-menu option is displayed by default (Figure 1).
Figure 1: Main application interface
Univariate Analyses
Molecular and/or drug response patterns across sets of cell lines can be compared to look for possible associations. The Univariate Analyses panel includes 4 options: Plot Data, Download Data, Compare Patterns and Tissue Correlation. Almost all options take the same input data, entered in the left side panel.
- The x-axis data choices include 4 fields to be filled in by the user:
  - x-Axis Cell Line Set selects the data source. The user can choose NCI/DTP SCLC, CCLE, GDSC, CTRP or UTSW (see Data Sources for more details).
  - x-Axis Data Type selects the data type to query. The available options depend on the source selected above and appear in the x-Axis Data Type dropdown. See the Metadata tab for descriptions and abbreviations.
  - Identifier selects the identifier of interest for the data type selected above. For instance, if drug activity for NCI/DTP SCLC is selected, the user can enter a single drug name, a drug ID (NSC number) or a paired drug ID (NSC1_NSC2). The Search IDs tab can be used to explore potential identifiers interactively or to download datasets of interest.
  - x-Axis Range lets the user control the x-axis range for better visualization.
- The y-axis data choices are the same as those explained above for the x-axis.
- Selected tissues: by default, all tissues are selected and included in the scatter plot. To include or exclude cell lines from specific tissues, the user should set:
  - Select Tissues, to include or exclude specific tissues.
  - Select Tissues of Origin Subset/s, at the bottom of the left-hand panel, to pick the tissues of origin. On Macs, more than one tissue of origin may be selected by holding the “command” key; on PCs, use the “control” key. All cell lines were mapped to the four-level OncoTree cancer tissue type hierarchy developed at Memorial Sloan-Kettering Cancer Center. In the CellMinerCDB application, a tissue value is coded as an OncoTree node that can include elements from level 1 to level 4, separated by the “:” character. For instance, the cell line DMS-79 is a “Lung” cell line and, more specifically, a Small Cell Lung Cancer cell line. DMS-79 therefore belongs to two cancer tissue types (or hierarchical nodes): “Lung” (level 1) and “Lung: Small Cell Lung Cancer (SCLC)” (level 2). There is no further sub-categorization for DMS-79.
- Color selection
  - Tissues to Color locates cell lines from the desired tissues within the scatter plot. By default, cell lines are colored by the pre-assigned color of their OncoTree level 1 cancer tissue. Selecting a tissue makes the related cell lines appear in red, while the remaining cell lines are colored blue. The Show Color checkbox must be active.
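The “:”-separated tissue coding described above can be illustrated with a short Python sketch. It is not CellMinerCDB's own code: the function names `oncotree_nodes` and `in_tissue` are hypothetical, and it assumes that level names never contain a “:” themselves and that whitespace around the separator is not significant.

```python
def oncotree_nodes(tissue_value, sep=":"):
    """Return every hierarchical node a tissue value belongs to, level 1 first.

    Hypothetical helper: assumes level names contain no separator character.
    """
    # Split into levels and drop surrounding whitespace and empty pieces.
    levels = [part.strip() for part in tissue_value.split(sep) if part.strip()]
    # Node at depth k is the first k levels joined by the separator.
    return [sep.join(levels[:depth]) for depth in range(1, len(levels) + 1)]


def in_tissue(tissue_value, selected_node, sep=":"):
    """True if a cell line with this tissue value falls under the selected node."""
    selected_levels = oncotree_nodes(selected_node, sep)
    if not selected_levels:
        return False
    # The last entry is the normalized form of the selected node itself.
    return selected_levels[-1] in oncotree_nodes(tissue_value, sep)


# Tissue value for DMS-79, written as in the text above.
dms79 = "Lung: Small Cell Lung Cancer (SCLC)"
print(oncotree_nodes(dms79))
print(in_tissue(dms79, "Lung"))
```

Under these assumptions, selecting either the level 1 node or the level 2 node would keep DMS-79, which mirrors the statement that the cell line belongs to both hierarchical nodes.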
Plot Data
Any pair of features from different sources can be plotted across their common cell lines (as a scatter plot), together with the resulting Pearson correlation and p-value. The p-value estimates assume multivariate normal data and become less reliable as the data deviate from this assumption. Please use the scatter plot to check the data distribution (e.g., for outlying points outside of a more elliptically concentrated set).
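As a rough illustration of the computation described above, the following Python sketch restricts two features to their common cell lines and then computes the Pearson correlation with a two-sided p-value. It is an assumption-laden sketch, not the application's implementation: the function names and the example values are made up, and the p-value is obtained by numerically integrating the null density of r under bivariate normality, f(r) = (1 - r^2)^((n - 4) / 2) / B(1/2, (n - 2) / 2), which requires at least 4 common cell lines.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant feature")
    return sxy / math.sqrt(sxx * syy)


def two_sided_p_value(r, n, steps=20000):
    """Two-sided p-value for r from n points, assuming bivariate normal data.

    Integrates the null density of r from |r| to 1 with the midpoint rule.
    """
    if n < 4:
        raise ValueError("this sketch needs at least 4 data points")
    exponent = (n - 4) / 2
    # log B(1/2, (n - 2) / 2), computed with lgamma for numerical stability.
    log_beta = math.lgamma(0.5) + math.lgamma((n - 2) / 2) - math.lgamma((n - 1) / 2)
    lower = min(abs(r), 1.0)
    width = (1.0 - lower) / steps
    tail = width * sum(
        (1.0 - (lower + (i + 0.5) * width) ** 2) ** exponent for i in range(steps)
    )
    return min(1.0, 2.0 * tail / math.exp(log_beta))


def correlate_common(x_by_line, y_by_line):
    """Correlate two features over the cell lines present in both sources."""
    common = sorted(set(x_by_line) & set(y_by_line))
    if len(common) < 4:
        raise ValueError("fewer than 4 common cell lines")
    xs = [x_by_line[name] for name in common]
    ys = [y_by_line[name] for name in common]
    r = pearson_r(xs, ys)
    return common, r, two_sided_p_value(r, len(common))


# Made-up values keyed by hypothetical cell line names, one dict per source.
x_source = {"LINE_A": 1.0, "LINE_B": 2.0, "LINE_C": 3.0, "LINE_D": 4.0, "LINE_E": 5.0}
y_source = {"LINE_B": 2.1, "LINE_C": 2.9, "LINE_D": 4.2, "LINE_E": 4.8, "LINE_F": 9.0}
common, r, p = correlate_common(x_source, y_source)
print(common, round(r, 3), round(p, 3))
```

Only cell lines present in both sources contribute, so the sample size can be much smaller than either source alone. Because the p-value rests on the normality assumption, a single outlying cell line can dominate both r and p, which is why inspecting the scatter plot is recommended.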
The icons at the top of the plot provide the following options, from left to right:
- Downloads the plot as a PNG file.
- Allows the user to zoom in on an area of interest by clicking and dragging with the pointer.
- Autoscales the image.
- Allows the user to create horizontal and vertical lines from either a cell line dot or the regression line by hovering over it.