Hilary Parker | Math 155 Project

HOME | DESIGN | MICROARRAY SAMPLES | NORMALIZATION | SIGNIFICANCE TESTING | SAM | H. CLUSTERING | PAM CLUSTERING | PAM CLASSIFICATION | CONCLUSIONS

This web page was produced as an assignment for a course on Statistical Analysis of Microarray Data at Pomona College.

Experimental design:
Tissues samples were taken from frozen surgical specimens at Stanford University Medical Center and Vancouver General Hospital. Tumor type was clinically determined and confirmed.

miRNA microarrays were printed at the Standfor functional genomics facility. The arrays had a total of 668 probes spotted in duplicate. The probes include 328 known human miRNAs, 113 mouse miRNAs, 45 rat miRNAs, 154 predicted human miRNAs and 28 control probes.

Dye assignments for the microarrays were not readily available. However, the fluorescence ratio used for analysis was described as "sample/reference", which indicates that a dye-swap was not used. Selection criteria (described here under "Data Analysis") limited them to 87 miRNAs (out of 668 probes) for analysis.

Once candidate miRNAs were identified, they were analyzed using unsupervised hierarchical clustering analysis and SAM. By doing so, they identified miRNAs that were most associated with certain tumor types.

In order to further confirm and extend the miRNA library being used, the researchers supplemented the microarray data by cloning and sequencing small RNA libraries from 10 of the sarcomas and 2 normal skeletal muscle samples. The biggest advantage to doing this was finding novel miRNAs that were not included in the aforementioned chip. This allows for a more complete miRNA profile to be constructed for each of the tumor types.

Possible Improvements:
The biggest concern in their experimental design is the use of references, which was not well described. Because they had two types of references, normal smooth muscle tissue and skeletal muscle tissue, from 5 and 2 human sources (respectively), there is no standard procedure for dealing with the references.

Furthermore, although it wasn't described, it appears as though a dye-swap was not performed. This could be a concern because of dye-bias.

Finally, though they ran "duplicates", they were on the same chip. From our reference chip picture here, it appears that the references aren't directly next to each other, which is a plus. However, ideally when things are run in duplicate, they are run on separate physical chips rather than on the same chip in different places.

My suggestion for an improved experimental design would be a balanced design as follows: figure out which sarcomas need which types of references (normal vs. skeletal) and count each of these up. Suppose that 12 tumor samples need skeletal references, and 15 need normal references. We would need, then, 12 independent skeletal samples and 15 independent normal samples. Using a chance device, choose 6 skeletal tumors and pair with 6 skeletal references. Dye the tumor RNA red and the reference green. Do the same with 7 normal tumors. Repeat the procedure, this time dying the tumor RNA green, and using 8 normal tumors and references instead of 7.

This would be my best guess at what a balanced design looks like for this experiment.