Computing for Research I
Spring 2013
Description: Students learn to use the primary statistical software packages for data manipulation and analysis, including (but not limited to) R, R Bioconductor, SAS, SAS macro, and Stata. Additionally, students will learn how to use the division's high-speed cluster-computing environment, how to practice the principles of reproducible research using Sweave in R, and how to use LaTeX and BibTeX for manuscript and presentation development. This is a three-credit course.
Course Organization: This course is given by the faculty members in the division. Instructors will take turns giving lectures in their areas of expertise.
Textbooks: No textbook. Reading material (primarily found on the web) will be provided as necessary.
Prerequisites: Biometry 700
Grading: Instructors will give short exercises, to be completed and turned in to the primary instructor by the Thursday of the week after they were assigned (e.g., assignments given on Tuesday Feb 5 and Thursday Feb 7 are both due on Thursday Feb 14). The assignments count equally and together make up 75% of the course grade. A final project accounts for 20% of the course grade, and the remaining 5% reflects class participation.
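As a rough illustration of how the three components combine, here is a minimal Python sketch. The function name `course_grade` and the assumption that every component is scored on a 0-100 scale are hypothetical; the syllabus only fixes the 75/20/5 weights and the equal weighting of assignments.

```python
from statistics import mean

# Weights in percent, taken from the Grading paragraph.
HOMEWORK_WEIGHT = 75
PROJECT_WEIGHT = 20
PARTICIPATION_WEIGHT = 5


def course_grade(homework_scores, final_project, participation):
    """Combine component scores into a course grade.

    Assumption (not stated in the syllabus): every score is on a 0-100 scale.
    """
    if not homework_scores:
        raise ValueError("at least one homework score is required")
    homework_average = mean(homework_scores)  # every assignment counts equally
    weighted = (HOMEWORK_WEIGHT * homework_average
                + PROJECT_WEIGHT * final_project
                + PARTICIPATION_WEIGHT * participation)
    return weighted / 100


print(course_grade([100, 80], final_project=90, participation=100))  # 90.5
```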
Homework Policy: Homework is due by 5pm on the due date. All homework should be emailed to the primary instructor (garrettm@musc.edu) or turned in at lecture time. Asking for extensions on homework is strongly discouraged. However, extenuating circumstances may occasionally arise, so each student may request an extension twice, and each extension may be no longer than 2 days. You must notify the primary instructor that you are requesting an extension before the assignment is due. After you have used two extensions, no more will be granted except with a medical note.
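The due-date rule from the Grading section, combined with the 5pm deadline and the 2-day extension limit, can be sketched in Python as follows. The function names are hypothetical, and the sketch assumes a week runs Monday through Sunday, which the syllabus does not specify (it makes no difference for Tuesday and Thursday lectures).

```python
from datetime import date, datetime, time, timedelta

MAX_EXTENSION_DAYS = 2  # from the policy: an extension is at most 2 days


def homework_due(assigned: date) -> datetime:
    """Return 5pm on the Thursday of the week after `assigned`."""
    # Assumption: weeks run Monday (weekday 0) through Sunday (weekday 6).
    monday_of_week = assigned - timedelta(days=assigned.weekday())
    next_thursday = monday_of_week + timedelta(days=7 + 3)  # Thursday is weekday 3
    return datetime.combine(next_thursday, time(17, 0))


def latest_extended_due(assigned: date) -> datetime:
    """Return the latest possible deadline if a full extension is granted."""
    return homework_due(assigned) + timedelta(days=MAX_EXTENSION_DAYS)


# The syllabus example: work assigned Tue Feb 5 and Thu Feb 7 is due Thu Feb 14.
print(homework_due(date(2013, 2, 5)))  # 2013-02-14 17:00:00
print(homework_due(date(2013, 2, 7)))  # 2013-02-14 17:00:00
```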
Office Hours: The primary instructor will hold office hours by appointment. However, given the nature of the course, the primary instructor may not be knowledgeable about every topic covered, so you may need additional help from the lecturers to complete assignments. Be considerate and responsible in scheduling time with course instructors, and recognize that they all have busy schedules.
Course Objectives: Upon successful completion of the course, the student will be able to
1. Import data, perform simple analyses, and produce graphical displays in Stata, SAS, and R
2. Create new functions or commands in each of R, Stata, and SAS
3. Generate professional-quality scientific manuscripts and presentations using LaTeX along with statistical software
4. Perform standard power and sample size calculations using available software and simulations
5. Operate the division’s cluster computer with batch computing
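To give a flavor of the simulation-based power calculation named in objective 4, here is a minimal sketch. It is written in Python as a neutral stand-in; the course itself works in R, Stata, and SAS. The function names, the choice of a two-sided two-sample z-test with known variance, and the example numbers are all assumptions made for illustration, not course material.

```python
import random
from statistics import NormalDist


def simulated_power(n_per_group, effect, sigma=1.0, alpha=0.05,
                    n_sims=2000, seed=1):
    """Estimate the power of a two-sided, two-sample z-test by simulation."""
    rng = random.Random(seed)  # fixed seed so the estimate is reproducible
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)
    se = sigma * (2 / n_per_group) ** 0.5  # known-variance SE of the mean difference
    rejections = 0
    for _ in range(n_sims):
        control = [rng.gauss(0.0, sigma) for _ in range(n_per_group)]
        treated = [rng.gauss(effect, sigma) for _ in range(n_per_group)]
        diff = sum(treated) / n_per_group - sum(control) / n_per_group
        if abs(diff / se) > z_crit:
            rejections += 1
    return rejections / n_sims


def analytic_power(n_per_group, effect, sigma=1.0, alpha=0.05):
    """Closed-form power of the same z-test, used to check the simulation."""
    std = NormalDist()
    z_crit = std.inv_cdf(1 - alpha / 2)
    shift = effect / (sigma * (2 / n_per_group) ** 0.5)
    return std.cdf(shift - z_crit) + std.cdf(-shift - z_crit)


# Hypothetical scenario: 64 subjects per group, effect of half a standard deviation.
print(round(analytic_power(64, 0.5), 2))  # 0.81
print(simulated_power(64, 0.5))           # should land close to the analytic value
```

Comparing the simulated estimate against a closed-form answer is a useful sanity check before moving to designs where no formula is available.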
Primary Instructor: | Elizabeth Garrett-Mayer |
Website: | http://people.musc.edu/~elg26/teaching/statcomputing.2013/statcomputingI.2013.htm |
Contact Info: | Hollings Cancer Center, Rm 118G; garrettm@musc.edu (preferred mode of contact is email); 792-7764 |
Time: | Tuesdays and Thursdays, 2:00-3:30 |
Location: | Cannon 301 |
Office Hours: | By appointment. Contact via email. |
TA Office Hours: | TBA |
Lectures:
Date | Lecturer | Topic | Lecture notes, links, etc. | Homework assignment |
Tu Jan 8 | EGM | Introduction; Overview and Principles | | |
Th Jan 10 | Katherine Nicholas | SAS: introduction | | |
Tu Jan 15 | Katherine Nicholas | SAS: ODS | | |
Th Jan 17 | Ramesh | SAS: IML | | |
Tu Jan 22 | Valerie Durkalski | SAS: proc tabulate and proc report | | |
Th Jan 24 | Nate Baker | SAS: Gplot | | |
Tu Jan 29 | Renee Martin | SAS: macros | | |
Th Jan 31 | Jordan Elm | SAS: array processing | | |
Tu Feb 5 | Sybil Prince-Nelson | Designing your own website | | |
Th Feb 7 | EGM | Stata: introduction, “immediate” commands | http://www.ats.ucla.edu/stat/stata/sk/default.htm http://www.cpc.unc.edu/research/tools/data_analysis/statatutorial/index.html | |
Tu Feb 12 | EGM | Stata: graphical displays | | |
Th Feb 14 | EGM | Stata: exploratory data analysis | | |
Tu Feb 19 | EGM | Stata: regression commands | | |
Th Feb 21 | EGM | Stata: programming and do files | | |
Tu Feb 26 | EGM | Data management: principles & Excel | | |
Th Feb 28 | Amy Wahlquist | Data management: REDCap | | |
Tu Mar 5 | EGM | R: introduction to object-oriented programming | | |
Th Mar 7 | Cody Chiuzan | R: downloading packages/libraries; data input & output | | |
Tu Mar 19 | Delia Voronca | R: graphics | | |
Th Mar 21 | Georgiana Onicescu | R: basic language structure (ifelse, where, looping) | | |
Tu Mar 26 | EGM | R: exploratory data analysis; writing commands | | |
Th Mar 28 | EGM | R: regression commands | | |
Tu Apr 2 | Yanqui Weng | R: simulations; random number generation; sampling from distributions | | |
Th Apr 4 | Beth Wolf | R: Bioconductor | | |
Tu Apr 9 | EGM | Sample size calculation software packages | | |
Th Apr 11 | Adrian Nida | Cluster computing, etc. | Code Examples: | |
Tu Apr 16 | Cody Chiuzan | LaTeX and BibTeX: manuscript production | | |
Th Apr 18 | Emily Kistner-Griffin | LaTeX and BibTeX: presentations | | |
Tu Apr 23 | Betsy Hill | Reproducible Research: Sweave | Sweave.intro.student.notes.2012.pdf; Sweave.sty (style file needed to run Sweave; save this in the folder that contains the .Rnw file you will be running); carter.cls (the class file used in the presentation Sweave_intro.tex); PPRCarter.sty (the style file used in the presentation Sweave_intro.tex). Links: Sweave homepage (Leisch is the originator of the package) http://www.stat.uni-muenchen.de/~leisch/Sweave/; The Cancer Letter link http://www.bcm.edu/cancercenter/index.cfm?pmid=12886; Annals of Applied Statistics paper (scroll all the way to the bottom of the page to see the link to the Baggerly paper) http://www.imstat.org/aoas/supplements/issue_3_4.html; SASweave paper in Journal of Statistical Software http://www.jstatsoft.org/v19/i08/; StatWeave user’s manual by Russ Lenth http://www.stat.uiowa.edu/~rlenth/StatWeave/StatWeave-manual.pdf | |
Th Apr 25 | Caitlyn Ellerbe | Mendeley | | |
FINAL PROJECT | DUE MAY 3 | | | |
Computing:
Downloads and Websites:
Stata website: http://www.stata.com/
Tutorials:
R tutorial: R-intro.pdf