Workflow for Data Analysis - Scott Long
Planning, organizing and documenting :: Using Stata :: Stata Automation :: Names and labels :: Data cleaning
Analysis and presentation :: Backing up and archiving files :: Hardware and software for efficient workflow

Workflow home What's new? Additions by chapter Downloading Stata files My hardware & software Reader's comments Reader's stories Quotes Getting help Disclaimer Home

Supplementary materials and comments by chapter

Link to Chapters: One, Two, Three, Four, Five, Six, Seven, Eight, Nine

Chapter 1: Introduction

No files.

Chapter 2: Planning, organizing and documenting

(A) Spreadsheet for planning directory structure.

(B) Batch files to create directories.

(C) Spreadsheet for data registry.

(D) Research log template. Word 97-2004: doc file; Word 2007: docx file; Word 2007 template (I can't help you install Word templates) dotx file.

(E) M.C. Escher's Relativity (1953) as an illustration of how workflow in data analysis sometimes feels.

Chapter 3: Writing and debugging do-files

I highly recommend using a text editor that has syntax highlighting. Here is a syntax file for use with UltraEdit. Please check the UltraEdit documentation for information on how to install the syntax file. While I no longer use TextPad, here is the latest syntax file that I used.

Chapter 4: Automating your work: moving toward programming

The program fastcd.ado is a great program for changing working diretory. It is a more powerful, interactive version of the wd command shown in the chapter. To install, type: findit fastcd and follow the instructions.

Chapter 5: Names

No files.

Chapter 6: Cleaning your data

No files.

Chapter 7: Analyzing data and presenting results

(A) A research plan for the CWH project.

(B) The color version of the graph in Chapter 7 that prints with two similar shades of gray.

(C) Using hidden fonts in Word 2007: To make text hidden, right click on selected text, chose font, and and check the box for the Hidden effect. To control whether hidden text is displayed, go to the Word Options, chose Display. Under always show these formatting marks on the screen, chose Hidden text. Under Printing Options, select Print hidden text. Note that you can assign these operations to a keystroke using Word Options, Customize, Keyboard shortcuts.

Chapter 8: Preventing data loss

2009-01-14 Microsoft has replaced the beta program File Sync with Windows Live Sync (https://sync.live.com/welcome.aspx), part of the the Windows Live initiative. With the improvements in Live Sync, my basic backup strategy is to sync my "critical" folders on my home and office machines. My portable drive is synced once a week. I still take periodic "snapshots" to an external drive.

Chapter 9: Conclusions

No files.

Suggestions?

If you have suggestions to be added to the Workflow site, please let me know.

Tools for data analysts

If you want to know what hardware and software I use, check here. If you have suggestions for better tools, let me know.

 

© 2014 J. Scott Long    
The Workflow of Data Analysis - Principles and practice of effective data management and analysis.