| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
ORIGINAL ARTICLES |
From the Department of Veteran Affairs Health Care System, Palo Alto, California (A.H.S.H., K.H.); and Stanford University, Stanford, CA (A.H.S.H., C.E.T., K.H., J.F.).
Address correspondence and reprint requests to Alex H. S. Harris, PhD, Center for Health Care Evaluation (MC:152), VA Palo Alto Health Care System, 795 Willow Road, Menlo Park, CA 94025. E-mail: alexsox{at}sbcglobal.net
| ABSTRACT |
|---|
|
|
|---|
Methods: We conducted a randomized, controlled trial of outpatient asthmatics recruited from hospitals and the community. Of the 137 adult patients with asthma who were randomized, 117 began and 114 completed the study. Patients were randomly assigned to write for 20 minutes, once per week, for 3 weeks about stressful experiences (n = 41), positive experiences (n = 37), or neutral experiences (n = 36; control group). At baseline, postintervention, and 2-month follow up, patients were assessed by spirometry.
Results: The mean change from baseline to 2-month follow up in percentage of predicted forced expiratory volume in 1 second (FEV1) was 4.2% in the stress-writing group, 1.3% in the positive-writing group, and 3.0% in the control group. In forced vital capacity (FVC), there was 3.1% improvement in the stress-writing group, 3.6% in the positive-writing group, and 2.4% in the control group. These changes were not statistically or clinically significant.
Conclusions: The present study reduces confidence in the ability of written emotional expression to benefit the disease status of asthma patients.
Key Words: asthma experiences (events) stress written communication coping behavior treatment
Abbreviations: FEV1 = forced expiratory volume in 1 second; FVC = forced vital capacity; VAPAHCS = Veteran Affairs Palo Alto Health Care System; PTSD = posttraumatic stress disorder; COPD = chronic obstructive pulmonary disease.
| INTRODUCTION |
|---|
|
|
|---|
Encouraged by these positive results, researchers have begun to examine the effects of expressive writing on medical outcomes in clinical samples, including patients with cancer (1720), human immunodeficiency virus (21), chronic pelvic pain (22), cystic fibrosis (23), rheumatoid arthritis (24,25), irritable bowel syndrome (26), and fibromyalgia syndrome (27). The results have been mixed, with some effects being found for specific outcomes, in particular patient groups, and no effects found in other patient samples. In some cases, effects of writing have been found for healthcare utilization measures and self-reported symptoms but not objective markers of disease status (17,23). The fact that very few studies exists per patient group makes it difficult to determine if the medical characteristics of the samples or small procedural differences are responsible for the varying impact of the writing interventions. Furthermore, as is arguably appropriate in the early stages of investigation, some of these studies may capitalize on chance by examining the effects of writing on many outcomes without adjusting experiment-wide type I error rate. These issues highlight the importance of conducting multiple trails for specific patient groups, using a priori hypotheses regarding the same outcomes, and identifying the optimal procedural details of the writing interventions.
In the only published randomized trial of an expressive writing intervention with asthma patients, Smyth et al. (1) reported that, compared with writing about neutral topics, writing about stressful experiences for 20 minutes on 3 consecutive days improved the pulmonary function of patients with asthma as measured by forced expiratory volume in 1 second (FEV1). This benefit was observed 2 weeks after the writing sessions occurred and was maintained at 2 and 4 months. The fact that such a simple, brief, inexpensive, noninvasive intervention produced clinically significant and enduring changes in pulmonary function within the context of a randomized clinical trial is impressive, surprising, and somewhat controversial (28). Efforts to identify the processes mediating the effects in this study have been unsuccessful (29), and no replication studies have been published that either support or fail to support this impressive finding. If expressive writing is to be recommended to people with specific medical conditions, such prescriptions should be based on methodologically sound and replicated studies. We present the results of a clinical trial examining the effects of expressive writing on FEV1 and forced vital capacity (FVC) in adults with asthma. Because we made some procedural adjustments to the intervention with the intention of strengthening its effects and making it logistically less burdensome to participants, this trial cannot be considered a strict replication of the original study.
| METHODS |
|---|
|
|
|---|
People making inquiries were told we were "interested in determining the effects of different kinds of personal writing on pulmonary function in asthmatics," and informed as to the time commitment and procedures involved in the study. Exclusion criteria were 1) under 18 years of age; 2) not diagnosed with asthma by a physician; 3) diagnosed with chronic obstructive pulmonary disease (COPD) by a physician; 4) posttraumatic stress disorder (PTSD); and 5) unable to write for 20 minutes and comply with other expectations of study participation such as getting to once-weekly writing and assessment meetings. Participants received $50 for completing the protocol.
Procedures
Human subject approvals were obtained from both the Stanford University and VAPAHCS Institutional Review Boards. Patients provided written informed consent for randomization, assessment, and intervention procedures. Consenting patients completed questionnaires at baseline assessing demographic information and medication use, as well as measures of disease severity and control (30,31), and psychologic questionnaires (3237) to be used in future examinations of individual responses to treatment.
Intervention
Most theoretical explanations for the effects of stressful writing have assumed that the direct cognitive and/or emotional processing of the stressful experiences being written about is a necessary component to achieve the observed effects. Casting doubt on this assumption are the findings that writing about imaginary traumas (6) or about the perceived benefits of negative events (38) produce the same type of outcomes in healthy samples. These findings raise theoretical questions bearing on why expressive personal writing appears to be beneficial in some cases and increases uncertainty concerning the mechanisms through which these effects might occur. To check that the benefit imparted by writing is the result of the cognitive processing of stressful experience rather than merely deeply personal material, a positive writing condition was included in the design.
Patients were asked to write for 20 minutes, once per week, for 3 weeks on one of three randomly assigned topics: 1) stressful or traumatic experiences, 2) "positive" experiences such as events that stimulated feelings of happiness or joy, or 3) neutral topics focused on the events of the previous day (control group). Writing occurred alone in a private room. The first and third writing sessions occurred in our lab and the second session occurred at the patient's home. After each writing session, participants completed postwriting ratings of mood, somatic reactions, adherence to the task, and the perceived usefulness of their writing.
Patients were instructed not to be concerned about grammar, punctuation, or handwriting. They could write about the same experience on all three occasions or about different experiences as long as they stayed on their assigned theme. Patients were told that their essays were confidential in that we would not read the essays until their participation in the study was complete and the essays would never be linked to them personally.
Sample Size Determination
The primary comparison of interest was between the stress writing and control groups at 2-month follow up. Therefore, sample size calculations were aimed at detecting this difference. The previous study (1) found an effect size (Glass's standardized mean difference) g of 0.64 comparing patients with asthma with control subjects at 2-month follow up. Power analysis indicates that a sample size of 37 patients per group is adequate to detect differences of this size with power of 0.81, using
with two-tailed tests.
Outcome Measures
Pulmonary function, assessed by spirometry in the laboratory and adhering to the guidelines of the American Thoracic Society (39), was measured at baseline, immediately postintervention, and 2 months after writing. Primary outcomes were FEV1 and FVC.
Statistical Analysis
Main analyses compared intervention groups on changes in FEV1 and FVC from baseline to 2-month follow up, using analysis of covariance, statistically controlling for baseline levels of the dependent variable. Chi-squared analysis was used to compare groups on the frequency of clinically significant change, defined as improvement in FEV1 and FEV by 12% or greater. Other analytic approaches yielded substantively identical results.
Random Assignment and Concealment
After screening and consent procedures, patients were randomized to a writing group using computer-generated, equal-probability allocation. Assignments were kept in sealed envelopes until immediately before the first scheduled writing session, at which point study staff prepared materials specific to group assignment. Assignment was concealed from patients until after all baseline assessments were completed. Participants were not informed about the specific nature of the other writing groups and were not aware whether they were in the control or experimental conditions. There was no indication that patients or staff attempted to compromise masking procedures. Staff performing the pulmonary function assessments were not blind to the experimental condition. Statistical analyses were conducted by the first author who was not blind to group assignment.
| RESULTS |
|---|
|
|
|---|
|
Baseline Equivalence
Table 1 presents the baseline demographic characteristics of the sample of 114 completers overall and by group. Twenty-six patients (22.8%) identified themselves as veterans. Experimental and control groups did not differ (p < .05) at baseline on the vast majority of demographic characteristics (sex, ethnicity, marital status), health behaviors (asthma medication use, regular exercise), or psychologic variables (optimism, alexithymia, coping strategies, meaning in life, perceived stress). Baseline group differences did exist on the characteristics of age, education, and smoking status. Baseline disease severity did not differ between experimental and control groups on FEV1, FVC, FEV/FVC, or measures of asthma control (see Table 2).
|
|
Check of Experimental Manipulation
To ensure participants followed the instructions of their randomly assigned group and that the writing conditions were viewed as equally plausible treatments, two methods were used. First, participants completed postwriting questionnaires after each writing session to rate on a 1 to 7 scale (1 = not at all, 7 = a great deal) their postwriting mood, adherence to writing instructions, and belief that their writing task might be useful. Ratings on each item were averaged across the three administrations, and analysis of variance was used to check if differences existed between the groups on these measures. Results confirmed that patients in all groups wrote on the topic assigned, that the writing had the intended impact, and that attempts to keep participants blinded to condition were successful. For example, patients in the stress-writing group reported being significantly more upset (F = 63.17, p < .001), angry (F = 29.79, p < .001), and sad (F = 30.60, p < .001) after writing than patients in the other two groups. In rating the overall helpfulness of the writing and overall adherence to the assigned writing topic, no significant group differences were found. The average perceived benefit of the writing task across the three experimental groups was 4.3 (1.18). Second, two independent raters, blind to the assignment of the patients, classified each essay as being on 1) stressful/upsetting experiences, 2) meaning/purpose/positive experiences, 3) daily events, or 4) benefits of difficult experiences. Concordance between the classification of raters and the actual group assignment was calculated with the kappa statistic that equaled 0.84. The most common misclassifications were essays from the positive writing group that were judged to be on stressful experiences or the benefits of difficult experiences. Independent raters also judged the extent to which the essays were personal. On all three occasions, the essays from the two experimental groups were rated as significantly more personal than the essays from the neutral writing group.
Outcomes
First, we hypothesized that the stress-writing group and positive-writing groups would each show more improvement from baseline to 2-month follow up than the neutral-writing control group in FEV1 and FVC. These hypotheses were not supported. Table 2 displays the group means during the three assessment points for percent of FEV1-predicted and FVC-predicted, as well as changes from baseline to follow up. Because baseline group differences in age, education, and smoking status existed, we fit a model using these variables as covariates. Still, no group differences on main outcomes were revealed.
Second, we hypothesized that more patients who experienced clinically significant improvements from baseline to 2 months would be in the treatment groups compared with the control group. Of the several formulas for calculating percent change in pulmonary function parameters (39), we used: (FEV1time1 FEV1time2)/FEV1expected. A 12-percent rise from baseline to follow up was used as a criterion for clinically significant improvement (39). Using this method, nine of 114 patients improved FEV1 more than 12% by follow up with no significant differences between the groups. Eleven of 114 people improved their FVC 12% or more at follow up. The stress-writing group had six of these patients, the positive-writing group had five, and the control group had none. A chi-squared analysis found this to be a significant group difference (
2 = 5.98, p = .05, effect size W = 0.23). This may be noteworthy because, although FVC is more effort-dependent than FEV1, it is generally more stable and difficult to change. Three patients the stress-writing group and one patient in the positive-writing group experienced a 12% or greater decrease in FEV1 and FVC from baseline to follow up. This is not a significant group difference.
| DISCUSSION |
|---|
|
|
|---|
As noted, the present study was not a exact replication of the previous study. Several procedural differences between the studies may have produced the difference in results. Some of these differences were intentional attempts to strengthen the intervention but may have inadvertently weaken it. For example, in the present study, patients wrote once a week for 3 consecutive weeks instead of on 3 consecutive days. A metaanalysis (40) on the effects of written emotional expression failed to find a definitive relationship between number of sessions, length of each session, or spacing of sessions with physiological outcomes. However, weekly as opposed to daily writing was associated with greater overall effects in healthy samples (average of several effects such as short-term mood changes and healthcare utilization per study). Weekly spacing of the writing intervention has been used in at least one other patient sample with some success (20).
Weekly writing was chosen in this study because it appeared from this literature that greater spacing would be, at worst, inconsequential and, at best, beneficial as well as more convenient for participants. Although massed and spaced writing sessions have never been experimentally compared, this issue has been explored in the treatment of anxiety disorders as well as in the areas of memory and learning. In the treatment of fears, phobias, and agoraphobia, massed and spaced intervention schedules appear to have the same impact on symptoms immediately following treatment, but spacing is associated with lower risk of relapse (41,42). In the area of memory, massed practice is associated with equal or greater short-term performance but poorer retention and delayed performance and recall (43). To the extent that increases in insight or positive reinterpretations are the active ingredient in expressive writing interventions, these cognitions are more likely to be recalled. From these findings, we might expect massed expressive writing to have a greater effect (both positive and negative) immediately after writing, but for spaced expressive writing to have more lasting impact. Most empirically supported trauma-exposure protocols for the treatment of posttraumatic stress disorder (PTSD) take place in weekly sessions (44), but direct comparisons to massed sessions are rare or nonexistent. Greenberg et al. (6) did show significant effects using a single 30-minute writing session. Even so, it is possible that this procedural change was partially responsible for our failure to find the effect found in the previous study.
Another difference between the present study compared with Smyth et al. was the use of both lab and home-based writing. In the present study, participants wrote in the lab on the first and third session, and at home for their second writing session. Examination of the postwriting questionnaires and ratings of the essays showed that the home-written essays were as long as, as personal and evocative, and as adherent to instructions as the lab-written essays. In fact, the only effect of setting observed was that the home-based session evoked more anger in the stress group than did the other two sessions. Therefore, we have no evidence that the difference in writing locations dramatically affected the nature or impact of the writing. Still, the possibility exists that this procedural difference was responsible for the failure to observe significant effects in the present study.
It is unknown to what extent the patients who volunteered and were accepted into this study are representative of all adult patients with asthma. We intentionally used minimal screening criteria beyond a diagnosis of asthma, absence of PTSD, and an ability to write to maximize the generalizability of the results. The previous study excluded people for ongoing psychotherapy or psychiatric disorder, taking medications that interfere with symptom report (eg, that are mood-altering), or taking more than 10 mg of prednisone daily. These differences in screening criteria may partially explain the different results between studies and may affect the (unknown) extent to which the results of each study generalize to all asthmatics.
The extent to which patients believe writing might improve asthma is important in comparing groups within particular studies (blinding) and also in comparing results between studies. Recruiting materials may shape patients' expectations regarding the usefulness of the writing task, therefore inducing more or less of a placebo effect. Indeed, our recruiting material explicitly stated that we were "interested in determining the effects of different kinds of personal writing on pulmonary function in asthmatics" and may have increased the perceived usefulness of the task. As indicated by the manipulation check, the perceived usefulness of the writing did not vary by experimental condition, but may have been affected overall by the way the study was framed. Also, informally we learned that some of our patients were aware that writing had been shown in previous studies to have health benefits. We did not specifically assess or record such knowledge or preconceptions. The previous study predated much of the press and popular belief regarding the purported health benefits of expressive writing. Also, the recruiting language used was much more general, inviting patients "to participate in a study of your daily experiences of illness." We would have expected these differences in factors shaping positive expectations for the writing intervention to favor a greater response in the present study. Indeed, the small improvements in all three experimental groups in this study may have been partial caused by these factors.
Finally, the average patient in this study had somewhat less severe asthma than the patients in the Smyth et al. study. The baseline average (SD) FEV1/FEV1-predicted was 0.73 (0.18) in this study compared with 0.64 (0.20) in the previous study. Our sample may have failed to respond because they had better pulmonary function at baseline and therefore had less room to improve (ceiling effect). To examine this possibility, all the major analyses were conducted on the subgroup of people (n = 59) who started the study with less than 75% of predicted FEV. This sample was evenly distributed among the experimental groups and had a mean (SD) FEV1/FEV1-predicted of 0.67 (0.13). Still, there were no significant group differences (or interesting trends) on the major outcomes.
The sample in the present study had a somewhat lower percentage of females (57% vs. 73%) and, as mentioned previously, higher baseline FEV1 than the previous study. Otherwise, the samples had strikingly similar demographics. For example, for the present study compared with Smyth et al., ages were 43.15 (17.67) versus 41.2 (17.4) years, and the percent white was 73.7 versus 73.3. Use of asthma medications and degree of illness control may have differed between samples, but data to make comparisons are not available.
Although this study used an experimental design with careful controls, several limitations exist that must be considered in interpreting the results. First, staff who made the assessments of FEV1 and FVC were not blind to the patients' group assignments, primarily as a result of financial constraints. Even though standardized instructions and procedures were used in obtaining these measurements, it is possible that lack of double-blinding influenced the results. Second, the evaluation of the interventions was based on changes from baseline to 2-month postintervention, not on more regular, therefore reliable, assessments. A better approach would have been to assess patients monthly for a longer duration. Third, the sample was very heterogeneous with respect to asthma severity, medication use, and other characteristics limiting our power to detect effects, should they exist, in specific subgroups. Also, although all patients were required to have a history of asthma diagnosed by a physician and produce evidence of reduced pulmonary function at baseline, we were unable to check medical records for evidence of reversibility for all patients.
In conclusion, we failed to find evidence that either stress-based or positive expressive writing, in the manner in which we implemented the intervention, can improve the pulmonary function of adult asthmatics. Future studies would benefit from assessing pulmonary function on more occasions and more carefully documenting the use of asthma medications. The general public has enthusiastically taken notice of research and claims that expressive writing leads to better health. Until more supportive evidence can be produced, expressive writing should not be recommended as a means for improving pulmonary function in patients with asthma.
The authors thank C. Barr Taylor, MD, Ruth Cronkite, PhD, Thomas Ritz, PhD, Rudolf Moos, PhD, and Harly Neumann, MS, for important feedback, material support, and encouragement.
| NOTES |
|---|
|
|
|---|
This work was funded by fellowships from the Department of Veteran Affairs Office of Academic Affiliations to the first author and the Fetzer Institute, Kalamazoo, Michigan, to Dr. Thoresen.
DOI:10.1097/01.psy.0000146345.73510.d5
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
D. Holmes, G. W. Alpers, T. Ismailji, C. Classen, T. Wales, V. Cheasty, A. Miller, and C. Koopman Cognitive and Emotional Processing in Narratives of Women Abused by Intimate Partners Violence Against Women, November 1, 2007; 13(11): 1192 - 1205. [Abstract] [PDF] |
||||
![]() |
C. S. Mackenzie, U. J. Wiprzycka, L. Hasher, and D. Goldstein Does Expressive Writing Reduce Stress and Improve Health for Family Caregivers of Older Adults? Gerontologist, June 1, 2007; 47(3): 296 - 306. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Ritz, A. Steptoe, C. Bobb, A. H. S. Harris, and M. Edwards The Asthma Trigger Inventory: Validation of a Questionnaire for Perceived Triggers of Asthma Psychosom Med, November 1, 2006; 68(6): 956 - 965. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. J. Warner, M. A. Lumley, R. J. Casey, W. Pierantoni, R. Salazar, E. M. Zoratti, R. Enberg, and M. R. Simon Health Effects of Written Emotional Disclosure in Adolescents with Asthma: A Randomized, Controlled Trial J. Pediatr. Psychol., July 1, 2006; 31(6): 557 - 568. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |