|
EVALUATING THE "National Demonstration Study" Bradley, R.T., McCraty, R., Atkinson, M., Arguelles, L., Rees, R.A. & Tomasino, D. (2007). Reducing Test Anxiety and Improving Test Performance in America’s Schools: Results from the TestEdge National Demonstration Study. Boulder Creek, CA: Institute of HeartMath. In this study, the subjects were tenth-grade students at two high schools in Northern California. At one school, 602 students received HeartMath training. At the other school, described as the “control group,” 332 students were not trained. The HeartMath TestEdge® program included practicing the Freeze-Frame technique and trying to “re-experience” positive emotions. The students did so while using the computerized “emWave® PC Stress Relief System” that provides biofeedback. When reporting the students’ test scores, the researchers’ summary says: “In four matched-group comparisons (involving subsamples of 50 to 129 students) there was a significant increase in test performance in the experimental group over the control group, ranging on average from 10 to 25 points.” (p. 6) In the sections that follow, I discuss the weaknesses in that claim. WHEN A "CONTROL GROUP" IS NOT REALLY A CONTROL GROUP This study did not have random assignment of individual subjects. However, using a quasi-experimental procedure, the researchers randomly assigned two schools: large, naturally-occurring groups of subjects. This is a legitimate research method, but it requires the collection of extra data to assess the similarity of the natural groups. The danger is that the schools could have pre-existing differences that cause students to perform differently. In addition to the lack of random assignment of individual subjects, and the apparent nonequivalence of the two schools, the study did not have “blind” observers. So, this study had all three of the major problems described on my other page. Also, the researchers used some dubious statistical procedures when analyzing the results. STATISTICAL ISSUES Attrition: Dealing with Dropouts Despite the fact that the study lasted only four months, many students who started the study did not finish it. At the school using HeartMath, 602 started but only 488 finished--an attrition rate of 19.1%. At the other school, 332 started the study but only 261 finished--an attrition rate of 21.4%. The large number of dropouts means that the populations tested at the end of the study may have been significantly different from the populations tested at the beginning. Post Hoc Analysis: Creating Sub-Samples The researchers ran many statistical analyses of students’ test scores. They reported the results of their first analysis this way: “However, the results of an ANCOVA (not shown) for all students in the intervention school did not find evidence of a relationship between the frequency of use of the TestEdge tools and pre-post intervention test anxiety reduction. There was also no evidence of a relationship between student use of the tools and 10th grade CST ELA test performance [the California Standardized Test of English-Language Arts].” (119) In other words, when they first looked at the data for all the students, they did not find evidence that HeartMath techniques reduced test anxiety or increased scores on a standardized English test. “. . . with the exception of the CST English-Language Arts test, which appeared to be administered universally on a standardized basis to all students in both 9th and 10th grades, and thus met our need for a repeated measures format, a number of unanticipated complications prevented our use of much of the CST data.” (87) The problem was that the CST is actually a collection of tests in various subjects, and different students took different combinations of these tests. Students did not always take the same tests in the 9th and 10th grades, and some students at the two schools took different combinations of tests. “For example, in the 9th grade 91% of the experimental group took Earth Science while 85% of the control group took Biology; in the 10th grade most of the experimental group took Biology while the control group took Chemistry. This meant that the CST Science scores could not be compared and thus were unusable.” (87) The good news, mentioned above, was that all students took the English tests, and so their pre- and post-HeartMath Englishscores could be compared. The researchers also got creative and found a way to use math scores of some of the students: “. . . we found a notable subset of 183 students (121 in the experimental group and 62 in the control group) who all took Geometry in the 9th grade and who also all took Algebra 2 in the 10th grade. In the analyses that follow in a later section, this group of students is referred to as Math Group 1.” (87) I don’t know what math faculty may think about equating performance in geometry with performance in algebra, but let’s say that this was a reasonable way to select students who had both a pretest and a posttest in math. Unfortunately, the group of math students at the HeartMath school was almost twice as large as the group at the other school. As I mentioned earlier, large differences in the sizes of groups makes statistical analyses very problematic. There’s another problem: when the researchers report the results of the analyses of the “Math 1” groups, the numbers change:“Moving to the results for Math Group 1 . . . Altogether, there was a total of 129 students in this sub-sample, of whom 69 (53.5%) were in the experimental group and 60 (46.5%) were in the control group.” (157) Somehow, the 183 students with math scores became 129 students; the 121 of them at the HeartMath school became 69; and the 62 students at the other school became 60. It’s true that the sizes of the groups became similar—and this is good—but there is no explanation in the report of how the researchers chose to use these students in their analysis and excluded the rest of the students. Discrepancies such as this raise further doubts about the meaning of the reported results. To return to the discussion of claims about HeartMath, click here. © 2009 David Douglass
|
|
|
|