Exercise
on data from article (or author of)
* Impact of Folic Acid Fortification of the US Food Supply on the Occurrence of Neural
Tube Defects
* Incidence of open neural tube defects in Nova Scotia after folic acid fortification
I | [USA data] Restrict your comparison to spina bifida. | |
a | Formally compare the observed rates pre and post (exclude data from the transition period). Do the same but using the numbers of cases [and denominators] explicitly. What is your best estimate (and 95% CI) of the difference that might be attributed to the fortification? | |
b | Delevop (and fit) a single incidence model that covers all 3 periods and that that allows you to estimate the changes in incidence over these 3 periods. In a deluxe model [not required for this exercise!] you would probably like to 'round the corners'. For this exercise, as a rough approximation consider a simpler one with 3 (connected) straight lines (or curves if you use a rate-ratio model). This is as much an exrecise in how to represent 3 connected straight lines as it is about rates and counts. | |
c | Is there evidence of over-dispersion (ie extra-binomial or extra-Poisson variation)? [the difference between binomial and Poisson is trivial here, since the binomial approaches the Poisson in the limiting case when, as here, the probability of an event is low]. | |
d | Does it change your estimates very much whether you use an additive (rate difference)
or a multiplicative (rate ratio) model? Does it matter very much if you use monthly or quarterly data? |
|
e | If we had the corresponding data for Canada (1/10th the population and numbers of births, with say with same incidence and same source of data), how much wider would the CI's be? | |
f | Estimate, for various size reductions in the incidence, the statistical power of a study that -- with an alpha of 0.05 2-sided -- compares (in the absence of other time trends) Quebec incidence in the 5 years pre with the incidence in the 5 years post introduction of fortification. Do so (a) without considering overdispersion, so that a regular Binomial or Poisson model is appropriate and (b) with the same amount of over-dispersion as is seen in the 'pre' series in the US. [of course if there is overdispersion, and it is ignored in the analysis, it tends to increase both the power and the probability of a type I error! [we won't know which!]. | |
II | [Nova Scotia data] The Results section of the paper gives several point estimates, confidence intervals, and p-values. the authors did not have a statistical analysis paragraph so you will have to try to determine how they calculated these items -- or come up with your own. | |
a | Calculate these items by 'tabular' (i.e., calculator) methods. Should you/they worry about the appropriateness of Gaussian-based inferences? [Remember that the chi-square statistic with 1 df is the same as the square of a z-statistic.. so whatever is accurate or inaccurate about one is equally so about the other] |
|
b | Calculate these items by 'regression' methods. Again, is use of Gaussian-based inferences OK? | |
III | Why do you think the estimates of impact differ so much between the USA and Nova Scotia? What soes this say about any concerns you had in in IIf above? |
jh 2002.10.08