Display:
I'd be very interested in seeing that graph, if you can dig it out. That being said, from what you've presented so far it doesn't actually sound like the fault was the data format (bar graphs strike me as an entirely appropriate way of displaying that kind of data - it's a comparison, which is what bar graphs are good for remember?) so much as the data range. Leaving out inconvenient parts of the data set is another truly classic way of scamming with numbers. (As an aside, truncating data sets has always struck me as something of an exercise in black magic even at the best of times, and that's when everyone involved is being honest!)

The way these guys you're talking about picked out a single month for comparison, however, is precisely the kind of dishonest highlighting I was talking about in the first installment. When doing data analysis, you are not permitted to pick and choose single points and compare them to other single points or to peaks or whatever, because every data set has outliers and random fluctuations, and there is zero guarantee that the point you decide to pick is actually representative of anything.

A couple of general things to keep in mind with the casualty data from Vietraq is that it fluctuates somewhat from month to month (that's why I'd prefer to use a scatterplot rather than a bar graph myself) and it also depends on operational posture, number of troops in the field, etc. It seems like a reasonable proxy for how well the war is going, but one should be careful not to take it too far - after all, the Americans could simply sit in their compounds and get everything they need by air, and casualties would plummet. They would also, however, lose the war that way (to any extend that it isn't already lost, that is).

I'm tired and going to bed now, but I may return tomorrow (well, later today, technically) with some plots of data on Vietraq casualties over time.

- Jake

If you only spend 20 minutes of the rest of your life on economics, go spend them here.

by JakeS (JangoSierra 'at' gmail 'dot' com) on Thu Oct 18th, 2007 at 09:27:44 PM EST
[ Parent ]
I don't know about you, but to me this looks like an upwards trend:

FWIW, when GNUPLOT fits a first order polynomial to these data, it gives an upwards slope that is more than two asymptotic standard errors greater than zero. While this is not a proper statistical significance analysis, it does show that calling it an upwards trend is not grossly misleading.

- Jake

If you only spend 20 minutes of the rest of your life on economics, go spend them here.

by JakeS (JangoSierra 'at' gmail 'dot' com) on Fri Oct 19th, 2007 at 09:50:26 AM EST
[ Parent ]
Oh, forgot to tell what's on the axes and where I got the data: x axis is the month, with the first full month after invasion being labeled 1 and excluding the first and last month of war since they are incomplete. The y axis is the number of casualties during that month.

Data source.

Apologies for the double post.

- Jake

If you only spend 20 minutes of the rest of your life on economics, go spend them here.

by JakeS (JangoSierra 'at' gmail 'dot' com) on Fri Oct 19th, 2007 at 09:54:05 AM EST
[ Parent ]
I'd have some trouble fitting a downwards curve to that data ....

Day by day data might draw an interesting curve.

by Colman (colman at eurotrib.com) on Fri Oct 19th, 2007 at 10:01:14 AM EST
[ Parent ]
I'd have some trouble fitting a downwards curve to that data ....

Not if you were with the American Enterprise Institute :-P

Day by day data might draw an interesting curve.

Nah, random scatter would obscure any trend if you go to that resoluation.

I tried running two- and three-month running averages, but that doesn't make things a lot prettier, so I decided to just go ahead an post the raw data. This leads me to believe that a resolution of about one month (maybe you could push it down to two weeks, or even one week, but I don't think you could go much further) is about optimal as far as grouping goes.

Of course, one could use running averages (i.e. have a data point for each day that represents the average deaths pr. day for the 30-day period leading up to the point). But I'm not convinced that there is sufficient additional information to be obtained by doing so to justify the bother.

- Jake

If you only spend 20 minutes of the rest of your life on economics, go spend them here.

by JakeS (JangoSierra 'at' gmail 'dot' com) on Fri Oct 19th, 2007 at 10:47:51 AM EST
[ Parent ]

Not saying it's meaningful though ;-)

the subtle mathematical technique used is called drawing a random line that shows the message you intend  and bluffing competence.  I think it's the same technique used in the original graph in the story.

Any idiot can face a crisis - it's day to day living that wears you out.

by ceebs (ceebs (at) eurotrib (dot) com) on Fri Oct 19th, 2007 at 10:59:16 AM EST
[ Parent ]
In the previous plot of casualties from Vietraq, I accidentally only displayed data going up to month 35 (but the fit was from the full data set). Full plot here:

Unfortunately, the trend only becomes clearer if you include all the data...

Apologies all around for lack of proofreading.

- Jake

If you only spend 20 minutes of the rest of your life on economics, go spend them here.

by JakeS (JangoSierra 'at' gmail 'dot' com) on Fri Oct 19th, 2007 at 11:03:53 AM EST
[ Parent ]
Is that the same data set from iCasualties.org that this graph uses?

Iraqi Deaths By Year (iCasualties.org as of 2007/10/24)

Truth unfolds in time through a communal process.

by marco (cowannar at gmail punkt com) on Wed Oct 24th, 2007 at 09:06:13 AM EST
[ Parent ]
It's the same source, so If they use their own data, it is. But their graph only goes back two and a half years, and I disagree with the way they've presented the data (not that it's wrong, but I'm not sure it's particularly meaningful to do month-by-month comparisons - you'd normally do that if you thought that there was a reproducible pattern that depended on time of year, and it doesn't look that way to me), so I made my own to cover the entire war and occupation.

- Jake

If you only spend 20 minutes of the rest of your life on economics, go spend them here.

by JakeS (JangoSierra 'at' gmail 'dot' com) on Wed Oct 24th, 2007 at 03:49:02 PM EST
[ Parent ]

Display:
Login
. Make a new account
. Reset password
Occasional Series