Presidential Debate Analysis
Whenever I watch a televised debate, I always wonder what percentage of the speaker’s message is actually thinking on the feet and how much is canned material. With the advent of available transcripts, these sorts of questions can be addressed with various computational methods.
A simple way to identify repeated statements is to count the number of times a particular noun phrase is metioned. Noun phrases act as both a proxy to the subject matter of a given piece of text, but also the way in which things are worded.
For this simple experiment, we’ll need four tools:
- The transcript (simplified from the original)
- Lingua::EN::Tagger, an English Part-of-speech tagger written in Perl
- phrases.pl, a perl script to parse the document and extract the noun phrases
- Debate Spotter, an interactive interface to visualize the results
The results are quite interesting. Looking only at noun phrases of at least 2 words occuring at least twice for a given speaker, we arrive at some spectacular catch phrases. For Bush my favorite is “hard work,” which he said repeatedly. Apparently Bush thinks that the world is a difficult place to be. For Kerry, a salient phrase was “war as a last resort.”
The top 25 phrases for Bush and Kerry follow. The number following each phrase is a rank described by the length of the phrase and the number of times it appeared.
There are so many other types of analysis that could be run on these data. If you find anything interesting, please let me know. Also, the Debate Spotter allows for any query, so post any interesting phrases that you find.
Update: I have also analyzed the Vice Presidential and the Second Presidential debates.
Bush
free iraq (14),
hard work (13),
wrong war at the wrong place at the wrong time (13),
wrong war at the wrong time at the wrong place (12),
north korea (10),
kim jong il (10),
my opponent (9),
american people (8),
same intelligence (8),
prime minister allawi (8),
best way (7),
free afghanistan (7),
world a more peaceful place (7),
mixed messages (7),
iraqi citizens (6),
al qaida (6),
weapons of mass destruction (6),
dynamics on the ground (6),
breach on the agreement (6),
end of this year (6),
grave threat (6),
matter of fact (5),
cannot lead (5),
grand diversion (5),
wrong signals (5)
Kerry
saddam hussein (14),
north korea (14),
nuclear weapons (10),
weapons of mass destruction (9),
osama bin (9),
united nations (9),
war as a last resort (9),
american people (8),
90 percent of the casualties (7),
nuclear proliferation (7),
remedies of the united nations (7),
90 percent of the costs (7),
united states of america (7),
homeland security (7),
mountains of tora bora (6),
10 active duty divisions (6),
different set of convictions (6),
four years (6),
president bush (6),
president of south korea (6),
strong alliances (6),
two years (5),
secretary of state (5),
tax cut (5),
bilateral talks (5)


