home > results and commentary > Trump vs. Biden (1st)

Word Analysis of 2020 U.S. Presidential Debates

Donald Trump vs. Joe Biden (1st debate)

29 September 2020



Introduction

This was a very chaotic debate. Let's get into it.

Speaking Turns and Interruptions

Here, I look at the length of each turn of uninterrupted speech.

Table 1
length of sections in words
The number of uninterrupted deliveries (sections), mode/median/mean length of sections in words, and the shortest section length in words that composed 10%, 50% and 90% of the debate.
speaker sections section length debate contiguity (L10 L50 L90)
Donald Trump
313
313
4.0 9.0 24.3
4.0009.0000000024.319
9 83 165
9.00083.000165.000
Joe Biden
249
249
5.0 8.0 27.4
5.0008.0000000027.361
10 92 191
10.00092.000191.000

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 1
legend
a b c
51025

a — section length (mode), shortest section length in 10% of debate

b — section length (median), shortest section length in 50% of debate

c — section length (mean), shortest section length in 90% of debate

bar — proportion of a:b:c

Table 1
commentary

Trump spoke more often than Biden +25.7% (313 vs 249) and the length of Trump's deliver was also slightly shorter when looking at the mode –20.0% (4 vs 5) and median –11.3% (24.3 vs 27.4) section length. The contiguity of the debate was low — we see that 50% of Trump's entire delivery was in sections shorter than 83. For Biden, 50% of his delivery was in sections shorter than 92 words.

Flesch-Kincaid Reading Ease and Grade Level

The Flesch-Kincaid reading ease and grade level metrics are designed to indicate how difficult a passage in English is to understand.

This metric does not take repetition into account. A grade level 10 sentence that is repeated 100 times still generates the same metrics because the words per sentence and syllables per word remain constant. To measure how many times a speaker repeats themselves, I use my Windbag Index, below.

Reading ease ranges from 100 (easiest) down to 0 (hardest) and can be interpreted as follows

100 –905th gradeVery easy to read. Easily understood by an average 11-year-old student.
90 – 806th gradeEasy to read. Conversational English for consumers.
80 – 707th gradeFairly easy to read.
70 – 608th & 9th gradePlain English. Easily understood by 13- to 15-year-old students.
60 – 5010th to 12th gradeFairly difficult to read.
50 – 30collegeDifficult to read.
30 – 10college graduateVery difficult to read. Best understood by college/university graduates.
10 – 0professionalExtremely difficult to read. Best understood by college/university graduates.

The grade level corresponds roughly to a U.S. grade level. It has a minimum value of –3.4 and no upper bound.

Two sets of readability scores are calculated. One for the entire debate and one that only considers section with at least 9 words.

Table 2a
readability — entire debate
Flesch-Kincaid reading ease and grade level.
speaker grade level reading ease sections sentences words syllables
Donald Trump
3.60
100.0%
3.60
85.88
100.0%
85.88
313
100.0%
313
815
100.0%
815
7,612
100.0%
7612
10,030
100.0%
10030
Joe Biden
4.27
118.6%
4.27
82.74
96.3%
82.74
249
79.6%
249
664
81.5%
664
6,813
89.5%
6813
9,155
91.3%
9155

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 2b
readability — excluding short sections
Flesch-Kincaid reading ease and grade level for sections with at least 9 words.
speaker grade level reading ease sections sentences words syllables
Donald Trump
4.02
100.0%
4.02
84.56
100.0%
84.56
194
100.0%
194
695
100.0%
695
7,141
100.0%
7141
9,441
100.0%
9441
Joe Biden
4.91
122.1%
4.91
80.81
95.6%
80.81
129
66.5%
129
540
77.7%
540
6,353
89.0%
6353
8,567
90.7%
8567

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 2
legend
a
b
30

a — value for candidate

b — value relative to Donald Trump

bar — proportion of a

Table 2
commentary

Trump's grade level was –15.7% (3.6 vs 4.27) lower than Biden's, if we consider the entire debate. For sections of at least 9 words in length (this attempts to exclude short interruptions) his grade level was –18.1% (4.02 vs 4.91) lower. There is less of a spread in the reading ease for both speakers — both scored at least 80.

Sentence Size

Table 3
sentence size
Number of sentences spoken by each speaker and sentence word count statistics. Number of words in a sentence is shown by average and 50%/90% cumulative values for all, stop and non-stop words.
speaker number of sentences sentence size
all stop non-stop
Donald Trump
815
815
9.5 12 29
9.45812.00029.000
5.4 7 18
5.3957.00018.000
4.1 5 12
4.0635.00012.000
Joe Biden
664
664
10.4 15 34
10.35115.00034.000
6.0 9 20
6.0329.00020.000
4.3 6 16
4.3196.00016.000
total
1,479
1479
11.9 14 33
11.85914.00033.000
7.7 9 20
7.6819.00020.000
6.2 7 14
6.1787.00014.000

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 3
legend
a b c
51025

a — average sentence size

b — largest sentence size for 50% of content

c — largest sentence size for 90% of content

bar — proportion of a:b:c

Table 3
commentary

Biden had marginally longer sentences, on average, but in as messy a debate as this was, this metric is very sensitive to how interruptions were transcribed.

Word Statistics

Debate Word Count

Summary Word Count

The summary word count reports the total number of words and the number of unique, non-stop words used by each candidate. Word number is expressed as both absolute and relative values.

Table 4a
all words
Number of all words and unique words used by each speaker.
set word count
Donald Trump
7,708 1,131
52.9% 14.7%
65771131
Joe Biden
6,873 1,181
47.1% 17.2%
56921181
total
14,581 1,743
100.0% 12.0%
128381743

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 4b
exclusive and shared words
Words exclusive to speaker (e.g. speaker A but not speaker B) and shared by speakers (speaker A and B).
set word count
Donald Trump
1,004 562
13.0% 56.0%
442562
Joe Biden
930 612
13.5% 65.8%
318612
both candidates
12,647 569
86.7% 4.5%
12078569

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 4
legend
a c
b d
3010

a — word count

b — word count, as fraction in total in debate

c — unique words in (a)

d — unique words in (a), as fraction in (a)

bar — proportion of (a-c):c

Table 4
commentary

Trump delivered +12.1% (7,708 vs 6,873) more words but had –4.2% (1,131 vs 1,181) fewer unique words. Trump spoke 1,004 words that Biden did not use but in this set there were –8.2% (562 vs 612) fewer unique words than in Biden's set of 930 words that Trump did not use.

Stop Word Contribution

In the table below, the candidates' delivery is partitioned into stop and non-stop words. Stop words (full list) are frequently-used bridging words (e.g. pronouns and conjunctions) whose meaning depends entirely on context. The fraction of words that are stop words is one measure of the complexity of speech.

Table 5a
non-stop words
Counts of stop and non-stop words.
speaker all words stop words non-stop words
Donald Trump
7,708 1,131
100.0% 14.7%
65771131
4,397 134
57.0% 3.0%
4263134
3,311 997
43.0% 30.1%
2314997
Joe Biden
6,873 1,181
100.0% 17.2%
56921181
4,005 135
58.3% 3.4%
3870135
2,868 1,046
41.7% 36.5%
18221046
total
14,581 1,743
100.0% 12.0%
128381743
8,402 145
57.6% 1.7%
8257145
6,179 1,598
42.4% 25.9%
45811598

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 5b
exclusive and shared non-stop words
Non-stop words exclusive to speaker (e.g. speaker A but not speaker B) and shared by speakers (speaker A and B).
set word count
Donald Trump
985 552
29.7% 56.0%
433552
Joe Biden
907 601
31.6% 66.3%
306601
both candidates
4,287 445
69.4% 10.4%
3842445

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 5
legend
a c
b d
3010

a — total number of words, for a given category (all, stop, non-stop)

b — (a) relative to words in the debate if category=all, otherwise relative to words by the candidate

c — number of unique words with set (a)

d — (c) relative to (a)

bar — proportion of (a-c):c

Table 5
commentary

Both candidates used roughly the same fraction of stop words, 57.0% for Trump and 58.3% for Biden.

Word frequency

The word frequency table summarizes the frequency with which words were used. I show the average word frequency and the weighted cumulative frequencies at 50 and 90 percentile. The average word frequency indicates how many times, on average, a word is used. For a given fraction of the entire delivery, the weighted cumulative frequency indicates the largest word frequency within this fraction (details about weighted cumulative distribution).

Table 6a
word use frequency
Average and 50%/90% percentile word frequencies.
speaker word frequency
all stop non-stop
Donald Trump
6.8 31 189
6.81531.000189.000
32.8 101 261
32.813101.000261.000
3.3 6 40
3.3216.00040.000
Joe Biden
5.8 27 174
5.82027.000174.000
29.7 94 222
29.66794.000222.000
2.7 4 38
2.7424.00038.000
total
8.4 54 379
8.36554.000379.000
57.9 179 392
57.945179.000392.000
3.9 8 62
3.8678.00062.000

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 6b
exclusive and shared non-stop word use frequency
Average and 50%/90% cumulative percentile word frequencies. Non-stop words exclusive to speaker (e.g. speaker A but not speaker B) and shared by speakers (speaker A and B).
set word frequency
Donald Trump
1.78 2 6
1.7842.0006.000
Joe Biden
1.51 2 4
1.5092.0004.000
total
3.87 8 62
3.8678.00062.000

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 6
legend
a b c
51025

a — average word frequency

b — largest word frequency in 50% of content

c — largest word frequency in 90% of content

bar — proportion of a:b:c

Table 6
commentary

Trump used the same words more often. His average word frequency was +17.2% (6.8 vs 5.8) higher than Biden. When considering words unique to a candidate, Trump repeated these +17.9% (1.78 vs 1.51) more often than Biden did for words that Trump didn't use.

All further word use statistics represent content that has been filtered for stop words, unless explicitly indicated.

Part of Speech Analysis

In this section, word frequency is broken down by their part of speech (POS). The four POS groups examined are nouns, verbs, adjectives and adverbs. Conjunctions and prepositions are not considered. The first category (n+v+adj+adv) is composed of all four POS groups.

Part of Speech Count

Table 7
part of speech count
Count of words categorized by part of speech (POS).
part of speech
n+v+adj+adv nouns (n) verbs (v) adjectives (adj) adverbs (adv)
Donald Trump
3,042 934
39.5% 30.7%
85151284027520214415266
1,363 512
44.8% 37.6%
851512
1,115 275
36.7% 24.7%
840275
346 144
11.4% 41.6%
202144
218 66
7.2% 30.3%
15266
Joe Biden
2,695 991
39.2% 36.8%
77253061832016114510049
1,302 530
48.3% 40.7%
772530
938 320
34.8% 34.1%
618320
306 145
11.4% 47.4%
161145
149 49
5.5% 32.9%
10049
total
5,737 1,509
39.3% 26.3%
1820845157947442522728186
2,665 845
46.5% 31.7%
1820845
2,053 474
35.8% 23.1%
1579474
652 227
11.4% 34.8%
425227
367 86
6.4% 23.4%
28186

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 7
legend
a c
b d
1535

a — total number of words for a given POS (all, noun, verb, adjective, adverb)

b — (a) relative to all words by candidate

c — unique words in (a)

d — (c) relative to (a)

bar — proportion of (a-c):c

Table 7
commentary

Fewer of Trump's words were nouns than Bidens by –7.2% (44.8 vs 48.3) but proportionately he used more adverbs by +30.9% (7.2 vs 5.5).

Part of Speech Frequency

Table 8
part of speech frequency
Frequency of words categorized by part of speech (POS).
part of speech frequency
n+v+adj+adv nouns (n) verbs (v) adjectives (adj) adverbs (adv) pronouns (pro)
Donald Trump
3.26 6 40
3.2576.00040.000
2.66 4 20
2.6624.00020.000
4.05 12 115
4.05512.000115.000
2.40 3 11
2.4033.00011.000
3.30 7 25
3.3037.00025.000
31.13 135 284
31.130135.000284.000
Joe Biden
2.72 4 38
2.7194.00038.000
2.46 4 34
2.4574.00034.000
2.93 5 130
2.9315.000130.000
2.11 3 17
2.1103.00017.000
3.04 6 24
3.0416.00024.000
24.57 94 167
24.57194.000167.000
total
3.80 8 62
3.8028.00062.000
3.15 6 42
3.1546.00042.000
4.33 13 245
4.33113.000245.000
2.87 4 19
2.8724.00019.000
4.27 10 49
4.26710.00049.000
49.31 249 392
49.306249.000392.000

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 8
legend
a b c
51025

a — average word frequency

b — largest word frequency in 50% of content

c — largest word frequency in 90% of content

bar — proportion of a:b:c

Table 8
commentary

We've already seen that Trump repeated his words more often. Here, we see that the largest increase in repetition over Biden was in his verbs, which he repeated +38.2% (4.05 vs 2.93) more times, on average. He also repeated his pronouns +26.7% (31.13 vs 24.57) more often.

Part of Speech Pairing

Through word pairing, I extract concepts from the text. The number of unique word pairs is a function of sentence length and is one of the measures of complexity.

Table 9a
part of speech pairing — Donald Trump
Word pairs (total and unique) categorized by part of speech (POS)
part of speech pairings - Donald Trump
noun verb adjective adverb
noun
146 107
  73.3%
39107
verb
59 54
  91.5%
554
7 7
  100.0%
07
adjective
178 140
  78.7%
38140
1 1
  100.0%
01
15 14
  93.3%
114
adverb
3 3
  100.0%
03
38 27
  71.1%
1127
11 11
  100.0%
011
9 8
  88.9%
18

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 9b
part of speech pairing — Joe Biden
Word pairs (total and unique) categorized by part of speech (POS)
part of speech pairings - Joe Biden
noun verb adjective adverb
noun
149 105
  70.5%
44105
verb
66 57
  86.4%
957
6 5
  83.3%
15
adjective
152 128
  84.2%
24128
4 3
  75.0%
13
6 6
  100.0%
06
adverb
1 1
  100.0%
01
31 27
  87.1%
427
7 6
  85.7%
16
2 2
  100.0%
02

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 9c
unique part of speech pairing — candidate comparison
Unique word pairs categorized by part of speech (POS)
unique part of speech pairings
noun (n) verb (v) adjective (adj) adverb (adv)
noun
107 105
  98.1%
107
105
verb
54 57
  105.6%
54
57
7 5
  71.4%
7
5
adjective
140 128
  91.4%
140
128
1 3
  300.0%
1
3
14 6
  42.9%
14
6
adverb
3 1
  33.3%
3
1
27 27
  100.0%
27
27
11 6
  54.5%
11
6
8 2
  25.0%
8
2

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 9 a,b
legend
a c
  d
3010

a — total number of pairs, for a given category (e.g. verb/noun)

c — number of unique pairs within set (a)

d — (c) relative to (a)

bar — proportion of (a–c):c

Table 9c
legend
a c
  d
50
45

a — unique pairs for Donald Trump

c — unique pairs for Joe Biden

d — (c) relative to (a) (i.e. Joe Biden relative to Donald Trump)

bars — (a) and (c)

Table 9
commentary

Trump had more pairs of adjectives and nouns by +17.1% (178 vs 152) and more pairs of adverbs and verbs +22.6% (38 vs 31) than Biden. The words pairings unique to a candidate are quite interesting to explore. For example, Trump used the noun noun pair "word smart" (I'm sure you remember it) whereas Biden had noun noun pairs like "kitchen table" and "COVID crisis".

Detailed Part of Speech Tags

You can really get into the weeds here. Parts of speech are counted more granularly in these tables — nouns and verbs are split into classes and many other word types are shown, such as conjunctions and prepositions.

Table 10a
detailed POS tags — nouns and verbs
Count by part of speech tag: NN (noun, singular), NNP (proper noun, singular), NNPS (proper noun, plural), NNS (noun plural), VB (verb, base form), VBD (verb, past tense), VBG (verb, gerund/present participle), VBN (verb, past participle), VBP (verb, sing. present, non-3d), VBZ (verb, 3rd person sing. present)
Penn Treebank part of speech tag
NN NNP NNPS NNS VB VBD VBG VBN VBP VBZ
Donald Trump
743
9.64%
743
330
4.28%
330
11
0.14%
11
343
4.45%
343
399
5.18%
399
340
4.41%
340
151
1.96%
151
163
2.11%
163
488
6.33%
488
230
2.98%
230
Joe Biden
828
12.05%
111.4%
828
228
3.32%
69.1%
228
20
0.29%
181.8%
20
276
4.02%
80.5%
276
395
5.75%
99.0%
395
201
2.92%
59.1%
201
180
2.62%
119.2%
180
145
2.11%
89.0%
145
293
4.26%
60.0%
293
358
5.21%
155.7%
358

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 10b
detailed POS tags — adjectives, pronouns, adverbs and wh-words
Count by part of speech tag: JJ (adjective), JJR (adjective, comparative), JJS (adjective, superlative), PRP (personal pronoun), PRP$ (possessive pronoun), RB (adverb), RBR (adverb, comparative), RBS (adverb, superlative), WDT (wh-determiner), WP (wh-pronoun), WP$ (possessive wh-pronoun), WRB (wh-abverb)
Penn Treebank part of speech tag
JJ JJR JJS PRP PRP$ RB RBR RBS WDT WP WP$ WRB
Donald Trump
332
4.31%
332
22
0.29%
22
26
0.34%
26
1,099
14.26%
1099
86
1.12%
86
529
6.86%
529
12
0.16%
12
2
0.03%
2
37
0.48%
37
69
0.90%
69
62
0.80%
62
Joe Biden
317
4.61%
95.5%
317
27
0.39%
122.7%
27
9
0.13%
34.6%
9
723
10.52%
65.8%
723
89
1.29%
103.5%
89
416
6.05%
78.6%
416
7
0.10%
58.3%
7
30
0.44%
81.1%
30
92
1.34%
133.3%
92
58
0.84%
93.5%
58

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 10c
detailed POS tags — prepositions, conjunctions, determiners and others
Count by part of speech tag: CC (coordinating conjunction), CD (cardinal digit), DT (determiner), EX (existential there), FW (foreign word), IN (preposition/subordinating conjunction), MD (modal), PDT (predeterminer), POS (possessive ending), RP (particle), TO (to), UH (interjection)
Penn Treebank part of speech tag
CC CD DT EX FW IN MD PDT POS RP TO UH
Donald Trump
274
3.55%
274
153
1.98%
153
664
8.61%
664
20
0.26%
20
707
9.17%
707
142
1.84%
142
1
0.01%
1
13
0.17%
13
73
0.95%
73
168
2.18%
168
19
0.25%
19
Joe Biden
197
2.87%
71.9%
197
99
1.44%
64.7%
99
699
10.17%
105.3%
699
28
0.41%
140.0%
28
725
10.55%
102.5%
725
114
1.66%
80.3%
114
11
0.16%
1,100
11
16
0.23%
123.1%
16
58
0.84%
79.5%
58
222
3.23%
132.1%
222
12
0.17%
63.2%
12

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 10
legend
a
b
c
10

a — total number of words with a given tag

b — (a) relative to all tagged words

c — (a) relative to number of words with this tag used by Donald Trump

bar — proportion of a

Table 10
commentary

Trump used +44.7% (330 vs 228) more proper nouns and +52.0% (1,099 vs 723) more prepositions than Biden. His use of singular present non-3rd person verbs (VBP) was +66.6% (488 vs 293) higher but his use of 3rd person singular verbs (VPZ) was –35.8% (230 vs 358) lower. Trump also spoke in the past tense +69.2% (340 vs 201) more than Biden.

Exclusive and Shared Usage

This section enumerates words that were exclusive to a candidate (e.g. used by one candidate but not the other). This content provides insight into what the candidates' priorities are and reveals differences in perspective on similar topics.

For a given part of speech, the table breaks down the number of words that were spoken by only one of the candidates or both candidates (intersection). The last row includes words spoken by either candidate (union).

Table 11
exclusive word usage
Total and unique words used exclusively by a candidate, or by both.
part of speech
n+v+adj+adv nouns (n) verbs (v) adjectives (adj) adverbs (adv)
Donald Trump
937 518
100.0% 55.3%
16.3% 34.3%
419518
2422908813859721632
532 290
56.8% 54.5%
20.0% 34.3%
242290
242290
226 138
24.1% 61.1%
11.0% 29.1%
88138
88138
131 72
14.0% 55.0%
20.1% 31.7%
5972
5972
48 32
5.1% 66.7%
13.1% 37.2%
1632
1632
Joe Biden
871 575
100.0% 66.0%
15.2% 38.1%
296575
141312951854074618
453 312
52.0% 68.9%
17.0% 36.9%
141312
141312
280 185
32.1% 66.1%
13.6% 39.0%
95185
95185
114 74
13.1% 64.9%
17.5% 32.6%
4074
4074
24 18
2.8% 75.0%
6.5% 20.9%
618
618
both candidates
3,929 416
100.0% 10.6%
68.5% 27.6%
3513416
139519713611213086225829
1,592 197
40.5% 12.4%
59.7% 23.3%
1395197
1395197
1,482 121
37.7% 8.2%
72.2% 25.5%
1361121
1361121
370 62
9.4% 16.8%
56.7% 27.3%
30862
30862
287 29
7.3% 10.1%
78.2% 33.7%
25829
25829
total
5,737 1,509
100.0% 26.3%
100.0% 100.0%
42281509
1820845157947442522728186
2,665 845
46.5% 31.7%
100.0% 100.0%
1820845
1820845
2,053 474
35.8% 23.1%
100.0% 100.0%
1579474
1579474
652 227
11.4% 34.8%
100.0% 100.0%
425227
425227
367 86
6.4% 23.4%
100.0% 100.0%
28186
28186

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 11c
legend
a d
b e
c f
4030
40302015105

a — total number of words in set (e.g. obama \ romney, obama ∩ romney, obama ∪ romney , for a given part of speech

b — (a) relative to all exclusive words in n+v+adj+adv

c — (a) relative to all words in n+v+adj+adv

d — unique words in (a)

e — (d) relative to (a)

f — (d) relative to all unique words in n+v+adj+adv

bar1 — normalized ratio of (a-d):d

bar2 — absolute ratio of (a-d):d for all POS groups (first column) or POS group (other columns)

Table 11
commentary

Biden used 24 adverbs that Trump didn't use, meanwhile trump used 48 adverbs +100.0% (48 vs 24) that Biden didn't use. These lists are fun to look at. Trump used adverbs like "certainly" and "definitely" and "powerfully" and "phenomenally" whereas Biden used "constantly" and "fairly" and "honorably" and "socially".

Pronoun Usage

This section explores pronoun use in detail. Refer to the methods section for details.

Pronoun Count

Fraction of all words that were pronouns.

Table 12a
pronoun fraction
Fraction of words that were pronouns.
speaker all pronouns
Donald Trump
7,708 1,131
100.0% 14.7%
65771131
1,681 54
21.8% 3.2%
162754
Joe Biden
6,873 1,181
100.0% 17.2%
56921181
1,376 56
20.0% 4.1%
132056
total
14,581 1,743
100.0% 12.0%
128381743
3,057 62
21.0% 2.0%
299562

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 12b
exclusive and shared pronouns
Pronouns exclusive to speaker (e.g. speaker A but not speaker B) and shared by speakers (speaker A and B).
set word count
Donald Trump
13 6
0.4% 46.2%
76
Joe Biden
15 8
0.5% 53.3%
78
both candidates
3,029 48
99.1% 1.6%
298148

Hover over fields with (e.g. 155) to download the corresponding data file.

Pronoun by Person, Gender and Count

Pronoun usage by person (1st, 2nd, 3rd), gender (masculine, feminine, neuter) and count (singular, plural).

Table 13a
Pronoun by person
Count of pronouns by first, second or third person.
pronoun person
all first second third
Donald Trump
1,188 19
100.0% 1.6%
3917309246910
398 7
33.5% 1.8%
3917
311 2
26.2% 0.6%
3092
479 10
40.3% 2.1%
46910
Joe Biden
812 19
100.0% 2.3%
2296126243811
235 6
28.9% 2.6%
2296
128 2
15.8% 1.6%
1262
449 11
55.3% 2.4%
43811

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 13b
Pronoun by gender
Count of pronouns by masculine, feminine or neuter gender.
pronoun gender
all masculine feminine neuter
Donald Trump
300 7
100.0% 2.3%
973721892
100 3
33.3% 3.0%
973
9 2
3.0% 22.2%
72
191 2
63.7% 1.0%
1892
Joe Biden
355 7
100.0% 2.0%
2153621272
218 3
61.4% 1.4%
2153
8 2
2.3% 25.0%
62
129 2
36.3% 1.6%
1272

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 13c
Pronoun by number
Count of pronouns by singular or plural.
pronoun number
all singular plural
Donald Trump
1,208 40
100.0% 3.3%
7692439916
793 24
65.6% 3.0%
76924
415 16
34.4% 3.9%
39916
Joe Biden
1,064 38
100.0% 3.6%
7622326415
785 23
73.8% 2.9%
76223
279 15
26.2% 5.4%
26415

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 13
legend
a b
c d
153

a — total number of pronouns, by type

b — unique pronouns in (a)

c — (a) as fraction of all pronouns

d — (b) as fraction in (a)

bar — proportion of (a – b):b

Table 13
commentary

It gets messy but interesting here. Looking at pronouns by person, Trump used +69.4% (398 vs 235) more 1st person pronouns and +143.0% (311 vs 128) more 2nd person pronouns than Biden. In other words, he referred to himself and Biden much more than Biden did (to Trump and himself, respectively). Looking at pronouns by gender, Trump used –54.1% (100 vs 218) fewer masculine pronouns and +48.1% (191 vs 129) more neuter pronouns than Biden.

First and third person pronouns — a closer look

These tables break pronouns by interesting contrasts. For example, the ratio of singular to plural 1st person pronouns reveals the use of "I/my/myself" vs. "we/our/ours".

Table 14a
1st person pronouns, by count
Count of singular and plural first person pronouns. This table contrasts use of I/my/myself vs. we/our/ours.
pronoun
first first singular first plural
Donald Trump
398 7
100.0% 1.8%
23831534
241 3
60.6% 1.2%
2383
157 4
39.4% 2.5%
1534
Joe Biden
235 6
100.0% 2.6%
12531043
128 3
54.5% 2.3%
1253
107 3
45.5% 2.8%
1043
Table 14b
3rd person pronouns, by count
Count of singular and plural third person pronouns. This table contrasts he/she/his/her/it vs. they/them/theirs.
pronoun
third third singular third plural
Donald Trump
479 10
100.0% 2.1%
29371763
300 7
62.6% 2.3%
2937
179 3
37.4% 1.7%
1763
Joe Biden
449 11
100.0% 2.4%
3487904
355 7
79.1% 2.0%
3487
94 4
20.9% 4.3%
904
Table 14c
Me and you — 1st person singular and second person pronouns
Count of 1st person singular and second person pronouns. This table contrasts me/my/myself vs you/yours/yourself.
pronoun
all 1st singular 2nd
Donald Trump
552 5
100.0% 0.9%
23833092
241 3
43.7% 1.2%
2383
311 2
56.3% 0.6%
3092
Joe Biden
256 5
100.0% 2.0%
12531262
128 3
50.0% 2.3%
1253
128 2
50.0% 1.6%
1262
Table 14d
I, me, myself and my — closer look at 1st person singular pronouns
Count of specific 1st person singular pronouns: I, me, myself and my.
pronoun
all I me myself my
Donald Trump
241
100.0%
188.00042.0000.00011.000
188
78.0%
188.000
42
17.4%
42.000
0
0.0%
0.000
11
4.6%
11.000
Joe Biden
128
100.0%
98.00012.0000.00018.000
98
76.6%
98.000
12
9.4%
12.000
0
0.0%
0.000
18
14.1%
18.000
Table 14
legend
a b
c d
153

a — total number of pronouns, by type

b — unique pronouns in (a) (if more than one)

c — (a) as fraction of all pronouns

d — (b) as fraction in (a) (if less than 100%)

bar — proportion of (a – b):b

Table 14
commentary

We've already seen that Trump used +22.2% (1,681 vs 1,376) more pronouns than Biden. Breaking usage down by categories, Trump used +69.4% (398 vs 235) more 1st person pronouns and his delivery had Δrel=+11.2% (Δabs=+6.1%, 60.6% vs 54.5%) proportionately more 1st person singular pronouns (e.g. me, my, myself). Biden, on the other hand, had Δrel=+15.5% (Δabs=+6.1%, 45.5% vs 39.4%) proportionately more 1st person plural pronouns (e.g. we, us, our, ours).

When looking at 3rd person pronouns, Trump had Δrel=+78.9% (Δabs=+16.5%, 37.4% vs 20.9%) more use of the 3rd person plural (e.g. they, them) than Biden.

Considering only the 1st person singular (e.g. me, my, myself) and 2nd person pronouns (e.g. you, yours), Biden actually used the 1st person singular Δrel=+14.4% (Δabs=+6.3%, 50% vs 43.7%) proportionately more.

Focusing only on the subset of prounouns (I, me, myself and my), Trump used +88.3% (241 vs 128) more of these words than Biden. Proportionately his use of "me" was Δrel=+85.1% (Δabs=+8.0%, 17.4% vs 9.4%) proportionately higher but his use of "my" was lower.

Pronouns by Category

This table tallies the use of pronoun by category. The categories are personal, demonstrative, indefinite, object, possessive, interrogative, others, relative, reflexive. Note that some pronouns that belong to multiple categories are counted in only one. For a list of pronouns for each category, see the pronoun methods section.

Table 15
Pronouns by cateogry
Count of pronouns by category.
pronoun category
all personal demonstrative indefinite object possessive interrogative others relative reflexive
Donald Trump
1,681
100.0%
1005.000179.000168.00096.00085.00082.00041.00025.0002.000
1,005
59.8%
9987
179
10.6%
1754
168
10.0%
14622
96
5.7%
915
85
5.1%
796
82
4.9%
784
41
2.4%
365
25
1.5%
241
2
0.1%
11
Joe Biden
1,376
100.0%
663.000234.000160.00058.00088.000103.00043.00024.0003.000
663
48.2%
6567
234
17.0%
2304
160
11.6%
13822
58
4.2%
535
88
6.4%
835
103
7.5%
994
43
3.1%
376
24
1.7%
231
3
0.2%
12
Table 15
legend
a b
15

a — total number of pronouns, by category

b — (a) as fraction of all pronouns

bar — proportion of (a)

Table 15
commentary

Most of the pronouns used by the speakers were of the personal class (he, i, it, she, they, we, you). Proportionately, Trump used Δrel=+24.1% (Δabs=+11.6%, 59.8% vs 48.2%) pronouns from this class.

The next two common categories were demonstrative (that, these, this, those) and indefinite (all, another, any, anybody, anything, both, either, everybody, everything, few, many, most, no, nobody, none, nothing, one, other, others, some, somebody, something). Biden had Δrel=+60.4% (Δabs=+6.4%, 17% vs 10.6%) proportionately more demonstrative pronouns.

Biden also use Δrel=+53.1% (Δabs=+2.6%, 7.5% vs 4.9%) proportionately more interrogative pronouns (what, whatever, which, who).

Noun Phrase Usage

Noun phrases were extracted from the text and analyzed for frequency, word count, unique word count and richness. Single-word phrases were not counted.

Top-level noun phrases are those without a parent noun phrase (a parent phrase is one that a similar, longer phrase). Derived noun phrases are those with a parent (more details about noun phrase analysis).

The top-level noun phrases can be interpreted as independent concepts. Derived noun phrases can be interpreted as variants on concepts embodied by the top-level phrases.

Noun Phrase Count and length

This table reports the absolute number of noun phrases, which is related to the number of nouns, and their length.

Table 16a
noun phrase count
Counts of noun phrases in words and per noun.
speaker noun phrase count
all top-level
Donald Trump
330 185
100.0% 56.1%
0.24 0.36
145185
312 182
94.5% 58.3%
0.23 0.36
130182
Joe Biden
302 174
100.0% 57.6%
0.23 0.33
128174
283 170
93.7% 60.1%
0.22 0.32
113170

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 16b
noun phrase length
Average and 50%/90% cumulative length of noun phrases, in words.
speaker noun phrase length
all top-level
Donald Trump
2.16 2 3
2.1612.0003.000
2.17 2 3
2.1702.0003.000
Joe Biden
2.16 2 3
2.1622.0003.000
2.16 2 3
2.1632.0003.000

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 16a
legend
a d
b e
c f
1070

a — number of noun phrases

b — (a) relative to number of all noun phrases

c — number of noun phrases per noun

d — number of unique phrases

e — (c) relative to (a)

f — number of unique noun phrases per unique noun

bar — normalized ratio of (a–c):c

Table 16b
legend
a b c
102080

a — average noun phrase size, in words

b — largest noun phrase size in 50% of content

c — largest noun phrase size in 90% of content

bar — proportion of a:b:c


Table 16
commentary

Trump delivered slightly more +9.3% (330 vs 302) noun phrases.

Exclusive and Shared Noun Phrase Count and length

Table 17a
exclusive and shared noun phrase count
Counts of exclusive and shared noun phrases in words and per noun.
speaker noun phrase count
all top-level
Donald Trump
294 164
46.5% 55.8%
130164
286 165
97.3% 57.7%
121165
Joe Biden
266 155
42.1% 58.3%
111155
259 154
97.4% 59.5%
105154
both candidates
72 19
11.4% 26.4%
5319
50 15
69.4% 30.0%
3515

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 17b
exclusive and shared noun phrase length
Average and 50%/90% cumulative length of noun phrases, in words.
speaker noun phrase length
all top-level
Donald Trump
2.17 2 3
2.1702.0003.000
2.18 2 3
2.1822.0003.000
Joe Biden
2.17 2 3
2.1692.0003.000
2.17 2 3
2.1742.0003.000
both candidates
2.10 2 3
2.0972.0003.000
2.04 2 2
2.0402.0002.000

Hover over fields with (e.g. 155) to download the corresponding data file.

Table 17a
legend
a c
b d
1070

a — number of noun phrases

b — (a) relative to number of all noun phrases

c — number of unique phrases

d — (c) relative to (a)

bar — normalized ratio of (a–c):c

Table 17b
legend
a b c
102080

a — average noun phrase size, in words

b — largest noun phrase size in 50% of content

c — largest noun phrase size in 90% of content

bar — proportion of a:b:c


Table 17
commentary

Trump delivered slightly more +10.5% (294 vs 266) noun phrases that were exclusive to him. For example, phrases like "big pharma", "big problem" and "big stuff". Meanwhile, Biden used phrases like "better shape", "american people" and "foreign policy".

Windbag Index

The Windbag Index is a compound measure that characterizes the complexity of speech. A low index is indicative of succinct speech with low degree of repetition and large number of independent concepts.

Unlike the Flesch-Kincaid readability metrics, the Windbag Index does not take into account the length of sentences or complexity (e.g. number of syllables) of individual words.

Table 18
windbag index
Windbag Index for each speaker. The higher the value, the more repetitive the speech.
speaker Windbag Index
index value index terms
Donald Trump
1,200
+122.6%
1200.83498673744
0.430 0.301 0.376 0.247 0.416 0.303 0.561 0.984
+2.9% -17.4% -7.7% -27.7% -12.2% -7.9% -2.7% +0.7%
0.4295537104307210.3011174871639990.3756419662509170.2466367713004480.4161849710982660.3027522935779820.5606060606060610.983783783783784
Joe Biden
539
-55.1%
539.391711059201
0.417 0.365 0.407 0.341 0.474 0.329 0.576 0.977
-2.9% +21.1% +8.4% +38.3% +13.9% +8.6% +2.8% -0.7%
0.417285028371890.3647140864714090.4070660522273430.3411513859275050.4738562091503270.3288590604026850.5761589403973510.977011494252874
Table 18
legend
The Windbag Index is 1/(t1*t2*...*t9) where t1,t2,...,t8 are

t1 — fraction of words that are non-stop

t2 — fraction of non-stop words that are unique

t3 — fraction of nouns that are unique

t4 — fraction of verbs that are unique

t5 — fraction of adjectives that are unique

t6 — fraction of adverbs that are unique

t7 — fraction of noun phrases that are unique

t8 — fraction of noun phrases that are top-level


Large individual terms t1...t9 contribute to a smaller index.

The percentage values below the index and each term are relative differences to the other speaker's corresponding term (i.e. 100*(a-b)/b where a is the value for one speaker and b for the other).
Table 18
commentary

The Windbag Index is a fun metric. There isn't a lot of surprise here: Trump is +122.6% (1,200 vs 539) more of a windbag. This value becomes interesting when eventually compared across all three debates.

Word Clouds

In the word clouds below, the size of the word is proportional to the number of times it was used by a candidate (method details).

Not all words from a group used to draw the cloud fit in the image — less frequently used words for large word groups may fall outside the image.

All Words for Each Candidate

Each candidate's debate portion was extracted and frequencies were compiled for each part of speech (noun, verb, adjective, adverb), with words colored by their part of speech category.

The distribution of sizes within a tag cloud follows the frequency distribution of words. However, word size cannot be compared between clouds, since the minimum and maximum size of the words is fixed.

Debate Word Cloud for Donald Trump - all words

Debate tag cloud for Donald Trump
Size proportional to word frequency. Color encodes part of speech: noun verb adjective adverb

Debate Word Cloud for Joe Biden - all words

Debate tag cloud for Joe Biden
Size proportional to word frequency. Color encodes part of speech: noun verb adjective adverb
commentary

Trump's main words were "good", "people", "want" and "know". Biden used "true" a lot.

Exclusive Words for Each Candidate

The clouds below show words used exlusively by a candidate. For example, if candidate A used the word "invest" (any number of times), but candidate B did not, then the word will appear in the exclusive word tag cloud for candidate A.

Words exclusive to Donald Trump

Debate tag cloud for Donald Trump
Size proportional to word frequency. Color encodes part of speech: noun verb adjective adverb

Words exclusive to Joe Biden

Debate tag cloud for Joe Biden
Size proportional to word frequency. Color encodes part of speech: noun verb adjective adverb
commentary

Words that Trump used that Biden did not use include "closed", "individual", "joe" (as expected) and "phenomenal". Biden's set was centered on "america", "american", "together", "create" and "discredited".

Pronouns for Each Candidate

Word clouds based on only pronouns.

Pronouns for Donald Trump

Debate tag cloud for Donald Trump
Size proportional to word frequency. Color encodes pronoun type: masculine feminine neuter 1st person 2nd person singular plural other

Pronouns for Joe Biden

Debate tag cloud for Joe Biden
Size proportional to word frequency. Color encodes pronoun type: masculine feminine neuter 1st person 2nd person singular plural other
commentary

The pronoun clouds for both candidates look very similar. Biden's use of "we" was more frequent.

Part of Speech Word Clouds

In these clouds, words from each major part of speech were colored based on whether they were exclusive to a candidate or shared by the candidates.

The size of the word is relative to the frequency for the candidate — word sizes between candidates should not be used to indicate difference in absolute frequency.

Cloud of noun words, by speaker

Words unique to each candidate (Trump, Biden) and those spoken by both.
commentary

The unique noun cloud (nouns spoken by only one candidate but not the other) is nearly entirely blue (Biden's color). Essentially, Biden repeated nouns exclusive to him far more than Trump repeated his set. It's surprising that Trump never said "America", "affordable", "violence", "recession" and "outcome".

Cloud of verb words, by speaker

Words unique to each candidate (Trump, Biden) and those spoken by both.
commentary

The unique verb cloud is more mixed in color, indicating that the candidates used their unique verbs in a more balanced way. Trump's main unique verb was "closed" whereas Biden's was "create" and "discredited".

Cloud of adjective words, by speaker

Words unique to each candidate (Trump, Biden) and those spoken by both.
commentary

Biden said "American". Trump did not.

Cloud of adverb words, by speaker

Words unique to each candidate (Trump, Biden) and those spoken by both.
commentary

Biden said "together" whereas Trump used "ahead".

Cloud of all words, by speaker

Words unique to each candidate (Trump, Biden) and those spoken by both.
commentary

The center of this cloud is nearly all blue except for "Joe", used by Trump. Around the center there is a ring of red words — these are only by Trump, such as "predators", "couple", "drugs", "water" and "supporters".

Word Pair Clouds for Each Candidate

Pairs used only once during the debate are not shown.

word pairs for Donald Trump

JJ/JJ by Donald Trump
JJ/RB by Donald Trump
JJ/N by Donald Trump
JJ/V by Donald Trump
RB/RB by Donald Trump
RB/N by Donald Trump
RB/V by Donald Trump
N/N by Donald Trump
N/V by Donald Trump
V/V by Donald Trump

word pairs for Joe Biden

JJ/JJ by Joe Biden
JJ/RB by Joe Biden
JJ/N by Joe Biden
JJ/V by Joe Biden
RB/RB by Joe Biden
RB/N by Joe Biden
RB/V by Joe Biden
N/N by Joe Biden
N/V by Joe Biden
V/V by Joe Biden
commentary

There are naturally more of certain pairings (noun noun) than others (verb verb). Here you can see when the tagger made a mistake, for example classifying "left" as a verb in Trump's "radical left", which shows up in his adjective verb pair.

Downloads

Debate transcript

Parsed word lists and word clouds (word lists, part of speech lists, noun phrases, sentences) (word clouds)

Raw data structure

Please see the methods section for details about these files.