27 Feb

Words with Reps: Map of Vocab on the House Floor

What am I looking at?
This map shows the breadth of vocabulary of each of the nation’s 435 voting Representatives. A darker green indicates that a Rep has a larger vocabulary. The 6 non-voting Representatives are excluded from the map, though they are included in the rankings. The map is colored by each Rep’s “SQPD Ranking” (more details below), which measures a Representative’s usage of 3,393 different words that might be found on the SAT.

What are the numbers that appear when I click a district?

  • Word Diversity Rank reflects how frequently a Rep uses SAT words relative to a basket of common words. The common words are ‘are’, ‘one’, ‘they’, ‘was’, ‘were’, and ‘with’. The basket was specifically designed to be non-partisan. Oddly enough, some common words–even ‘one’, for example–show what appears to be a slight partisan tilt. [Teaser: subject of a future post.] The highest word diversity rank belongs to none other than Ronald Ernest Paul (aka Ron Paul).
  • SAT Words Used (Count) reflects the number of different SAT words used by a Representative. The highest count (1,178) belongs to Sheila Jackson Lee (D-TX). Not only is she highly educated, but she’s also been in office since 1995.
  • SQPD Index, or the “sesquipedalian index”, is an aggregate measure of a Representative’s vocabulary. It combines SAT Words Used (Count) and Word Diversity Rank, with slightly higher weight given to the count. Both factors are important, though, since certain Reps have a much, much longer record than others.
  • SAT Words Used (Total) is the total number of SAT word utterances. In other words, if a Rep said 10 different SAT words 20 times each, that Rep would have a score of 200. The highest total (5,027) also belongs to Sheila Jackson Lee.
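
Here is a rough Python sketch of how the count, total, and common-word comparison above could be computed from one Rep's word counts. It is not the code behind the map: the function names, the dictionary keys, and the choice to divide total SAT utterances by total basket utterances are all my assumptions for illustration. The SQPD Index itself is left out because its exact weights aren't given here.

```python
# The six common words named in the post.
COMMON_BASKET = {"are", "one", "they", "was", "were", "with"}


def vocab_stats(word_counts, sat_words, basket=COMMON_BASKET):
    """Summarise one Rep's word counts (hypothetical helper, not the original code)."""
    # Keep only SAT words the Rep actually said at least once.
    sat_counts = {w: n for w, n in word_counts.items() if w in sat_words and n > 0}
    distinct = len(sat_counts)          # SAT Words Used (Count)
    total = sum(sat_counts.values())    # SAT Words Used (Total)
    basket_total = sum(word_counts.get(w, 0) for w in basket)
    # Assumed form of the comparison: SAT utterances per common-word utterance.
    ratio = total / basket_total if basket_total else 0.0
    return {
        "sat_words_used_count": distinct,
        "sat_words_used_total": total,
        "sat_to_common_ratio": ratio,
    }


def rank_by(stats_by_rep, key):
    """Rank Reps from highest to lowest on one statistic (1 = highest)."""
    ordered = sorted(stats_by_rep, key=lambda rep: stats_by_rep[rep][key], reverse=True)
    return {rep: position + 1 for position, rep in enumerate(ordered)}


# Worked example from the text: 10 different SAT words, each said 20 times.
sat_words = {f"word{i}" for i in range(10)}
counts = {w: 20 for w in sat_words}
counts.update({"are": 250, "with": 150})  # made-up common-word counts
stats = vocab_stats(counts, sat_words)
print(stats["sat_words_used_total"])  # 200
```

Ranking every Rep on the ratio with `rank_by` would then give something like the Word Diversity Rank, again under the assumptions above.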

Who has the 10 highest scores?
In order, the Representatives with the 10 highest SQPD Index values are as follows:

  1. Sheila Jackson Lee (D-TX): 4.79
  2. Christopher H. Smith (R-NJ): 2.98
  3. Dennis Kucinich (D-OH): 1.00
  4. Steve King (R-IA): 0.99
  5. Barney Frank (D-MA): 0.97
  6. Charles Rangel (D-NY): 0.95
  7. John Conyers (D-MI): 0.92
  8. Ronald Ernest Paul (R-TX): 0.87
  9. Steny Hoyer (D-MD): 0.84
  10. Marcy Kaptur (D-OH): 0.84

What data sources were used to create this map?
The Congressional Record is the “official record of the proceedings and debates of the United States Congress.” The Sunlight Foundation, through its Capitol Words API, processes the Congressional Record every day. They currently maintain data from 1996 onward. You can query word or phrase frequency by political party, over time, by representative, and so on.

The list of SAT words was obtained from two sites, freevocabulary.com and majortests.com. There were 5,372 words on the original list, although some of them hardly seem like SAT words (‘further’, ‘believe’, ‘off’), while others are mainstays of Congressional dialogue (‘foreign’, ‘unanimous’, ‘bureaucracy’). So, I excluded all words said more than 500 times from the final list, leaving 3,393 SAT words. The most common of these are ‘barring’, ‘culprit’, ‘passive’, and ‘revert’.
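
The 500-utterance cutoff is simple to express in code. The snippet below is only a sketch of that filter: the function name is hypothetical, and the counts in the example are made up for illustration rather than taken from the Congressional Record.

```python
def filter_sat_words(candidates, corpus_counts, max_uses=500):
    """Keep candidate words said at most `max_uses` times overall.

    `corpus_counts` maps each word to its total utterance count; words
    missing from it are treated as never said and are kept.
    """
    return sorted(w for w in set(candidates) if corpus_counts.get(w, 0) <= max_uses)


# Illustrative counts only (not real data).
candidates = ["further", "unanimous", "barring", "revert"]
corpus_counts = {"further": 9000, "unanimous": 12000, "barring": 480, "revert": 450}
print(filter_sat_words(candidates, corpus_counts))  # ['barring', 'revert']
```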

Finally, the map itself was created using Google Fusion Tables. Here’s a link to the table. Enjoy!

Posted by dfkoz