Spatio-temporal variation of conversational utterances on Twitter.

Alis CM, Lim MT - PLoS ONE (2013)

Bottom Line: Previously, we found that utterance lengths in English fictional conversations in books and movies have shortened over a period of 200 years. Furthermore, the subset of geographically-tagged tweets from the United States shows an inverse proportion between utterance lengths and the state-level percentage of the Black population. We argue that the shortening of utterances can be explained by the increasing usage of jargon, including coined words.


Affiliation: National Institute of Physics, University of the Philippines, Diliman, Quezon City, Philippines.

ABSTRACT
Conversations reflect the existing norms of a language. Previously, we found that utterance lengths in English fictional conversations in books and movies have shortened over a period of 200 years. In this work, we show that this shortening occurs even over a brief period of 3 years (September 2009 to December 2012) using 229 million utterances from Twitter. Furthermore, the subset of geographically-tagged tweets from the United States shows an inverse proportion between utterance lengths and the state-level percentage of the Black population. We argue that the shortening of utterances can be explained by the increasing usage of jargon, including coined words.

Figure 6. Utterance lengths across US states. A. Box plot of the utterance length distribution of each US state, sorted by increasing median utterance length. The notches were estimated using 10,000 bootstrap repetitions, but the resulting bootstrapped median values are the same as the empirical median values. B. Contiguous US states colored with the bootstrapped median utterance length.

Mentions: To estimate how the quartiles vary, the quartiles were bootstrapped using 10,000 repetitions, but the bootstrapped values (Fig. 6A) turned out to be the same as the empirical values. The spread in the bootstrapped medians is so small that the interquartile range (IQR = Q3 - Q1) of 40% of the bootstrapped medians is zero. Any difference in the median between two US states is therefore almost guaranteed to be significant. Both the Kruskal-Wallis H-test (, ) [23] and pairwise Mann-Whitney U-tests [24] on the empirical utterance length distribution (ULD) of each US state conclude that the ULDs of the US states are not all the same.
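The bootstrap step can be illustrated with a minimal Python sketch that uses only the standard library. It is not the authors' code: the function names `bootstrap_medians` and `iqr`, the seeds, and the synthetic integer lengths are all assumptions made for illustration, and the repetition count is reduced from 10,000 to keep the example fast. Because utterance lengths are integer-valued and each sample is large, the resampled medians tend to fall on very few distinct values, which is one plausible reading of why the IQR of the bootstrapped medians can be zero.

```python
import random
import statistics


def bootstrap_medians(sample, n_reps=10_000, seed=0):
    """Return the medians of n_reps resamples drawn with replacement."""
    rng = random.Random(seed)
    n = len(sample)
    return [statistics.median(rng.choices(sample, k=n)) for _ in range(n_reps)]


def iqr(values):
    """Interquartile range, Q3 - Q1, using the statistics module's default quartiles."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    return q3 - q1


# Hypothetical utterance lengths for one state (synthetic, NOT data from the paper).
data_rng = random.Random(42)
lengths = [data_rng.randint(1, 140) for _ in range(2_000)]

# Fewer repetitions than the 10,000 in the text, purely to keep the sketch quick.
medians = bootstrap_medians(lengths, n_reps=1_000)

print("empirical median:", statistics.median(lengths))
print("median of bootstrapped medians:", statistics.median(medians))
print("IQR of bootstrapped medians:", iqr(medians))
```

Running the same procedure per state and comparing the spread of bootstrapped medians against the gaps between state medians is the kind of check the paragraph describes; the significance tests themselves (Kruskal-Wallis, Mann-Whitney) are not reproduced here.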

