Limits...
Spatio-temporal variation of conversational utterances on Twitter.

Alis CM, Lim MT - PLoS ONE (2013)

Bottom Line: Previously, we found that utterance lengths in English fictional conversations in books and movies have shortened over a period of 200 years.Furthermore, the subset of geographically-tagged tweets from the United States show an inverse proportion between utterance lengths and the state-level percentage of the Black population.We argue that shortening of utterances can be explained by the increasing usage of jargon including coined words.

View Article: PubMed Central - PubMed

Affiliation: National Institute of Physics, University of the Philippines, Diliman, Quezon City, Philippines.

ABSTRACT
Conversations reflect the existing norms of a language. Previously, we found that utterance lengths in English fictional conversations in books and movies have shortened over a period of 200 years. In this work, we show that this shortening occurs even for a brief period of 3 years (September 2009-December 2012) using 229 million utterances from Twitter. Furthermore, the subset of geographically-tagged tweets from the United States show an inverse proportion between utterance lengths and the state-level percentage of the Black population. We argue that shortening of utterances can be explained by the increasing usage of jargon including coined words.

Show MeSH
Exploring possible mechanisms of shortening.Annual values of A. median word length of all words, B. median word length of the 1000 most frequently occurring words, C. most frequent word length of the 1000 most frequently occurring words, D. fraction of 1000 most frequently occurring words relatively to all occurrences of words, E. median utterance length in number of words F. median tweet length in number of words, and G. median trending topic phrase length.
© Copyright Policy
Related In: Results  -  Collection


getmorefigures.php?uid=PMC3814942&req=5

pone-0077793-g005: Exploring possible mechanisms of shortening.Annual values of A. median word length of all words, B. median word length of the 1000 most frequently occurring words, C. most frequent word length of the 1000 most frequently occurring words, D. fraction of 1000 most frequently occurring words relatively to all occurrences of words, E. median utterance length in number of words F. median tweet length in number of words, and G. median trending topic phrase length.

Mentions: The median word length of all words is 4 characters (Fig. 5A) for all years from 2009 to 2012. Although the median length of the 1000 most frequently used words from 2009 to 2012 is constant at 4 characters (Fig. 5B), the peak (mode) moved from 4 characters in 2009 to 3 characters (Fig. 5C) in the succeeding years. Based on Kruskal-Wallis tests, the word length distributions of the 1000 most frequently used words for 2010–2012 are not significantly different (, ) with each other but are significantly different with the distribution for 2009 (, ). However, the observed shortening is not just due to a sudden shortening of the 1000 most frequently occurring words from 2009 to 2010 because it was still observed in 2010–2012 (Table 1 and Fig. 4, 2010–2012).


Spatio-temporal variation of conversational utterances on Twitter.

Alis CM, Lim MT - PLoS ONE (2013)

Exploring possible mechanisms of shortening.Annual values of A. median word length of all words, B. median word length of the 1000 most frequently occurring words, C. most frequent word length of the 1000 most frequently occurring words, D. fraction of 1000 most frequently occurring words relatively to all occurrences of words, E. median utterance length in number of words F. median tweet length in number of words, and G. median trending topic phrase length.
© Copyright Policy
Related In: Results  -  Collection

Show All Figures
getmorefigures.php?uid=PMC3814942&req=5

pone-0077793-g005: Exploring possible mechanisms of shortening.Annual values of A. median word length of all words, B. median word length of the 1000 most frequently occurring words, C. most frequent word length of the 1000 most frequently occurring words, D. fraction of 1000 most frequently occurring words relatively to all occurrences of words, E. median utterance length in number of words F. median tweet length in number of words, and G. median trending topic phrase length.
Mentions: The median word length of all words is 4 characters (Fig. 5A) for all years from 2009 to 2012. Although the median length of the 1000 most frequently used words from 2009 to 2012 is constant at 4 characters (Fig. 5B), the peak (mode) moved from 4 characters in 2009 to 3 characters (Fig. 5C) in the succeeding years. Based on Kruskal-Wallis tests, the word length distributions of the 1000 most frequently used words for 2010–2012 are not significantly different (, ) with each other but are significantly different with the distribution for 2009 (, ). However, the observed shortening is not just due to a sudden shortening of the 1000 most frequently occurring words from 2009 to 2010 because it was still observed in 2010–2012 (Table 1 and Fig. 4, 2010–2012).

Bottom Line: Previously, we found that utterance lengths in English fictional conversations in books and movies have shortened over a period of 200 years.Furthermore, the subset of geographically-tagged tweets from the United States show an inverse proportion between utterance lengths and the state-level percentage of the Black population.We argue that shortening of utterances can be explained by the increasing usage of jargon including coined words.

View Article: PubMed Central - PubMed

Affiliation: National Institute of Physics, University of the Philippines, Diliman, Quezon City, Philippines.

ABSTRACT
Conversations reflect the existing norms of a language. Previously, we found that utterance lengths in English fictional conversations in books and movies have shortened over a period of 200 years. In this work, we show that this shortening occurs even for a brief period of 3 years (September 2009-December 2012) using 229 million utterances from Twitter. Furthermore, the subset of geographically-tagged tweets from the United States show an inverse proportion between utterance lengths and the state-level percentage of the Black population. We argue that shortening of utterances can be explained by the increasing usage of jargon including coined words.

Show MeSH