Limits...
A Geometric Representation of Collective Attention Flows.

Shi P, Huang X, Wang J, Zhang J, Deng S, Wu Y - PLoS ONE (2015)

Bottom Line: As a result, knowing how collective attention distributes and flows among different websites is the first step to understand the underlying dynamics of attention on WWW.And the patterns are stable across different periods.Thus, the overall distribution and the dynamics of collective attention on websites can be well exhibited by this geometric representation.

View Article: PubMed Central - PubMed

Affiliation: Science and Technology on Information Systems Engineering Laboratory, National University of Defense Technology, Changsha, China; School of Systems Science, Beijing Normal University, Beijing, China.

ABSTRACT
With the fast development of Internet and WWW, "information overload" has become an overwhelming problem, and collective attention of users will play a more important role nowadays. As a result, knowing how collective attention distributes and flows among different websites is the first step to understand the underlying dynamics of attention on WWW. In this paper, we propose a method to embed a large number of web sites into a high dimensional Euclidean space according to the novel concept of flow distance, which both considers connection topology between sites and collective click behaviors of users. With this geometric representation, we visualize the attention flow in the data set of Indiana university clickstream over one day. It turns out that all the websites can be embedded into a 20 dimensional ball, in which, close sites are always visited by users sequentially. The distributions of websites, attention flows, and dissipations can be divided into three spherical crowns (core, interim, and periphery). 20% popular sites (Google.com, Myspace.com, Facebook.com, etc.) attracting 75% attention flows with only 55% dissipations (log off users) locate in the central layer with the radius 4.1. While 60% sites attracting only about 22% traffics with almost 38% dissipations locate in the middle area with radius between 4.1 and 6.3. Other 20% sites are far from the central area. All the cumulative distributions of variables can be well fitted by "S"-shaped curves. And the patterns are stable across different periods. Thus, the overall distribution and the dynamics of collective attention on websites can be well exhibited by this geometric representation.

No MeSH data available.


The Lorenze-liked curves and the GINI-liked coefficients among cumulative attention flows, attention dissipations, and the number of websites along the radius.The green nodes in the two sub-figures on the left mean the attention flows or dissipations of the 20% websites in the core. The insets show the log-log plots of the focal variable pairs.
© Copyright Policy
Related In: Results  -  Collection

License
getmorefigures.php?uid=PMC4556699&req=5

pone.0136243.g006: The Lorenze-liked curves and the GINI-liked coefficients among cumulative attention flows, attention dissipations, and the number of websites along the radius.The green nodes in the two sub-figures on the left mean the attention flows or dissipations of the 20% websites in the core. The insets show the log-log plots of the focal variable pairs.

Mentions: To compare the relative rates of accumulation for different variables along radius, we can plot two variables together on one coordinate as shown in Fig 6. The curves can be also predicted theoretically by combining the gompertze functions together to eliminate R. For example, we consider the relationship between N(R) and T(R) and we know:{N(R)=exp(-kNexp(-cNR))T(R)=exp(-kTexp(-cTR))(10)


A Geometric Representation of Collective Attention Flows.

Shi P, Huang X, Wang J, Zhang J, Deng S, Wu Y - PLoS ONE (2015)

The Lorenze-liked curves and the GINI-liked coefficients among cumulative attention flows, attention dissipations, and the number of websites along the radius.The green nodes in the two sub-figures on the left mean the attention flows or dissipations of the 20% websites in the core. The insets show the log-log plots of the focal variable pairs.
© Copyright Policy
Related In: Results  -  Collection

License
Show All Figures
getmorefigures.php?uid=PMC4556699&req=5

pone.0136243.g006: The Lorenze-liked curves and the GINI-liked coefficients among cumulative attention flows, attention dissipations, and the number of websites along the radius.The green nodes in the two sub-figures on the left mean the attention flows or dissipations of the 20% websites in the core. The insets show the log-log plots of the focal variable pairs.
Mentions: To compare the relative rates of accumulation for different variables along radius, we can plot two variables together on one coordinate as shown in Fig 6. The curves can be also predicted theoretically by combining the gompertze functions together to eliminate R. For example, we consider the relationship between N(R) and T(R) and we know:{N(R)=exp(-kNexp(-cNR))T(R)=exp(-kTexp(-cTR))(10)

Bottom Line: As a result, knowing how collective attention distributes and flows among different websites is the first step to understand the underlying dynamics of attention on WWW.And the patterns are stable across different periods.Thus, the overall distribution and the dynamics of collective attention on websites can be well exhibited by this geometric representation.

View Article: PubMed Central - PubMed

Affiliation: Science and Technology on Information Systems Engineering Laboratory, National University of Defense Technology, Changsha, China; School of Systems Science, Beijing Normal University, Beijing, China.

ABSTRACT
With the fast development of Internet and WWW, "information overload" has become an overwhelming problem, and collective attention of users will play a more important role nowadays. As a result, knowing how collective attention distributes and flows among different websites is the first step to understand the underlying dynamics of attention on WWW. In this paper, we propose a method to embed a large number of web sites into a high dimensional Euclidean space according to the novel concept of flow distance, which both considers connection topology between sites and collective click behaviors of users. With this geometric representation, we visualize the attention flow in the data set of Indiana university clickstream over one day. It turns out that all the websites can be embedded into a 20 dimensional ball, in which, close sites are always visited by users sequentially. The distributions of websites, attention flows, and dissipations can be divided into three spherical crowns (core, interim, and periphery). 20% popular sites (Google.com, Myspace.com, Facebook.com, etc.) attracting 75% attention flows with only 55% dissipations (log off users) locate in the central layer with the radius 4.1. While 60% sites attracting only about 22% traffics with almost 38% dissipations locate in the middle area with radius between 4.1 and 6.3. Other 20% sites are far from the central area. All the cumulative distributions of variables can be well fitted by "S"-shaped curves. And the patterns are stable across different periods. Thus, the overall distribution and the dynamics of collective attention on websites can be well exhibited by this geometric representation.

No MeSH data available.