Limits...
A Geometric Representation of Collective Attention Flows.

Shi P, Huang X, Wang J, Zhang J, Deng S, Wu Y - PLoS ONE (2015)

Bottom Line: As a result, knowing how collective attention distributes and flows among different websites is the first step to understand the underlying dynamics of attention on WWW.And the patterns are stable across different periods.Thus, the overall distribution and the dynamics of collective attention on websites can be well exhibited by this geometric representation.

View Article: PubMed Central - PubMed

Affiliation: Science and Technology on Information Systems Engineering Laboratory, National University of Defense Technology, Changsha, China; School of Systems Science, Beijing Normal University, Beijing, China.

ABSTRACT
With the fast development of Internet and WWW, "information overload" has become an overwhelming problem, and collective attention of users will play a more important role nowadays. As a result, knowing how collective attention distributes and flows among different websites is the first step to understand the underlying dynamics of attention on WWW. In this paper, we propose a method to embed a large number of web sites into a high dimensional Euclidean space according to the novel concept of flow distance, which both considers connection topology between sites and collective click behaviors of users. With this geometric representation, we visualize the attention flow in the data set of Indiana university clickstream over one day. It turns out that all the websites can be embedded into a 20 dimensional ball, in which, close sites are always visited by users sequentially. The distributions of websites, attention flows, and dissipations can be divided into three spherical crowns (core, interim, and periphery). 20% popular sites (Google.com, Myspace.com, Facebook.com, etc.) attracting 75% attention flows with only 55% dissipations (log off users) locate in the central layer with the radius 4.1. While 60% sites attracting only about 22% traffics with almost 38% dissipations locate in the middle area with radius between 4.1 and 6.3. Other 20% sites are far from the central area. All the cumulative distributions of variables can be well fitted by "S"-shaped curves. And the patterns are stable across different periods. Thus, the overall distribution and the dynamics of collective attention on websites can be well exhibited by this geometric representation.

No MeSH data available.


The variations of parameters, cT, cD, cN, kT, kF, kN of “S” curve for selected snapshots.
© Copyright Policy
Related In: Results  -  Collection

License
getmorefigures.php?uid=PMC4556699&req=5

pone.0136243.g009: The variations of parameters, cT, cD, cN, kT, kF, kN of “S” curve for selected snapshots.

Mentions: The parameter k of gompertz function controls the translation of “S” curves and c controls the growth rate of the curves. From Fig 9, we can see that there is a downward trend from October 10, 2006 to February 10, 2008, corresponding to the cumulative curve of the number of websites offsets to left. It is also apparent that cN is larger than cT, cD in most situations, meaning that N(R) has more dramatic increases than T(R) and D(R) in general. This indicates the distribution of websites along the radius is always more heterogeneous.


A Geometric Representation of Collective Attention Flows.

Shi P, Huang X, Wang J, Zhang J, Deng S, Wu Y - PLoS ONE (2015)

The variations of parameters, cT, cD, cN, kT, kF, kN of “S” curve for selected snapshots.
© Copyright Policy
Related In: Results  -  Collection

License
Show All Figures
getmorefigures.php?uid=PMC4556699&req=5

pone.0136243.g009: The variations of parameters, cT, cD, cN, kT, kF, kN of “S” curve for selected snapshots.
Mentions: The parameter k of gompertz function controls the translation of “S” curves and c controls the growth rate of the curves. From Fig 9, we can see that there is a downward trend from October 10, 2006 to February 10, 2008, corresponding to the cumulative curve of the number of websites offsets to left. It is also apparent that cN is larger than cT, cD in most situations, meaning that N(R) has more dramatic increases than T(R) and D(R) in general. This indicates the distribution of websites along the radius is always more heterogeneous.

Bottom Line: As a result, knowing how collective attention distributes and flows among different websites is the first step to understand the underlying dynamics of attention on WWW.And the patterns are stable across different periods.Thus, the overall distribution and the dynamics of collective attention on websites can be well exhibited by this geometric representation.

View Article: PubMed Central - PubMed

Affiliation: Science and Technology on Information Systems Engineering Laboratory, National University of Defense Technology, Changsha, China; School of Systems Science, Beijing Normal University, Beijing, China.

ABSTRACT
With the fast development of Internet and WWW, "information overload" has become an overwhelming problem, and collective attention of users will play a more important role nowadays. As a result, knowing how collective attention distributes and flows among different websites is the first step to understand the underlying dynamics of attention on WWW. In this paper, we propose a method to embed a large number of web sites into a high dimensional Euclidean space according to the novel concept of flow distance, which both considers connection topology between sites and collective click behaviors of users. With this geometric representation, we visualize the attention flow in the data set of Indiana university clickstream over one day. It turns out that all the websites can be embedded into a 20 dimensional ball, in which, close sites are always visited by users sequentially. The distributions of websites, attention flows, and dissipations can be divided into three spherical crowns (core, interim, and periphery). 20% popular sites (Google.com, Myspace.com, Facebook.com, etc.) attracting 75% attention flows with only 55% dissipations (log off users) locate in the central layer with the radius 4.1. While 60% sites attracting only about 22% traffics with almost 38% dissipations locate in the middle area with radius between 4.1 and 6.3. Other 20% sites are far from the central area. All the cumulative distributions of variables can be well fitted by "S"-shaped curves. And the patterns are stable across different periods. Thus, the overall distribution and the dynamics of collective attention on websites can be well exhibited by this geometric representation.

No MeSH data available.