The speed and reach of forwarded emails, rumors, and hoaxes in electronic social networks

large_spain_5We have just published an experimental/theoretical work on the speed of information diffusion in social networks in Physical Review Letters. Specifically we have studied the impact of the heterogeneity of human activity in propagation of emails, rumors, hoaxes, etc. Tracking email marketing campaigns, executed by IBM Corporation in 11 European countries, we were able to compare their viral propagation with our theory (see below the campaigns details).

The results are very simple. Let me give you an example: the typical time between two emails sent by the same person is around 1 day. Traditional models of information diffusion will then yield to an infection speed of 1 day. However, some email computer viruses spread widely in a matter of hours (minutes, sometimes), while some viral propagation (for example the Veuve-Clicquot hoax) last for years. How can that occur? The reason is that traditional models are not correct because they neglect the large heterogeneity in the frequency of human activity: the average time between emails (1 day) does not actually represent the collectivity. In fact, most of us respond very quickly to emails, but some take a lot of time to do it. This fact (known and discovered previously by others) has a profound consequence in the way information spreads:

  1. When information spreads “successfully”, in the sense that it propagates and reaches most of the collectivity (i.e. it surpasses the tipping-point), its propagation speed of is determined by the people that have higher activity.
  2. However, when information reaches just a small fraction of the population (below the tipping-point), its propagation is controlled by those who take a lot of time to respond/forward and the spreading is very slow.

This phenomenon, as explained in our paper, has consequences for viral marketing, fads and hoaxes diffusion or opinion dynamics because the speed of their messages propagation depends strongly on the size of the sub-communities of very active and not-so active people. For example, in our campaigns (which were below the tipping-point yet successful from a viral marketing perspective), endogenous propagation of the commercial message lasted for months while the average time between getting the message and forwarding was only 1 day. We also found that messages do not “go viral”: They are viral because of the diffusion mechanism they use, but their spreading success largely depends on the social network propensity and heterogeneous behavior.

Finally, our work has some consequences for the way we model and understand human dynamics, since it shows that there is no such a thing as a typical time scale in the human dynamics. This is in sharp contrast with epidemic models, information diffusion models, etc. in which the heterogeneity in human activity and frequency is usually neglected, in favor of a more homogeneous picture of the activity of humans.

About the empirical data:
The viral marketing campaigns were conducted by IBM using the typical “refer-a-friend” mechanism which led to the endogenous diffusion of information. The campaigns’ offerings were promoted at the IBM. homepage where initial participants heard about them. Their primary marketing objective was to generate subscriptions to the company’s on-line newsletter. Subscriptions were entered through a form located in the campaign main web page (a.k.a. registration page). Additionally, a viral propagation mechanism accessible through a button located at the registration page was available to foster the message propagation. The button caption enticed visitors to recommend the page to friends and colleagues by offering, as additional incentive for people to forward the page, tickets for a prize draw to win a laptop computer. More technical details about the campaign can be found at Appendix D of the arXiv version of our paper

Press coverage:

  • ‘Infectious’ people spread memes across the web, New Scientist (12/08/09)
  • Email hoaxes are like viruses, The Inquirer (10/08/09)
  • The flow of viral video, ABC News (8/08/09)
  • New model for social marketing campaigns details why some information ‘goes viral’, PhysOrg (6/08/09)
  • Los perezosos frenan los rumores en Internet, ABC.es (14/8/09)
  • Party people spread viral internet memes, ComputerWeekly (14/8/09)
  • Desvelan las claves de la difusión de la información en las redes sociales, PlataformaSINC.es (7/9/09)
  • Nuevas claves para la difusión de información en las redes sociales, Noticias Madri+d (7/9/09)
  • Specialization of strategies and herding behavior of trading firms in a financial market

    Fabrizio Lillo, Esteban Moro, Gabriella Vaglica y Rosario Mantegna
    New Journal of Physics 10 (2008) 043019 [pdf]

    Abstract:
    Agent-based models of financial markets usually make assumptions  about agent’s preferred stylized strategies. Empirical validations of these assumptions have not been performed so far on a full-market scale. Here we present a comprehensive study of the resulting strategies followed by the firms which are members of the Spanish Stock Exchange. We are able to show that they can be characterized by a resulting strategy and classified in three well-defined groups of firms. Firms of the first group have a change of inventory of the traded stock which is positively correlated with the synchronous stock return, whereas firms of the second group show a negative correlation.Firms of the third group have an inventory variation uncorrelated with stock return. Firms tend to stay in the same group over the years indicating a long term specialization in the strategies controlling their inventory variation. We detect a clear asymmetry in the Granger causality between inventory variation of firms and stock return. We also detect herding in the buying and selling activity of firms. The herding properties of the two groups are markedly different and consistently observed over a four-year period of trading. Firms of the second group herd much more frequently than the ones of the first group. Our results can be used as an empirical basis for agent-based models of financial markets.

    Press Release (in spanish) 

    Writing bad letters of recommendation: the story of Bachelier and Lévy

     

    Take a coin and toss it a number of times in a time interval of duration T.  Suppose that every time you get head you win a euros and that you lose the same amount of money when you get tail. Then your capital is a random process with ups and dows like this:

     

    This process is a stochastic process usually called “Random Walk” and its properties depend on the parameters N, a and T. For example: if we play this game several times, the average mean value of the capital obtained after a time T  is zero! This is simple to realize since the probability to get either head or tails is the same. The problem comes when you analyze the fluctuations around this zero gain: the root mean square deviations from this zero mean behavior go like

            RMS(N) =   N a2

     

    which brings the sad conclusion that the more times you play the game the higher the fluctuations are. If you are risk-averse, this is the worst situation since, although in average you don’t lose or win, the uncertanty of what quantity you will get in one shot of the game is growing in time.

    We now ask the following question: do the properties of this game change if we play M > N  times in the same time T with a smaller payoff b < a? Of course the stochastic process change, but some of the properties remain unchanged under proper choices of a and b . Obviously the average gain of this new game is also zero. What about the RMS? Note that if we take a2 = T/N or b2 = T/M then we have

            RMS(N) = N a2 = T     for the first game
            RMS(M) = M b2 = T     for the second game

    which is independent of the payoff. This fact led some mathematicians early last century to study the asymptotic case a  0 and N →  , BUT taking

            a2 N = T = constant  

    (1)

    which usually called Brownian Motion. The name “Brownian” comes from the botanist Brown who observed how particles of (probably) clay moved in water under the kicks of the molecules of water.

    Bachelier (1870-1946 right) was the first one to study the Brownian motion in his PhD thesis at the Sorbonne in Paris and applied it as a possible model for the stock market. He was well ahead of his time not only for its application to the stock market, but also because he derived a lot of the properties of this stochastic process. Unfortunately his notation was a little bit sloppy. In particular, the dependence of a2 with N and T [given by equation (1) above] in the limit N   was omitted in most of his books and papers but always assumed by Bachelier. This “minor” omission and a careless reading of Bachelier’s work was the origin of Paul Lévy’s strong criticism to his work. It was so strong, that Levy wrote a very critical and negative report about Bachelier’s work when the latter was trying to get an appointment at Dijon. Bachelier of course didn’t get the position and moved then to a small university at Besançon and kept on working without much impact in the field.

    It was after Kolmogorov’s 1931 citation of Bachelier work that Lévy went back to his work and realized that he made a misjudgment of Bachelier’s work. Apparently Levy didn’t even read Bachelier’s papers and books in the very first place and, even so, he disregarded Bachelier’s findings as erroneous. Quite a strange behavior for one of the best mathematicians of all times. After that, in 1931, Lévy wrote to Bachelier a letter apologizing for his behavior. It was a little bit late since Bachelier retired in 1937 although Bachelier was quite happy to receive Lévy’s letter. At last his work was read by someone, and by the best!

    More information
    Biography of Bachelier
    Bachelier and his times: A conversation with Bernard Bru, an article by M.S. Taqqu, published in Mathematical Finance – Bachelier Congress 2000

    Spreading of Viscous Fluid Drops on a Solid Substrate Assisted by Thermal Fluctuations

    Benny Davidovitch, Esteban Moro, and Howard A. Stone
    Physical Review Letters 95, 244505 (2005). [pdf]

    Abstract:
    We study the spreading of viscous drops on a solid substrate, taking into account the effects of thermal fluctuations in the fluid momentum. A nonlinear stochastic lubrication equation is derived and studied using numerical simulations and scaling analysis. We show that asymptotically spreading drops admit self-similar shapes, whose average radii can increase at rates much faster than these predicted by Tanner’s law. We discuss the physical realizability of our results for thin molecular and complex fluid films, and predict that such phenomenon can in principal be observed in various flow geometries.

    Continue reading