In the engineering point of view, the self-similarity means that the traffic had similar statistical properties at a range of timescales: milliseconds, seconds, minutes, hours, even days and weeks. The merging of traffic streams, as in a statistical multiplexer, does not result in smoothing of traffic, in other words, burst data streams that are multiplexed tend to produce a bursty aggregate stream. Correspondingly, this traffic property can be described in the mathematical terms that the traffic has a long range dependence, which can be characterized that the correlation function of the traffic process is a heavy-tailed distribution.
A distribution is heavy-tailed if
![\begin{displaymath}
P [X \gt x] \sim x ^{-\alpha} \end{displaymath}](img10.gif)

, k>0, and
. Its distribution function has the form
![\begin{displaymath}
F (x) = P [X \leq{}x] = 1-{(k/x)}^{\alpha} \end{displaymath}](img16.gif)
The parameter k represents the smallest possible value of the random variable.
, the distribution has infinite variance.
, then the distribution has also infinite mean.
Thus, as depicted in Figure 1, with
decreases, a large portion of the probability mass is present
in the tail of the distribution. In practical terms, a random variable that
follows a heavy-tailed distribution can be extremely large with non negligible
probability.
The strict mathematics indicates that there are self-similar process that
are not long-range dependent, and vice versa. However, by the restriction
in the definition, self-similarity implies long-range dependence,
and vice versa.