Description of Tables
This page describes the numerous analysis tables provided for each dataset. For illustrative purposes, we include the tables of the D04 [Backbone 1, 2002-08-14], bytes dataset on this page.
In the general design of the tables, a statistic is provided for all four possible combinations of link direction and source/destination endpoint (except for D08 [Backbone 1, 2003-05-07], which has only one direction). We use geographic orientation to label the direction of the backbone links rather than using the arbitrary (and not very informative) designations of 0 and 1. The northbound ("north") direction of both Backbone 1 and Backbone 2 travels from San Jose, CA to Seattle, WA. For the university link, we use inbound and outbound.
Object Diversity at Specific Traffic Percentiles
These tables show the percentage of objects that together contribute a given percentage of the total traffic. For example, we can see that 7.6% of the northbound source IP addresses contribute 99% of the total traffic. The values in the second column under each direction heading give the minimum volume of traffic (in either bytes or packets, depending on which set of tables is being viewed) contributed by each object. In the case of the previous example, each of the 7.6% of objects contribute at least 77k bytes.
See "Number of Objects Responsible for Specific Traffic Percentiles" for the object counts rather than object percentages. See "Traffic by IP/Prefix/Atom/AS" for the object diversity data grouped by object type rather than by traffic percentage.
|
|||||||||||||||||||||||||||||||||||||||||||||
Object Diversity of 99% of the Traffic |
|
|||||||||||||||||||||||||||||||||||||||||||||
Object Diversity of 95% of the Traffic |
|
|||||||||||||||||||||||||||||||||||||||||||||
Object Diversity of 90% of the Traffic |
|
|||||||||||||||||||||||||||||||||||||||||||||
Object Diversity of 50% of the Traffic |
Number of Objects Responsible for Specific Traffic Percentiles
These tables show the number of objects that together contribute a given percentage of the total traffic. For example, we can see that 21,816 of the northbound source IP addresses contribute 95% of the total traffic. See "Object Diversity at Specific Traffic Percentiles" for the object percentages rather than counts.
|
|||||||||||||||||||||||||
Number of Objects Responsible for 95% of the Traffic |
|
|||||||||||||||||||||||||
Number of Objects Responsible for 50% of the Traffic |
Number of Objects that Individually Contribute 1% of the Traffic
This table shows the number of objects that individually contribute at least 1% of the total traffic. Note that some of these objects contribute far more than 1% of the traffic, which is made clear by the earlier table titled "Number of Objects Responsible for 50% of the Traffic." For instance, in that table, only 4 ASes are responsible for 50% of the traffic in the (north, src) column. Hence, there must be a single AS that contributes at least 12.5% of the traffic.
Note that the counts in a particular column may decrease in the progression from IP addresses to prefixes to atoms to ASes. For example, in the first column (north, src), the prefix count is 15 while the AS count is 14. This can happen when two prefixes, each contributing at least 1% of the traffic, belong to the same AS. The atom count can be higher than the AS count for a similar reason; in this case, two prefixes belonging to the same AS are put into two different atoms (it is not abnormal for the prefixes belonging to a single AS to be assigned to different atoms).
|
|||||||||||||||||||||||||
Number of Objects that Individually Contribute 1% of the Traffic |
Traffic by IP/Prefix/Atom/AS
These tables show the object diversity at specific traffic percentiles in the same manner as "Object Diversity at Specific Traffic Percentiles", except with the tables grouped by object type rather than by traffic percentage.
|
|||||||||||||||||||||||||||||||||||||||||||||
Traffic by IP |
|
|||||||||||||||||||||||||||||||||||||||||||||
Traffic by Prefix |
|
|||||||||||||||||||||||||||||||||||||||||||||
Traffic by Atom |
|
|||||||||||||||||||||||||||||||||||||||||||||
Traffic by AS |
Crossover
These tables describe several properties of the crossovers present in the datasets. By definition, the crossover split (100 - C)%/C% occurs when C% of the largest objects (by traffic volume) contribute (100 - C)% of the total traffic. For example, a split of 90%/10% would mean just 10% of the objects contribute 90% of the traffic. Although the crossover must exist, it rarely happens that C% of objects contribute exactly (100 - C)% of the traffic. Because object counts are discrete values with a limited range, it is more typical for C% of objects to contribute (100 - C + delta)%, for some small delta. In our tables, we list the C% that produces the smallest absolute delta. For all our datasets, the sum of the chosen object percentage (C%) and the traffic volume percentage [approx. (100 - C)%] is within 1% of 100%.
For the crossover split (100 - C)%/C%, the C% of largest objects (by traffic volume) are the elephants, and the remaining (100 - C)% are the mice. The traffic cutoff at the crossover is the minimum volume of traffic (in either bytes or packets) contributed by each of the elephants. In the tables showing the aggregate traffic volume of elephants and mice, percentages are with respect to the total traffic volume.
|
|||||||||||||||||||||||||
Crossover Split (volume%/object%) |
|
|||||||||||||||||||||||||
Traffic Cutoff at the Crossover (i.e., Minimum Size of Elephants) |
|
|||||||||||||||||||||||||||||||||||||||||||||
Number of Elephants |
|
|||||||||||||||||||||||||||||||||||||||||||||
Total Traffic Volume of Elephants |
|
|||||||||||||||||||||||||||||||||||||||||||||
Total Traffic Volume of Mice (computed from volume of elephants, and thus may be slightly imprecise) |
Geographic Distribution of Traffic
These tables show the breakdown of the traffic by continent. The percentages add up to 100% in each column. The actual volume of traffic (in bytes or packets) appears in the second column below each direction heading.
Traffic is aggregated by continent with the following procedure:
- Match a source or destination IP address to an address block issued by the Regional Internet Registeries (namely, ARIN, RIPE, APNIC, and LACNIC).
- Extract the country recorded in the registration record of the address block.
- Map the country to continent.
- Add up all traffic volumes by continent.
The above procedure has known limitations. The most important being that the registration record merely provides the contact address of the organization that has obtained an address block. This contact address could simply be the headquarters of a multi-national organization (e.g., a Tier-1 ISP) while the actual hosts are spread worldwide, leading to improper placement of hosts. Registration records are also sometimes out-of-date.
Registry data collected on the 1st and 26th of June, 2003, are used to compute the geographic breakdown for all datasets, including those from 2002. For the country-to-continent mapping, we use data from NetGeo along with a few manual additions.
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Geographic Distribution of Traffic |
|
|||||||||||||||||||||||||||||||||||||||||||||
Geographic Distribution of Traffic (brief) |
|
||||||||||||||||||||||||||||||||||||
Geographic Distribution of Traffic (briefer) |