The Macroscopic Internet Topology Data Kit (ITDK)

| 
|
|
CAIDA's Macroscopic Internet Topology Data Kit (ITDK) contains
data for researchers about
connectivity, routing and latency gathered from a large cross-section of the
global Internet.
| 
|

|
Request to download ITDK
ITDK data can be requested for download by filling in a CAIDA Skitter Data Request Form.
Data Collection Interval
- Kit #0204: Apr 1-16, 2002
- Kit #0304: Apr 21 to May 8, 2003
skitter Monitors
The set of
active skitter monitors changes over time.
- Kit #0204: contains data from 21 monitors.
- Kit #0304: contains data from 23 monitors.
Probe Lists
Over several years, CAIDA has developed and managed lists of hosts to
probe using our lightweight traceroute tool
skitter.
CAIDA seeks to maximize
address space coverage (1 probe per /24 in the IPv4 space)
while remaining sensitive to do-not-probe requests
from particular hosts. Two of CAIDA's major probe lists are 1) the IPv4
list, containing destinations spread over BGP-routed IP
space and /24 blocks;
and 2) the DNS list, containing destinations collected from
queries to the DNS root server. There are several different sized versions
of the IPv4 list to accomodate the varying capacity of the skitter
monitors.
- Kit #0204:
- IPv4: 133k, 301k, or 825k addresses
- DNS: 143k addresses
- Kit #0204:
- IPv4: 147k, 366k, or 865k addresses
- DNS: 147k addresses (different than the 147k IPv4 addresses)
ITDK #0204 Directory Structure
There are several directories containing different
categories of data. README files in these directories describe files and file formats in more detail. The master processing
sequence resides in SKI/procSki.
-
CAIDA's skitter-related Data
| Directory | Contents |
| SKI | CAIDA's parsing scripts, C programs and
command sequences for skitter data analysis, |
| SKB0204 | measurement data from 21 skitter monitors
aggregated on a daily basis. |
| SKI0204 | merged datasets, including IP and AS graphs, their
cores, distance and reachability statistics, and the scripts and commands
used to generate them. |
| IPC0204 | IP addresses with their counts, valid/private
tags and all observed hop distances from each monitor. April 01 data
is given as an example of parsing done using a script from SKI
(above). |
| LNK0204 | traceroute links, including placeholder nodes
(non-replies and private addresses in the middle of a path). |
| RTT0204 | per-detination and aggregated statistics about
replies and RTTs. |
-
CAIDA's iffinder-related Data
| Directory | Contents |
| IFF0204 | original iffinder file, links, connected components
(sets of IP interfaces) and parsing scripts. |
-
U Oregon's RouteViews BGP tables
| Directory | Contents |
| BGP | parsing scripts, C programs and command sequences,
(including a script for analyzing RouteViews tables,
a program for converting IP addresses to prefixes, and
scripts for abstracting data into BGP atoms.. |
| BGD | Oregon Route Views raw ("show ip bgp")
tables for April 1, 15, and 16, 2002. |
| BGD020401P35 | processed BGP tables, prefix tables with origin
ASes and statistics, and BGP atoms [3] for April 1, 12:00 Route Views
data (containing 36 full-size tables). |
| BGD020415P36 | processed BGP tables, prefix tables with
origin ASes and statistics and BGP atoms for April 15, 12:00
Route Views data (containing 36 full-size tables). |
-
Domain Names
| Directory | Contents |
| SKI0204 | all DNS names found for IP addresses and for
well-connected nodes (having an outdegree of 100 or more). |
ITDK #0304 Directory Structure
There are several directories containing different categories of data.
README files in these directories describe files and file formats in
more detail.
-
CAIDA's skitter-related Data
| Directory | Contents |
| SKI | CAIDA's parsing scripts, C programs and
command sequences for skitter data analysis. |
| SKB0304 | traceroute data from 23 skitter monitors
aggregated into daily files. |
| SKI0304 | merged datasets, including IP and AS graphs, their
cores, distance and reachability statistics, and the scripts and commands
used to generate them. |
| IPC0304 | statistics on IP addresses, including their
frequency, validity, and observed hop distances from the monitor. |
| LNK0304 | IP links, including links involving placeholder
nodes (which represent non-replies and private addresses in the middle
of a path). |
-
CAIDA's iffinder-related Data
| Directory | Contents |
| IFF0304 | original iffinder output file |
-
U Oregon's RouteViews BGP tables
| Directory | Contents |
| BGP | BGP table analysis scripts |
| BGD030508P39 | processed BGP tables, prefix tables with
origin ASes and statistics, and BGP atoms for May 8, 12:00 PDT,
Route Views data (containing 39 full-size tables). |
-
Domain Names
| Directory | Contents |
| SKI0304 | all DNS names that could be found for all IP
addresses included in this ITDK. |
Bibliography
- A. Broido, kc claffy. Internet topology: Connectivity of IP graphs.
SPIE Conference on Scalability and Traffic Control in IP Networks, Denver,
August, 2001.
http://www.caida.org/publications/papers/2001/OSD/.
- B. Huffaker, A. Broido, kc claffy, M. Fomenkov, K. Keys, Y. Hyun, D.
Moore. Skitter AS Internet Graph. CAIDA, April 2002.
http://www.caida.org/research/topology/as_core_network/.
- A. Broido, kc claffy. Complexity of global routing policies.
http://www.caida.org/publications/papers/2001/CGR/.
Attribution
Please identify this data resource as
- "CAIDA Macroscopic IP Topology Data Kit (ITDK) #0204,
provided to the Network Modeling and Simulation (NMS) community under
DARPA grant N66001-01-1-8909." Cooperative Association for Internet Data
Analysis, San Diego Supercomputer Center (SDSC), University of California,
San Diego (UCSD), 2002.
- "CAIDA's Internet Topology Data Kit #0304." Cooperative
Association for Internet Data Analysis, San Diego Supercomputer Center
(SDSC), University of California, San Diego (UCSD), 2003.
|
|