Skip to main content

"state of the art in visualization of Internet data"

Archived MagicPoint presentation slides, compiled into a single PDF document.

2000_viz0002.pdf (47 slides, 663 KB)

Slide text transcript

Slide 1: state of the art in visualization of Internet data:

state of the art in visualization of Internet data:

topology, workload, performance and routing

11 January 2000


       scientific apparatus offers a window to knowledge, 
       but as they grow more elaborate, 
       scientists spend ever more time washing the windows. 
                                     -- Isaac Asimov 



kc claffy, UCSD/SDSC/CAIDA
kc@caida.org 
www.caida.org

Slide 2: SDSC visualization research

SDSC visualization research 


SDSC Vislab research:
modeling complex conditions
extending interactions
scalable visualization toolsets

CAIDA research at SDSC:
analyzing characterization criteria
correlating different datasets
visualizing events and trends

Slide 3: CAIDA vis: aiding Internet analyses

CAIDA vis:  aiding Internet analyses


identify and present insights into

topology
historical workload trends
active performance evaluation
routing efficiency

represent complex behavior, including

correlation of large datasets
enabling drill-down

Slide 4: CAIDA vis: Internet visualization

CAIDA vis: Internet visualization 


helping to manage explosive growth throughout the Internet

enhancing ability for providers and users to interpret vast quantities of data in real-time

@@@
creating user-friendly integration with network utilities and control systems

Slide 5: CAIDA vis: challenges

CAIDA vis:  challenges


manage complexity 
enable inter- and intra-ISP analysis and feature detection

develop new methods for data collection, reduction, aggregation, and mining (GByte or Tbyte datasets)

manage data that is geographically and logically distributed as well as dynamically changing

Slide 6: => visualizing topology: overview

=> visualizing topology: overview

        
several methods available

geographical

logical

hyperbolic (T. Munzner,Stanford)

Slide 7: visualizing topology: skitter

visualizing topology:  skitter

macroscopic, infrastructure-wide 
dynamically discover/depict topology (& b/w)
correlate path perf. w events, e.g. BGP
identify critical pieces of infrastructure

Slide 8: skitter: infrastructure-wide measurements

skitter: infrastructure-wide measurements

17 monitors (inc. 1 root name server)
multiple dst lists (29k servers, 36k dns)
architecture:
     - parallel ICMP probes
     - 52-byte packets
     - kernel time stamping
     - ssh / Kerberos

Slide 9: skitter: colored by more countries

skitter: colored by more countries

Slide 10: skitter: AS interconnectivity

skitter: AS interconnectivity

Slide 11: topology vis: geographic mapping

topology vis:  geographic mapping


difficult data analysis
requires mapping of thousands (millions?) of nodes to latitude/longitude coordinates

NetGeo service designed to help
http://netgeo.caida.org

backbones require company-specific heuristics

DNS registry growth is problematic
no common data formats

Slide 12: GTrace: geographic traceroute

GTrace: geographic traceroute
  
www.caida.org/Tools/GTrace/

Slide 13: semi-geographical topology (otter)

semi-geographical topology (otter)

www.caida.org/Tools/Otter

Slide 14

logical vs geographic topology
 

2-dimensional, hierarchical


geographic

Slide 15: skitter: 3D hyperbolic

skitter: 3D hyperbolic 
 
making huge topology graphs navigable

Slide 16: differencing routing tables

differencing routing tables 

www.caida.org/Tools/Mantra (multicast)

Slide 17: topology vis: research priorities

topology vis:  research priorities 


identify & extract features from large, complex datasets:
enable dynamic feature detection 
develop better data aggregation/reduction techniques
create meaningful displays, user-friendly tools
accurately correlate different datasets

obstacles:
     mapping IP addresses to ... anything meaningful

Slide 18: => visualizing Internet workload

=> visualizing Internet workload 

many uses
workload profiling 
performance and QOS assurance across ISPs
accounting/billing

measurement tools
router-based (cflowd, netflow)
stand-alone monitors (coral,skitter)

traditional vis can be helpful
time series, box&whisker, etc.	
3D real-time flow analysis

evolution requires use
envisioning new methods?
better data correlation tools are essential

Slide 19: traffic workload by protocol

traffic workload by protocol

Cichlid animations http://moat.nlanr.net/Cichlid/

Slide 20

traffic workload by protocol 

19 aug 99, ucsd-cerfnet

Slide 21

workload by protocol proportions 

19 aug 99, ucsd-cerfnet

Slide 22: visualizing workload by applications

visualizing workload by applications 
(e.g. ucsd-cerfnet)

Slide 23: multicast workloads (using mantra)

multicast workloads (using mantra)

18 oct 99, fix-west.mbone.nasa.gov

Slide 24: workload vis: research priorities

workload vis:  research priorities 

id and present `useful' workload metrics, particularly given persistence of fire-fighting environment

id significant patterns, timeframes, correlations
vary by user need
change as technologies and 'net change

obstacles:
limited access to commercial networks
network performance impact 
faster speeds and changing transport technologies complicate data acquisition and processing

Slide 25: => performance evaluation vis

=> performance evaluation vis

 
  useful for:

diagnosing problems

verifying service level agreements 

predicting software HCI for network apps

providing Internet weather reports

Slide 26: IPPM visualization

IPPM visualization (ippm-db.advanced.org)

Slide 27: perf. eval: skping (www.freebsd.org)

perf. eval: skping (www.freebsd.org)

Slide 28: perf.eval: routing (path change)

perf.eval: routing (path change)

Slide 29: perf.eval: sktrace (www.cnet.com)

perf.eval: sktrace (www.cnet.com)

Slide 30: performance eval vis: priorities

performance eval vis: priorities 


faster data collection, faster processing, faster rendering

need intuitive graphic presentations correlating:
performance across sources
comparisons w/topology, workload, routing analyses

obstacles
poorly defined user requirements/interfaces
negative perceptions regarding quality and worth driven by explosive growth

Slide 31: => visualizing routing efficiency

=> visualizing routing efficiency



sausage/laws....

how best to identify outages, flaps, critical paths

how best to meet real-time challenges

currently non-intuitive, incantation-driven

Slide 32: routing: example (instability)

routing: example (instability)


RTT data changes color if path changes
10 unique paths over 24 hour period
lots of jitter in data
unlikely to be intentional 
heavy tails predominate

Slide 33: routing: example (load balancing)

routing: example (load balancing)


RTT similar over predominantly 2 paths 
likely intentional load balancing

Slide 34: routing: sktrace (parc.xerox.com)

routing: sktrace (parc.xerox.com)

Slide 35: routing vis: research priorities

routing vis:  research priorities 


better IP routing instrumentation 
real-time analysis without interfering with performance
realistic inter-domain routing models

tasks
identification & vis of flaps, outages, critical paths
correlation with performance problems
analysis of asymmetries and routing instabilities
analysis of effects of unicast/multicast incongruities

Slide 36

routing vis:  research obstacles


routes may change faster than ability to measure or analyze

poorly instrumented infrastructure (new tools needed)

mapping IP address to anything 
  (deja vu)

prudent security dictates inhibiting research

Slide 37: now what?

now what?  


the ideal:
well-instrumented infrastructure 
seamless integration of variety of data sources
important for simulation/prediction
but unlikely for the foreseeable future

a case study (real-life problem)
optimize root name server location
use visualization to correlate data sources / types
use collaborative project to encourage 
       proactive participation 
       (network operators, researchers, others)

Slide 38: => skitter: macroscopic case study

=> skitter: macroscopic case study 

DNS f root server (pv's):  path wingspans 
www.caida.org/Tools/Skitter

Slide 39: skitter: rtt vs hopcount (correlation?)

skitter: rtt vs hopcount (correlation?)

Slide 40: skitter: rtt distribution: tri-modal

skitter: rtt distribution: tri-modal

Slide 41: skitter: rtt vs longitude (light cone)

skitter: rtt vs longitude (light cone)

Slide 42: dispersion among ASes across paths

dispersion among ASes across paths

Slide 43: dispersion among ASes across paths (sdsc)

dispersion among ASes across paths (sdsc)

Slide 44: dispersion among countries across paths

dispersion among countries across paths

Slide 45: Internet visualization summary

Internet visualization summary


CAIDA research pushes boundaries
analysis of complex conditions
management of large datasets
correlation between different datasets
development of timely, insightful visualization 

new CAIDA Vis group will investigate
tool integration
user interface improvements
networked collaborative environments

Slide 46: potential payoffs

potential payoffs


give insights to vendors re next 
   generation hw/sw requirements

calibrate user activity (e.g., moni-
   toring service level agreements) 

provide diagnostic and planning tools 
   for ISPs 

identify and quantify infrastructure 
   dependencies for researchers

Slide 47: www.caida.org/Presentations/

www.caida.org/Presentations/

kc claffy
UCSD/SDSC/CAIDA
kc@caida.org
www.caida.org

Related Objects

See https://catalog.caida.org/media/2000_viz0002/ to explore related objects to this document in the CAIDA Resource Catalog.