Skip to main content

MagicPoint presentation foils

Archived MagicPoint presentation slides, compiled into a single PDF document.

2001_bw0111.pdf (30 slides, 1.7 MB)

Slide text transcript

Slide 1: Internet measurement:

Internet measurement:
state of DeUnion

no aphorism is more frequently repeated...
than that we must ask Nature few questions,
or ideally, one question at a time.
...this view is wholly mistaken.
Nature will best respond to a logically
and artfully thought out questionnaire;
indeed if we ask her a single question, she will often refuse
to answer until some other topic has been dicussed.
             -- perspectives in medicine and biology 1973 sir ronald a fisher


15 nov 01
kc claffy, UCSD/SDSC/CAIDA
kc@caida.org 
www.caida.org

Slide 2: abysmal but unsurprising

abysmal but unsurprising


little capacity to predict, depict, or even measure traffic behavior on current and advanced networks 

few tools to engineer/operate networks or identify traffic anomalies in real time

doesn't stop researchers from building junk

doesn't stop random users from doing random junk (no dearth of activity)

    increasing [opportunity] cost to infrastructure

Slide 3: Internet's resistance to modeling/measurement

Internet's resistance to modeling/measurement

evolution-based (good!) reasons
protocols, technologies, applications
independently developed and deployed
by no means synergistic  
by all accounts rapid
`punctuated' but no equilibrium
"have done fine without modeling so far"
(let's wait till modeling is cheaper than bandwidth)
but simulation/analysis validation
(& lately other stuff) needs data
right granularities hard to come by
measurement technology just not there
argument for it also not there
"helps everyone", but who pays?
losing battle?

Slide 4

Internet's resistance to measurement
                
many would benefit
vendors, users, researchers, ISPs
          
ISPs would bear cost   
multiple media: atm, pos, dwdm, mpls
logistics/management  
privacy implications
analysis/research obsolete after (before) done
                
   ........how to justify/accomplish measurement?
              (when market forces are torqued)

alternatives:
 1) tools that positively affect an ISP's balance sheet
 2) regulatory intervention

Slide 5: payoffs

payoffs


insights for vendors re next generation hw/sw requirement

calibration for users, e.g., monitoring service level agreements

diagnostic and planning tools for ISPs 

windows into the infrastructure for researchers

Slide 6

measurement tools lack

                
well-defined traffic metrics
e.g supporting SLAs or billing
uniformly applied methodologies
varied topologies, equipment, ISP practices
clear definition of measurement hypotheses or goals
measurement scalability
ability to explain phenomena
topology changes, routing loops, black holes
relevance to actual ISP problems or mechanisms for fixing
communication of useful results

Slide 7: so what happened instead

so what happened instead


andrew odlyzko's excellent "myth of Internet growth" study (nov 2000)

'traffic doubling every 90 days'
maybe for a few months in 1995-1996
in reality, no real data since 1995 (nsfnet sunset)
more like every 12-18 months for rest of 1990s
financial markets (at least in US) believed (bubbly!) estimates
over 6 years, that means a factor of 16 million
assume (generously) 500M users, 1.5Mbps per user around the clock
and yet we're mostly still using 28k modems, & only for an hr/day, and ave 5k bits/sec then
the math just does not work out
it took 5 years for true traffic growth data to finally manifest itself 
                (since providers would not release data, if they even had it) 
         via other metrics (hardware and bandwidth sales)

that's actually an embarrassingly pathetic willingness 
to just ignore real data (or just invent it)

Slide 8: Internet traffic growth myths (cont.)

Internet traffic growth myths (cont.)

costs
tech stock bubble?
really takes new technologies a decade to penetrate
web was exception, Internet is not
retarded technical developments
negligence of what users what, and likely to get
community gets mired in sub-necessary QOS hubbub, ATM, GMPLS

benefits 
unparalled platform for innovation
open standards, rapid development of new services
big empty pipes was key factor in supporting revolution

lessons
25 year contracts for pipes should be amortized over 3
capacity planning impossible 
simplify engineering (atm/sonet --> IP over WDM, GigE)

Slide 9: three `waves' of application

three `waves' of application

first wave: shared (remote) use of computers
telnet, email, ftp
second wave: client/server model, formatted lang.
web
third wave:  collaborative, peer-to-peer, interactive
napster, imesh, kazaa, gaming, video

emergence of third wave will require more real-time
interaction with and reaction to network status

the growth of these applications will be self-limiting
(by user frustration with performance) unless we have either:
a better grip on measurement 
either done by the applications themselves (e.g., vat) 
or via some other middleware aspect of the infrastructure)
or no service-affecting queueing anywhere in the network
(seems unlikely)

Slide 10: four areas of measurement

four areas of measurement


workload characterization (passive)

topology (mapping, path dynamics)

performance evaluation (active, passive)

routing (dynamics)

  caida focuses on 
measurement tools
macroscopic analyses
identifying priorities and obstacles

Slide 11: workload characterization

workload characterization


workload profiling (s/w & h/w design, architecture optimizing, capacity planning)
security
performance analysis
delay, loss, jitter?
QOS assurance across ISPs
accounting/billing

tools: netramet, netflow, cflowd, coral
some suck less? ...evolution requires use

Slide 12: workload characterization: priorities

workload characterization: priorities

coral/ocXmons (OC3,12,48,192 gigE)

persistent, real-time, full-frame collection 

dynamic packet filtering triggered by attack precursors 

security policy 
compliance auditing (passive) 
enforcement (active) 

 obstacles
hardware expensive
privacy issues 
IPsec

Slide 13: wkld char. on OC48 backbone link: hardware

wkld char. on OC48 backbone link: hardware


DAG - PCI network monitoring cards:

project at University of Waikato (New Zealand) computer science department
http://dag.cs.waikato.ac.nz/

DAG4 card - ATM and PoS capture at OC48c 
2.5 GBit/sec link rate
exceeds PCI bus bandwidth, requires filtering & compression
provides highly accurate timestamping
timestamp sync across boards available via cable and GPS

CAIDA/U.Waikato collaboration (subcontract)

Slide 14: workload on OC48 backbone link: data

workload on OC48 backbone link: data

data collected by CAIDA and Waikato Applied Network Dynamics group
http://wand.cs.waikato.ac.nz/

collected on OC48 commercial backbone link in San Jose, CA

oc48mon2 link, one direction only

duration: 76 minutes total
20:00 - 21:16 (PDT), 5 Aug 2001

volume: 32 GB of data

Slide 15: workload on OC48 backbone link: analysis

workload on OC48 backbone link: analysis


use coralreef software suite
http://www.caida.org/tools/measurement/coralreef/
public domain

obtain quantitative parameters of captured traffic:
packet and byte rates 
flows (src IP, src port, dest IP, dest port, protocol)

use NetGeo tool to map src/dst IP addresses to ASes and countries
http://www.caida.org/tools/utilities/netgeo/

consider various aggregations of traffic: 
applications
ASes
countries

Slide 16: results: application characteristics

results: application characteristics 



 - well-known applications are determined from port numbers
 - plot distributions of bytes, packets, and flows by applications

Slide 17: results: traffic by (top 10) applications

results: traffic by (top 10) applications 

by bytes
www (79%)
unclassified TCP (4%)
kazaa - peer-to-peer file sharing system (for music) (2%)
1% or less:
nntp - netnews; unclassified UDP; realaudio, smtp, napster, gnutella, ftp
by packets
www (67%)
unclassified TCP/UDP (7%/3%)
1% or less:
Halflife (game); ICMP (e.g., ping - small pkts); smtp; kazaa (large pkts), dns, realaudio, Starcraft (game)
by flows
www (69%)
ICMP (16%)
dns (3%)
unclassified TCP (2%)
asherons (game) (2%)
1% or less:
smtp, unclassified UDP, ftp, https (secure http), Halflife

Slide 18: workload char.: new killer application?

workload char.: new killer application?

compare to university commodity link (UCSD)
3-12 nov 01 (post-napster.demise)

Slide 19: results: flow interval spacing

results: flow interval spacing


- packet interarrival time is _always_ less than 2s
- 4s timeout sufficient to capture most application flows
- cannot do any such analysis via sampling

Slide 20: results: traffic by countries/continents

results: traffic by countries/continents

distribution of bytes, packets, flows by source and destination countries

top source countries: 
US - 1st by bytes and packets, 2nd by flows
Japan - 1st by flows, 2nd by bytes and packets
also: Canada, United Kingdom, Hong Kong, Denmark (but: 10 times less bytes, packets, or flows than US or JP)

top destination countries:
Korea, US, China
also: Japan, New Zealand, Taiwan, Australia

suggests that routing policy for this link
directs most traffic to asia

Slide 21: results: 3D traffic matrices

results: 3D traffic matrices 


source/destinations pairs aggregated by continents/regions

3D plots use XRT-based tool 
http://www.caida.org/tools/utilities/graphing/graph_xrt3d.xml
x/y axes - source/destination locations
z-axis - log scale traffic volume (bytes, packets, or flows)
examples follow (byte volume)

Slide 22: results: 3D traffic matrices

results: 3D traffic matrices 

peaks: asia/n.amer/eur. -> asia; n.amer/eur. -> n.amer; n.amer/eur -> oceania

Slide 23: results: 3D traffic matrices

results: 3D traffic matrices 

continents-to-asia: primary srcs - n.amer/asia/europe
     primary dsts - south korea, china, japan, taiwan

east asia locations are primary traffic destinations for this link

Slide 24: results: 3D traffic matrices (continents)

results: 3D traffic matrices  (continents)

continents-to-africa: only 3 countries receive (little) traffic: .za, .eg, .bw

Slide 25: results: anomalies

results: anomalies


we see the following unexpected traffic at our measurement location in San Jose, California, US:

asia to asia - rather significant amount

.uk to .eg  (egypt)
.uk to .tu  (turkey)
.fr to .fr and .uk to .uk
.se (sweden) to .es (spain)

Slide 26: results: anomalies (example)

results: anomalies (example)



significant amount of asia-to-asia traffic passes through San Jose!
     includes even same country traffic (e.g., .jp->.jp, .tw->.tw)

Slide 27: workload on OC48 backbone link: conclusions

workload on OC48 backbone link: conclusions


we infer applications using port mappings
dominated by WWW, filesharing, and gaming applications

flow duration
relatively short timeouts (i.e. 4 seconds) may be adequate 

traffic destinations
dominated by East Asia (KR, TW, CN, JP) and US

significant traffic `anomalies' through San Jose
western europe - western Europe
eastern asia - eastern asia
likely typical, routing policies a function of economic and regulatory realities

Slide 28: workload on OC48 backbone link: conclusions

workload on OC48 backbone link: conclusions


unique
first and only OC48 flow monitor worldwide
caida's public tools analyze data without modification

software implemented  
CoralReef, NeTraMet, custom routines (CAIDA)
custom routines by U. of Waikato, others
darpa/nsf/caida members funded

software, data analysis, viz tools all prototypes 

backbone core now needs oc192/oc768 monitoring

Slide 29: now what?

now what?


`seamless': no such thing 
measurement tools/infrastructure 
well-considered
strategically deployed
collaboratively maintained 
more infrastructure-relevant research on resulting data
feedback into tool design 
correlation among data sources/types, simulation, visualization 
proactive participation
top-down (app developers scope constraints) 
bottom-up (ISP cooperation)

Slide 30: www.caida.org/publications/presentations/

www.caida.org/publications/presentations/


kc claffy
UCSD/SDSC/CAIDA
kc@caida.org
www.caida.org

Related Objects

See https://catalog.caida.org/media/2001_bw0111/ to explore related objects to this document in the CAIDA Resource Catalog.