MagicPoint presentation foils

Archived MagicPoint presentation slides, compiled into a single PDF document.

2001_myths2002.pdf (51 slides, 3.9 MB)

Slide text transcript

Slide 1: internet measurement

internet measurement:
myths about Internet data

it ain't the things you don't know that hurt you, 
it's the things you know that ain't so.
- mark twain



july 2002 
ucsd/sdsc/caida
kc@caida.org 
http://www.caida.org/publications/presentations/

Slide 2: what i mean by `myth'

what i mean by `myth'


if you google for "Internet myths", you'll get lots of figments about Internet marketing/sociology, like
it's cheap to do business on the web.
advertising is flocking to the web in record numbers and will be its savior.
you can give away the merchandise as long as you generate enough eyeballs because one day you will monetize those eyeballs.
if you have a clever URL, they will come.
people will never pay for content over the web.
traditional advertising brings eyeballs which generates much traffic.
people like to shop on the web ( <-- that's a good one).
it costs nothing to get a site up and running.
the web is a reliable commercial activity.
just you wait, profitability is right around the corner.
http://www.thestreet.com/comment/wrongtactics/786636.html

these are not `myths' since noone actually believes them
these are called fantasies
people want them to be true... (or more sustaining)
get return for convincing someone they're true

myths: things people actually believe but that are wrong

Slide 3: fantasies vs myths

fantasies vs myths


fantasies
who believes:
marketing, advertising people, lawyers, consultant (consenting) adults
who gets hurt:
marketing, advertising people?   (no comment..)

myths
who believes:
researchers, vendors, policymakers, journalists, secretary of defense
potentially: marketing, advertising people, lawyers, consultant (consenting) adults
who gets hurt:
packets (dropped)
engineers (paged) 
protocol developers (in worst case they invent stuff like atm, mpls)
grad students (useless dissertations, sub-employability, lost decades of youth)
economy (irrational speculation in capital markets -> global recession)

Slide 4: Internet myths relevant to engineering

Internet myths relevant to engineering

workload: (besides basic traffic growth fiction)
level and nature of fragmented traffic
increase in flows as bandwidth grows
mice vs elephants
private addresses in core
prevalence of encrypted passwords
applications can be identified (much less controlled)
multicast traffic, flows, addressing
performance:
DoS attacks affect only large sites
geography not correlated with latency
DNS system performs well
single router can't trash the Internet
topology:
Internet topologies, object sizes follow power laws
routing:
routing tables reflect Internet topology
intra-country traffic stays there
AS path length is decreasing
small providers and multi-homing (more specifics) cause all the churn
why so many?  no real measurement. (so no real surprise...)

Slide 5: Internet's resistance to modeling/measurement

Internet's resistance to modeling/measurement


evolution-based (good!) reasons
protocols, technologies, applications, users
independently developed and deployed/connected
by no means synergistic  
by all accounts rapid
'punctuated' but no equilibrium
"have done fine without modeling so far" (let's wait till modeling is cheaper than bandwidth)

but simulation/analysis validation (& lately engineering/billing/[homeland.]security) needs data
right granularities hard to come by
measurement technology just not there
argument for it also not there
"helps everyone", but who pays?
losing battle?

Slide 6: measurement tools lack (related obstacles)

measurement tools lack (related obstacles)

                
well-defined traffic metrics
e.g supporting SLAs or billing
uniformly applied methodologies
varied topologies, equipment, ISP practices
clear definition of measurement hypotheses or goals
measurement scalability
ability to explain phenomena
topology changes, routing loops, black holes
relevance to actual ISP problems or mechanisms for repair
communication of useful results

Slide 7: Internet's resistance to measurement

Internet's resistance to measurement

                
many would benefit
vendors, users, researchers, ISPs

ISPs would bear cost   
multiple media: atm, pos, dwdm, mpls
logistics/management  
privacy implications
analysis/research obsolete after (before) done

........how to justify/accomplish measurement?
(when market forces are detrimentally torqued)

alternatives:
1) tools that positively affect an ISP's balance sheet
2) regulatory intervention

Slide 8: what happened instead of measurement

what happened instead of measurement


a. odlyzko's "myth of Internet growth" study (nov 2000)
plus great assessment (...) of larry roberts 2001 caspian.goo 
(see also his recent lightreading/other statements)
'traffic doubling every 90 days'
maybe for a few months in 1995-1996
in reality, no real data since 1995 (nsfnet sunset)
more like every 12-18 months for rest of 1990s
financial markets (at least in US) believed (bubbly!) estimates
over 6 years, that would have meant a factor of 16 million
assume (generously) 500M users, 1.5Mbps per user around the clock
and yet we're mostly still using 28k modems, & only for an hr/day, & ave 5k bits/sec 
come again?  the math just does not work out 
it took 5 years for true traffic growth data to finally manifest itself
(since providers would not release data, if they even had it) via other metrics (hardware and bandwidth sales) required in annual reports to SEC (closest we have to an Internet Measurement Commission)


that's actually an embarrassingly pathetic willingness 
to ignore real data (or just invent it)

Slide 9: living in a mythical world: tradeoffs

living in a mythical world: tradeoffs

costs
tech stock bubble? (hey infinite demand is infinite jnpr stock price)
really takes new technologies a decade to penetrate
web was exception (when it was young/free), Internet is not
retarded technical developments
negligence of what users what, and likely to get
community gets mired in sub-necessary QOS hubbub, ATM, GMPLS
benefits 
unparalled platform for innovation
open standards, rapid development of new services
big empty pipes was key factor in supporting [r]evolution
pipes wouldn't be empty for grad students (napster,kazaa) if the myths had been true
lessons
25 year contracts for pipes should be amortized over 3
come to terms with a much looser definition of `capacity planning' 
simplify engineering (atm/sonet --> IP over WDM, GigE.)

(first commandment: Thou Shalt Get Rid of Layer Goo)

Slide 10: living on borrowed time in a mythical world

living on borrowed time in a mythical world


(opportunity costs of not measuring)
three `waves' of Internet applications/usage
first wave: shared (remote) use of computers
telnet, email, ftp
second wave: client/server model, formatted languages
web
third wave:  collaborative, peer-to-peer, interactive
napster, imesh, kazaa, gaming, video



emergence of third wave (`ngi') will require more real-time interaction with and reaction to network status
growth of these applications will be self-limiting (by user frustration with performance) unless we have either:
a better grip on measurement
either done by the applications themselves (e.g., vat) 
or via some other middleware aspect of the infrastructure
or no service-affecting queueing anywhere in the network
seems unlikely, even with lots of empty pipes

Slide 11: four areas of measurement (and thus myths)

four areas of measurement (and thus myths)


workload characterization (passive)

topology (mapping, path dynamics)

performance evaluation (active, passive)

routing (dynamics)


caida focuses on
measurement tools (prototypes) 
macroscopic (or macroscopically relevant) analyses
identifying priorities and obstacles

Slide 12: workload measurement: in the Internet `core'

workload measurement: in the Internet `core' 


(background only)
caida/waikato coral+dag oc48mon
first and only OC48 flow monitor worldwide
caida's public software analyzes data without modification

software implemented  
CoralReef, NeTraMet, custom routines (CAIDA)
other custom/enhanced routines by U. of Waikato, others
darpa/nsf/caida members funded

software, data analysis, viz tools all prototypes 
commercial spinoff for the cards (www.endace.co.nz)

but btw backbone core now needs oc192/oc768 monitoring 
there's not yet even such thing as a 24 hour trace for an oc48 link

Slide 13: workload measurement: in the Internet `core'

workload measurement: in the Internet `core' 


(background only: dag oc48 capture card)
current oc48mon system (prototype at MFN in SJC, subc/collab. w U.Waikato)
captures 1M packets/sec to disk (40% util. link)
provides highly accurate timestamping
.5Mp, 1Gbps (125MB/sec) each direction
avg pkt size 370, 590 bytes (210k, 340k pkts/sec)
64 bytes/record -> 6-9X compression over link load
problems: bursts of small packets cause machine thrash
http://dag.cs.waikato.ac.nz/

upgrading oc48mon this qtr to house (bigger) Dag4.10 cards 
dual-Pentium (Intel) processor on tyan S2510 
1GB of RAM
floppy, cdrom 
IDE/ATA disk drive (40GB min)
6 SCSI Ultra/160 disks, 3/each SCSI channel each 18GB min
4U rack mountable chassis 

this will get us One Hour (and just barely, and ~50GB)
(MFN SJC 76 min @20:00 PDT 5 aug 2001 ==> 32GB)

Slide 14: workload measurement: in the Internet `core'

workload measurement: in the Internet `core' 


(background only: workload data analysis)
use coralreef software suite
http://www.caida.org/tools/measurement/coralreef/

obtain quantitative parameters of captured traffic:
byte rates and packet rates
flow = {src ip, src port, dest ip, dest port, protocol}

use netgeo tool to map src/dst ip addresses to ASes or countries
http://www.caida.org/tools/utilities/netgeo/

consider various aggregations of traffic: 
applications
ASes
countries

Slide 15: workload myth: mice vs elephants

workload myth: mice vs elephants


myth:
10% of flows cause 90% of traffic on a link (or pick x%,y%)
data:  
sometimes true for bytes 
especially if the link has KaZaa-type stuff
never true for packets 
in any traces we've studied
actual proportion of traffic (bytes or packets) covered by 90% of 
               streams can change rapidly
following changes in the applications/protocol mix
obviously depends on site as well
but also per site changes over course of day, week, month



need to measure proportions before making
assumptions need longer traces!

Slide 16: workload: mice vs elephants

workload: mice vs elephants

longer traces are essential
two modes of Internet usage (interactive, downloads)
boundary between modes is ~300 packets (0.5 MBytes)
most flows on the left (by far),  most packets on the right (by far)	
for a 24 hour (sd) flow table, 4.7% packets are in still-active flows
50% packets are in flows with > 8192 pkts; max. flow: 9M pkts. max. active flow: 5M packets.

upshot: do not analyze flow sizes with less than 24 hours of data

Slide 17: workload: mice vs elephants

workload: mice vs elephants

longer traces are essential
for a 48 min (sjc) packet trace (15GB), 22% packets are in still-active flows
50% packets are in flows with > 1024 pkts
for 3 min (sjc) trace, 70% packets in still-active flows
for each 2X in sample duration, 2X in max of pkt/flow
convergence nowhere in sight

repeat do not analyze flow sizes with less than 24 hours of data

Slide 18: workload: mice vs elephants

workload: mice vs elephants

(generally, we do not yet know what we're talking about)

but we know not to analyze Internet flow sizes with less than 24 hours of data
btw, nobody has 24 hours worth of useful core packet trace data (we're $nMs away)

Slide 19: workload myths: prevalence of IP fragmentation

workload myths: prevalence of IP fragmentation

myth:
there is no fragmented traffic
data:
while it is true that overall only a small amount (1-2%) of Internet traffic is fragmented, fragmented traffic can surge to 8% by packets and 10% by bytes in some hour-long periods.
Some protocols, for example IGMP, have fragmented traffic far exceeding non-fragmented traffic.


myth:
fragmented traffic exists only on LANs
data:
we've monitored aggregated exchange points and backbone links.

Slide 20: workload myths: prevalence of IP fragmentation

workload myths: prevalence of IP fragmentation

myth:
tcp traffic is never fragmented
data:
while tcp traffic is fragmented much less frequently than other protocols due to path MTU discovery, we saw 0.009% by packets (0.019% by bytes) of fragmented tcp traffic.
and a majority of fragmented tunneled traffic is tcp!


myth:
nfs causes all (or almost all) fragmented traffic
data:
microsoft's media player was single largest cause (52%) of measured fragmented traffic.  Tunneled traffic, also a major cause, accounts for 16% of all fragment series.  nfs caused 0.1% of all monitored wide-area fragmented traffic.
tunneled traffic (ipencap, ipip, gre, udp l2tp), icmp, and realmedia all cause more fragmented traffic than nfs (0.1%).

Slide 21: workload myth: private (rfc1918) addresses

workload myth: private (rfc1918) addresses 

myth:
private addresses do not appear in the core
data:
private addresses appear all over the place including (consistently) in queries to root name servers as do multicast and other `shouldn't be seen' junk
Broido's 1st Law:
`what should not be seen in the Internet will appear 1% of the time'

Slide 22: workload: prevalence of encrypted passwords

workload: prevalence of encrypted passwords

myth:
unencrypted (cleartext) passwords are mostly gone 
data:
most unencrypted passwords are from one source: POP 
why aren't folks using APOP? (authentication already provided)
mere existence of an encryption technology is no guarantee of its adoption

Slide 23: workload myths: p2p file sharing

workload myths: p2p file sharing 

myth:
the US government can stop file sharing 
(or `legislation takes less time to write than code [v.t.]') 
data:
napster: red
kazaa (fasttrack): orange (post-federal.judge napster.shutdown decision)
admit it's in the fantasy category, myth might be `currently no killer app'
...in an expanding system, such as a growing organism,
freedom to change the pattern of performance is one of
the intrinsic properties of the organism itself...

Slide 24: workload myths: killer apps

workload myths: killer apps

myth:
we're in between killer apps
data 
how do you know when something is a `killer app'?
when every university tries to stop it and _can't_.  _that's_ how you know 
it's a killer app.  that it takes a federal judge to threaten to put you in jail
if you don't stop.  _that_'s how you know it's a killer app!
-eric schmidt, keynote for dns navigation workshop, july 2001

Slide 25: workload myths: code vs law

workload myths: code vs law

myth:
we can handle the few bad seeds with legislation 
data
not just a few huge packets sneaking in
note similarity to gopher/web transition (patent/port# control)
(not that anyone would know via measurement... ask Internet historian)
code seems to get written faster than legislation

Slide 26: workload myths: code vs law

workload myths: code vs law

myth:
govt can stop file sharing/no killerapp
data
not just a few punk users
compare how different apps affect network...  especially bytes vs. tuples.
gnutella/fasttrack: both big flows; fasttrack (kazaa): lot more connections
uh, `no killer app'?  
(what, still waiting for multicast?)

Slide 27: workload myth: multicast traffic

workload myth: multicast traffic

(background only)
month-long multicast flow monitoring project
passively monitor OC-12 links connecting backbone to peer multicast networks and to customer aggregation switches
4M flows, 12G packets, 11T bytes

Slide 28: workload myth: multicast flows

workload myth: multicast flows 

myth
multicast flows are long-lived
data:
75% of flows were a single packet
majority of flows short-lived 
filter out single packet flows and multicast control packets to arrive at "representative" multicast traffic
filtering preserves 99% of the packet and byte counts but only 14% of the flows

Slide 29: workload myth: multicast flows

workload myth: multicast flows 

myth
multicast flows consist of small packets and are never fragmented
data:
strong packet size modes at 1480 bytes (presumably to avoid fragmentation) 
0.5% of the packets were fragmented
3% of the packets had the 'don't fragment' flag

Slide 30: workload myth: multicast addressing

workload myth: multicast addressing

myth
multicast addressing is efficiently allocated and assigned
data:
psychological disposition to use beginning of each /8 multicast block 
reserved IANA blocks indiscriminately used 
improper use of GLOP and EGLOP space

Slide 31: workload myth: multicast technology

workload myth: multicast technology

myth
multiple source multicast is popular and architecturally fundamental

data:
only 1% of the groups observed ever had multiple simultaneous sources 
largest number of simultaneous sources was 33 (Access Grid)

Slide 32: performance myth: DoS attacks

performance myth: DoS attacks

myth:
flooding DoS attacks (randomly spoof src, main attack type)
only affect large commercial sites, are long in duration
and at extremely high rates
data:
>12,000 attacks against >5,000 targets in 3 weeks
~20-60 attacks occuring at all times
80% of attacks last less than an hour, a few lasted 3 weeks
70% of attacks <1,000 pps, some over 600,000 pps
10-20% of attacks to home machines (cable, dsl, dialup)
5% of attacks target infrastructure (routers, dns servers)
(usenix 2001, david,colleen@caida, stefan,geoff@ucsd)

Slide 33: performance myth: DoS attacks

performance myth: DoS attacks

myth:
flooding DoS attacks (randomly spoof src, main attack type)
only affect large commercial sites, are long in duration
and at extremely high rates
data:
romania and brazil have disproportionate number of infected hosts (victims)
other domains have roughly same ratio of infected/total machines

Slide 34: performance myth: worm spread

performance myth: worm spread

myth:
worm spread
data:
40% of all hosts infected (first round CodeRed) lacked reverse DNS records, so we were unable to determine their hostnames
ISPs providing connectivity to home & small-business users had the most infected hosts
machines maintained by home/small-business users (i.e. less likely to be maintained by a professional sysadmin) are an important aspect of global Internet health

Slide 35: performance myth: geography vs latency

performance myth: geography vs latency

myth:
geography not correlated w/latency
data:
rtt densities from san diego to hosts around world (strong correlation)

Slide 36: performance myth: root DNS performance

performance myth: root DNS performance

myth:
root DNS system performs well
data:
8 of the 13 root servers perform well, so users do not notice the poor performance of the other five (gTLD's do somewhat better)

Slide 37: performance myth: DNS performance

performance myth: DNS performance

myth:
the DNS system performs well
data:
error taxonomy
bogus A queries to root name servers for a few hours at f-root in 2001
A queries ask for the IP address of a hostname 
not supposed to be 'in theory'
malformed A queries were 14% of the load at F.root
guilty: microsoft: Win2k resolver, viruses (win95/98/nt), macOSX resolver
asking for the IP address of an IP address
20% of queries asking for non-existent TLD
lots of internal microsoft names (active directory) 
lots ending in .local, .localhost, .workgroup, .msft, .domain, etc. 
hard to track down, nameservers just relay clients queries 
cannot see back to the actual client that asked the question

Slide 38: performance myth: power of strategic router

performance myth: power of strategic router 

myth:
single router can't trash the Internet (`certainly not by accident')
(hint: just need to trash 13 hosts to effectively trash the Internet)
data (just one example):
microsoft's feb 2001 dns woes 
microsoft's 4 authoritative nameservers visible to world on one subnet 
               (and now all you need is a comma in the wrong place)
misconfigured router upstream of that subnet 
TTL for their names set to 2 hours started timing out of peoples caches
query load at the roots started climbing
microsoft nameservers don't do negative caching
microsoft properties are usually about 6k queries/hour (0%) increased 
                to 25% of the load at f-root 

lesson:
prominent site w/DNS problems affects whole Internet 
cf. 9/11 cnn.com queries to roots were sustainable because of caching 
this only a tiny piece of the root server workload damage found

Slide 39: topology myth: outdegree distributions

topology myth: outdegree distributions

myth:
outdegree distributions follows power law
data: 
distribution follows Weibull far better than power law

Slide 40: topology myth: routing tables and topology

topology myth: routing tables and topology

myth:
routing table data reflects topology
data: 
even best avail. inter-domain routing (BGP) data will not reflect true topology 
(and yet this BGP data is essential to sound macroscopic Internet topology analysis)

Slide 41: topology myth: Internet object sizes

topology myth: Internet object sizes

myth:
Internet object sizes follow power law
data: 
Internet graphs are closer to Weibull than to power functions
P(X>x) = a exp(-(x/b)^c)
decreases faster than power function, slower than exponential

Slide 42: routing myth: intra-country traffic

routing myth: intra-country traffic

myth:
intra-country traffic stays there
data: 
significant asia-to-asia traffic goes thru san jose
includes even same country traffic (e.g., .jp->.jp, .tw->.tw)

Slide 43: routing myth: AS path length is decreasing

routing myth: AS path length is decreasing

myth:
AS path length is decreasing
data:
since 1999, many AS paths have changed either way
average length decreased and increased for many ASes
change in the average AS path length is insignificant

Slide 44: routing myth: AS path length (continued)

routing myth: AS path length (continued)

myth:
AS path length is decreasing
data:
if anything, it's increasing

Slide 45: routing myths: routing behavior

routing myths: routing behavior

myth:
causes of growth & instability of routing system
route table growth exponential
data: 
global prefixes grew 4% may->nov 01;  37% in nov00-01 (RouteViews)



myth:
peering richness is growing (see previous slide)
data: 
link/node ratio (average degree), peering richness, and churn did not significantly change in 2000-2001, although lot of changes within ASes.

Slide 46: routing myths: routing behavior (continued)

routing myths: routing behavior (continued)

myth:
small ISPs & multihoming cause growth and/or churn
data: 
since 1998, the share of /24s has stayed between 57% and 58.5%
number of non-transit multihomed ASes grew from 35% to 37% in 2000-2001, but their share of global routes remained stable at around 30%.
new address announcements & deaggregation of existing prefixes were major sources of new prefixes between nov00-may01
most routing instability (w/drawal/reannounce events) in late 2001 contributed by a few .gov networks, developing country telecoms, & major backbone ISPs. although backbone providers routes are relatively stable on per-prefix basis.
instability caused in part by deaggregated routes leaking out originating AS, and by relatively short-lived transient announcements.  (`small multihomers' contribute negligibly, at least on bi-hourly scale)

Slide 47: Internet myths relevant to engineering

Internet myths relevant to engineering

workload: (besides basic traffic growth fiction)
level and nature of fragmented traffic
increase in flows as bandwidth grows
mice vs elephants
private addresses in core
prevalence of encrypted passwords
applications can be identified (much less controlled)
multicast traffic, flows, addressing
performance:
DoS attacks affect only large sites
geography not correlated with latency
DNS system performs well
single router can't trash the Internet
topology:
Internet topologies, object sizes follow power laws
routing:
routing tables reflect Internet topology
intra-country traffic stays there
AS path length is decreasing
small providers and multi-homing (more specifics) cause all the churn
why so many?  no real data/measurement...

Slide 48: conclusions

conclusions

we shed doubt on (too many) commonly assumed Internet myths
even with use of a number of data sets, 
we as a community have quite low integrity 
in drawing macroscopic inferences


implication: 
the community could make much better use 
of our collective intellectual resources if we could     
validate ideas against a larger variety of empirical data sets
before investing research and development time and energy on
ideas that attempt to affect the infrastructure

Slide 49: now what?

now what?

`seamless' infrastructure: no such thing (right now)
measurement tools/architecture
well-considered
strategically deployed
collaboratively maintained 
more operationally relevant research on resulting data
feedback into tool design 
correlation among data sources/types, simulation, visualization 
proactive participation
top-down (app developers scope constraints) 
bottom-up (ISP cooperation) 
middle-matters-more-than-before (dns, mcast, proxies, nats, tunnels, etc)

Slide 50: www.caida.org/publications/presentations/

k r krishnan, `parametric resampling analysis
or traffic measurements for capacity management',
quote from 1999 itc ip seminar:

"kc said `in the absence of data we just press ahead.'
well, we do better than that; we manufacture data.
but we are too refined to call it that, 
so instead we call it parametric resampling.''


kc 
ucsd/sdsc/caida
kc@caida.org
www.caida.org

http://www.caida.org/publications/presentations/

Slide 51: trilogy of action for scientists

trilogy of action for scientists 

for 'seekers of the larger view'
    [while video loads...]

draw together pieces of science and technology to create a system
whether that system is xerography, telegraphy or steam navigation.
find the economic feasibility for a new technology 
by virtue of a wide grasp of the worlds of man and matter
reach harmony through intuition
by meditating on deep knowledge of the field so as to arrive at a new result
build a model
a simplified representation of the problem, subject to experimental analysis
serve as a science-technologist generalist 
who, many times/yr, extracts the missing point out of a complicated situation
make decisions or help others make decisions
by imaginative interaction w/alternatives calculated as consequent on those decisions
                                                    -- john archibald wheeler

Related Objects

See https://catalog.caida.org/media/2001_myths2002/ to explore related objects to this document in the CAIDA Resource Catalog.