workload myth: most Internet file transfers are porn data: 1.5 % of web sites (not traffic) contain sex material http://www.wwwmetrics.com/ myth: well but all queries are about porn data: sex terms often rank highly as query terms, but users tend to use more diverse queries when searching for other topics. e.g., educational queries probably use more different terms so that each individual term does not rank as highly. http://www.neci.nec.com/~lawrence/web99questions.html