<?xml version="1.0" standalone="no"?>
                    <!DOCTYPE div SYSTEM "/www/backend/www-xml-443/dtd/caidaML.dtd">
                    <!-- do NOT ERASE the DOCTYPE declaration! --><div>


<tr bgcolor="#f4f4f4">
  <td>
<font face="helvetica,arial" size="2">
<b>URL:</b>
</font>
</td>
  <td>
<font face="helvetica,arial" size="2">
<a href="http://www.eurecom.fr/~ennajjar/PUB/IMC09_EnNajjary.pdf">http://www.eurecom.fr/~ennajjar/PUB/IMC09_EnNajjary.pdf</a>
</font>
  </td>
</tr>


<tr bgcolor="#e9e9e9">
  <td>
<font face="helvetica,arial" size="2">
<b>ABSTRACT:</b>
</font>
</td>
  <td>
<font face="helvetica,arial" size="2">
Accurate identification of network traffic according to application type is a key issue for most
companies, including ISPs. For example, some companies might want to ban p2p traffic from their network while some
ISPs might want to
other additional services based on the application. To classify applications on the fly, most companies rely on deep
packet inspection (DPI) solutions. While DPI tools can be accurate, they require constant updates of their signatures
database. Recently, several statistical traffic classification methods have been proposed. In this paper, we
investigate the use of these methods for an ADSL provider managing many Points of Presence (PoPs). We demonstrate
that statistical methods can other performance similar to the ones of DPI tools when the classifier is trained for
a specific site. It can also complement existing DPI techniques to mine traffic that the DPI solution failed to
identify. However, we also demonstrate that, even if a statistical classifier is very accurate on one site, the
resulting model cannot be applied directly to other locations. We show that this problem stems from the
statistical classifier learning site specific information.


</font>
  </td>
</tr>


<tr bgcolor="#f4f4f4">
  <td>
<font face="helvetica,arial" size="2">
<b>RESULTS:</b>
</font>
</td>
  <td>
<font face="helvetica,arial" size="2">
<ul>
  <li>statistical classification can help revealing the traffic left unknown by the ground truth
establishment tools</li>
  <li>statistical classification is flexible enough to allow to group traffic based on application rather
than protocol</li>
  <li>some applications that are correctly classified when the classifier is applied on the same site
where it was trained, become difficult to identify when applied on another site</li>
</ul>


</font>
  </td>
</tr>
</div>

