PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.
We present and evaluate the idea of auto-generating training data for network application classification using a rule-based expert system on two-dimensions of the feature space. That training data is then used to learn classification of network applications using other dimensions of the feature space. The rule-based expert system uses transport layer port number conventions (source port, destination port) from the Internet Assigned Numbers Authority (IANA) to classify applications to create the labeled training data. A classifier can then be trained on other network ow features using this auto-generated training data. We evaluate this approach to network application classification and report our findings. We explore the use of the following classifiers: K-nearest neighbors, decision trees, and random forests. Lastly, our approach uses data solely at the ow-level (in NetFlow v5 records) thereby limiting the volume of data that must be collected and/or stored.
Carlos Alcantara,Venkat Dasari,Christopher Mendoza, andMichael P. McGarry
"Auto-generating training data for network application classification", Proc. SPIE 11013, Disruptive Technologies in Information Sciences II, 1101305 (10 May 2019); https://doi.org/10.1117/12.2519547
ACCESS THE FULL ARTICLE
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.
The alert did not successfully save. Please try again later.
Carlos Alcantara, Venkat Dasari, Christopher Mendoza, Michael P. McGarry, "Auto-generating training data for network application classification," Proc. SPIE 11013, Disruptive Technologies in Information Sciences II, 1101305 (10 May 2019); https://doi.org/10.1117/12.2519547