As methods and access to gene synthesis and genetic engineering have become more advanced, the fear that malicious viruses and bacteria will be designed with the express intention of causing harm to humans has received increased attention. In the event that such biological weapons are deployed, the security community needs tools to rapidly recognize the threat and identify responsible parties. Therefore, a key question is whether or not a biological threat is manmade. Currently, experts are capable of qualitatively assessing whether specific genetic sequences are natural or man-made, but few objective criteria exist for characterizing the degree to which a sequence has been engineered. Additionally, progress has recently been made on the task of attributing an engineered gene sequence to a lab-of-origin using machine learning. However, the task of analyzing naturally occurring genetic sequences so as to automatically detect outliers that may have been genetically engineered has received comparatively little attention. This work proposes a method for generating a dataset of natural and engineered sequences that can be used as an input for training machine learning classifiers to perform automatic detection of human engineering in gene sequence data.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.