PREDICT Project Overview
Researchers require current data on Internet security threats, including samples of normal and malicious Internet traffic, malicious software samples, and logs from machines compromised in targeted attacks, and other data to develop hardware and software that protects against and mitigates the effects of hacking attempts and malicious software. Concerns over privacy, security, proprietary information, and legal risks make collection and distribution of such data difficult for the owners of the infrastructure, owners of data, collectors of data, and distributors of data. Thus, few organizations make datasets available for the development and testing of defensive technologies.
The Department of Homeland Security (DHS) has developed the Protected Repository for the Defense of Infrastructure Against Cyber Threats (PREDICT) project to provide vetted researchers with current network operational data in a secure and controlled manner that respects the security, privacy, legal, and economic concerns of Internet users and network operators. Three primary goals of PREDICT are:
- To develop, implement, and maintain a Web-based portal that catalogs current computer network and operational data and handles data requests.
- To enable secure access to multiple sources of data collected on the Internet.
- To facilitate data sharing among PREDICT participants for the purpose of developing new models, technologies and products increasing cyber security capabilities.
More information about PREDICT is available in the Overview of the PREDICT program (DHS.gov PDF document).
CAIDA's Role in the PREDICT Project
CAIDA has been involved with the development of the PREDICT program since its inception; CAIDA personnel have served in an advisory capacity on all committees developing and implementing PREDICT processes and procedures.
Ongoing project activities include:
- collection, curation, and documentation of passive and active Internet measurement data (Data Provider);
- hosting and distribution of collected data sets to approved researchers (Data Host); and
- advisory role in developing technical, legal, and practical aspects of PREDICT policies and procedures.
![[CAIDA - Cooperative Association for Internet Data Analysis logo]](/images/caida_globe_faded.png)