Supporting Research and Development of Security Technologies through Network and Security Data Collection
Sponsored by:
Department of Homeland Security (DHS)

Our activities for this project include collection, curation, hosting, and distribution of active and passive Internet measurement data as well as providing advice on technical, legal, and practical aspects of PREDICT policies and procedures.

Funding source: DHS S&T cooperative agreement FA8750-12-2-0326. Period of performance: September 28, 2012 - September 27, 2013 (optionally extendable through September 27, 2017).

Statement of Work

CAIDA provides fundamental research on a reasonable efforts basis and in accordance with UC policy. We shall accomplish the following data providing, data hosting, and project support tasks:

I. Providing Data

CAIDA will provide the following data to PREDICT:

| Dataset Name | Dates | Content | Notes |
|---|---|---|---|
| IPv4 Routed /24 Topology | 2008 - ongoing | Forward IPv4 paths, reply Time-to-Live (TTL), Round-Trip-Time (RTT), and ICMP responses | Measured from Ark platform |
| IPv4 Routed /24 DNS Names | 2008 - ongoing | Fully-qualified domain names for IP addresses in the IPv4 Routed /24 dataset | Uses a custom-built bulk DNS lookup service |
| IPv6 Topology | 2008 - ongoing | Forward IPv6 paths, TTL, RTT, and ICMP responses | Measured from Ark platform |
| Internet Topology Data Kits (ITDK) | 2010 - ongoing | Router-level topology data, router-to-AS assignments, geographic location of each router, DNS lookups of all observed IP addresses | Derived from Ark measurements once every 3-6 months |
| Active Internet Topology Measurements with Skitter | 1998 - 2008 | IP paths, RTT | Legacy data |
| Near Real-time UCSD Network Telescope Data | 2001 - ongoing | IP packets in PCAP format | The most recent 2 months of data |
| Archived samples of Near Real-time UCSD Network Telescope Data | 2001 - ongoing | IP packets in PCAP format | Archived periodically, TBD |
| OC48 Peering Point IP Packet Headers | 2002 - 2003 | Three packet traces in PCAP format captured in 2002-2003 | Legacy data |


1. Describe the physical, logical, and functional configuration of the collection mechanism for datasets provided to PREDICT.
2. Document and implement a process for managing the datasets listed above for dissemination via the PREDICT legal framework.
3. Support the maintenance of the PREDICT data catalog (hosted by the PREDICT Coordinating Center, PCC) consistent with the data collection and data curation tasks.
4. Describe any restrictions on the use of any datasets provided to PREDICT, including international dissemination.
5. Describe efforts and methods employed to ensure that the data is legally collected and compliant with privacy laws, anonymization techniques (if applied), and appropriate disclosure control processes.
6. Describe and implement any Institutional Review Board (IRB) or ethics review processes related to dataset requests, including nominal timelines, the issues to consider, and the expected frequency of reviews.
7. Provide a risk analysis of any dataset provided to PREDICT that addresses federal, state, local, and international laws relevant to the collection and dissemination of the dataset, as well as any ethical issues.
8. Describe how data collections are planned to evolve as devices, architectures, and protocols evolve.

II. Data Hosting


1. Provide a data hosting infrastructure to support the PREDICT project.
2. Describe any expansion plans for hosts and bandwidth to be needed as a result of traffic growth.
3. Describe scenarios and processes for the dissemination of data that may occur via media.
4. Provide a description of data hosting infrastructure to be employed in the performance of this SOW (hardware, software, logical configuration, and mirroring or redundancy equivalence).
5. If willing to host external data, indicate the availability of infrastructure and provide a plan for curating data from external sources.

III. Project Support

Implement and document PREDICT project support, as follows:


1. Attend PI Meetings (not to exceed 3 times per year) and provide an on-site venue for hosting PI Meetings (as requested).
2. Provide regular status briefings and participate in monthly teleconferences, project planning efforts, program reviews, and other technical interchange meetings.
3. Collaborate with other PREDICT project participants on the establishment and monitoring of project-level metrics that describe the utility of the provided and hosted datasets, evaluate dataset popularity, track the growth of the data collection, and similar measures.
4. Describe how UCSD will meet PREDICT goals for reviewing, negotiating, and executing PREDICT legal documents, including Data Provider and Data Host Memoranda of Agreement (MOAs) with PCC. Discuss the availability of decision-making legal support and any other required entity needed to respond to these documents consistent with UCSD policy and within seven (7) business days.
5. Support the PREDICT ARB activities to screen researcher organizations, referring organizations, and researchers for legitimacy of purpose and intent of PREDICT data use.
6. Describe and implement a plan to publicize the availability of the data provided via PREDICT.
7. Support project outreach via Government-approved workshops, conferences, and other technical forums.


In the course of the project, CAIDA shall provide the following deliverables:

| No. | Deliverable | Due | Status |
|---|---|---|---|
| 1 | Project Management Plan | November 15, 2012 | done: 2014 2012 |
| 2 | Hosting Infrastructure Description Document | November 15, 2012 | done: 2014 2012 |
| 3 | Technical Status Report | quarterly | |
| 4 | Financial Status Report | monthly | ongoing |
| 5 | Briefings and Research Papers | as available | |
| 6 | Final Report | upon completion of the project | |
  Last Modified: Wed Aug-6-2014 15:11:50 PDT