Stanford PHS's Secure Data Ecosystem (SDE) includes 4 computational environments that can be used for to analyze high-risk data (including PII & PHI): (1) the PHS Data Portal (aka Redivis), (2) Carina, (3) Nero GCP, and (4) the Virtual Windows Cloud S-VDI. These environments include a variety of widely-used programs (including R, SAS, Stata, Python, SQL, and others).
The table below gives an overview of the computational environments available:
Name | Description | Languages Available | Costs |
---|---|---|---|
PHS Data Portal (Includes the Redivis Notebooks) | A data catalog to find and explore PHS-related datasets, cut analytic cohorts, and analyze high-risk data. |
| Free for most uses. You can pre-buy extra computational resources if needed. Costs for extra resources are transparent and predictable, and are roughly equivalent to GCP costs. |
Carina | A secure, linux-based, on-premises computing environment for high-risk data. |
| 1TB of data storage is free per PI Group. Extra storage can be purchased (either a new node, or ~$94.65/month for 10TB). |
Nero GCP | A flexible and powerful cloud-based computing environment for high-risk data. |
| These machines are in Google Cloud, so there are cloud costs and setup costs, but they are not prohibitive. Approximately $100- 400/month for a typical machine. You can access Google’s price calculator here (though there’s a Stanford discount that is not reflected in the calculator). |
Virtual Windows Cloud S-VDI | A powerful and flexible cloud-based Azure computing environment structured in the Windows format that can handle high-risk data. |
| These machines are in a Cloud environment so there are cloud costs and setup costs, but they are not prohibitive. You can request an individual virtual machine at $80 per user per month (4 CPU, 16GB RAM, 120GB storage; costs increase linearly for higher-spec machines, and there is cost for STATA software). More detailed pricing information available here. |
Note about Security: Data access control is less monitored on Nero, Carina, and S-VDI as compared to the PHS Data Portal.
This means that we rely on each researcher to be responsible and only share data with other PHS-approved lab members. Please check that such lab members have completed the data access requirements (and had them approved), and that their access rights are still currently valid (i.e. approved) before sharing any data.
Failure to follow the ☞ Data Use Agreement will lead to disciplinary actions.