Storage Systems

A variety of storage systems with varying characteristics make up the data platform available to the ITS-RC community. As datasets increase in size, complexity, and/or sensitivity, more attention is required to ensure best match of resources to the tasks at hand.

/proj

High capacity storage
OK to compute against, however as IO increases, /work becomes the better option
OK to hold inactive data sets like a near-line archive
If you have a large set of truly inactive data, we can assist in moving it to our Cloud-based archive, achieving greater data durability and lower storage costs
more info

/users
Your primary storage directory is: /users/{o}/{n}/{onyen}
This storage is provided by the same hardware as /proj

High-capacity storage
OK to compute against, however as IO increases, consider a temporary copy/move to /work for processing
OK to hold inactive data sets like a near-line archive
If a meaningful amount of cold data accrues, it can be packaged and MOVED to cloud archive, providing more working space for your warm data
/users is not intended for team-oriented, shared storage, as is /proj; it is intended to be your personal storage location; think of it as a capacity expansion to your home directory. Note that /work is NOT intended to be a personal storage location; /work is for data actively being processed, especially for workloads with high IO requirements.
10 TB quota

/work

NOTE: New /work space for users is longer being created as of 7/14/2026. /users personal storage continues to be provisioned (see above)

Your /work directory is: /work/users/{o}/{n}/{onyen}

High performance SSD storage
Intended for active, "hot" data, being read or written
/work is NOT intended for holding inactive data (e.g., not used within 90 days); please move such older data to /users or /proj)
Older data being stored in /work is subject to being moved to a user's personal storage by RC staff when deemed appropriate
/work is not for shared storage (consider /proj for group-owned directories)
more info

Cloud-based data archive

The most durable, least expensive storage option available, presuming there is very low probability of needing to retrieve/access frequently or at large data volumes. Not all data is suitable for archival storage, and moving the data there requires proper curation and packaging of the data by the owner before being moved to the cloud. If you have a large dataset that you think fits these parameters, then please email research@unc.edu to request a consult to begin this process.

The following two systems are built upon old hardware that is no longer on maintenance, and thus at non-trivial risk of loss as compared to the storage systems listed above. This is the prior generation of /proj hardware; high capacity, decent performance.

/overflow

Useful for:

Data that can be re-acquired
Intermediate results of analysis and workflows
Staging area to re-organize data on its way somewhere else, such as Cloud based cold archive
etc.

/datacommons

Read only
Suggestions for data sets to add are welcome
Disposable data; if systems fail, data can be re-acquired

Examples of data sets in /datacommons:

1,000 Genomes
Berkeley DeepDrive, bdd100k
SCAMPS Dataset, Camera Measurement of Physiology
Indoor Scene Recognition
Stanford Dogs

Last updated: July 14, 2026

Documentation Content