MassIVE/GNPS Fair Use Policy

For 10 years MassIVE and GNPS have provided valuable storage data analysis and repository services to the proteomics, metabolomics and natural products research communities, and we remain committed to providing a high quality service to the research community around the world. Since its inception, MassIVE and GNPS have provided free storage of mass spectrometry data to allow users to share their data via public dataset submissions, as well as to run any of our many freely available data analysis workflows. Many thousands of users thus continue to productively use our system in a manner that requires only limited long-term storage for files that are not used for dataset submissions.

Within the MassIVE/GNPS ecosystem, data is stored in two main locations:

The storage policy for submitted MassIVE datasets remains as it always has. Any dataset submitted to MassIVE will be permanently stored, with no limit to the amount of space occupied by any single user’s submissions, as long as each dataset is made public in a timely manner (shortly after publication of any associated manuscript). Making data and knowledge public benefits all users, and we want to ensure that all submissions contribute to the MassIVE/GNPS ecosystem’s capabilities.

However, when it comes to user account private file space, we have identified a growing number of users exceeding the average user’s storage volume by several orders of magnitude. These are private user files that have never been used in a MassIVE dataset submission, and therefore do not contribute to the overall public knowledge ecosystem and are using up large quantities of data infrastructure that could be better leveraged to enhance public knowledge.

Consequently, it has become necessary to require that all users limit their long-term (>=4 months) private user account storage to 100 GB or less, and that all private files are removed within 12 months of their last use in a data analysis workflow.

Files being prepared for dataset submissions or used in data analysis workflows will still be able to temporarily use over 100 GB of storage, as is often required for submission of large datasets. We simply ask that you avoid keeping your account’s total storage above 100 GB for several months on end, and that you not leave unused files in your account for more than one year.

If the total volume of user account uploads exceeds 100 GB for more than 4 months or any inactive files remain for more than 12 months, then the account will be flagged as "Non-compliant" and the following restrictions will apply:

We understand that this updated policy will require adjustments from some of our users and we will do all we can to minimize the impact of these changes while continuing to provide high quality services to all MassIVE and GNPS users. In particular, we understand that these requirements may cause some dataset files to have to be uploaded more than once before the full dataset can be submitted. To reduce the chances of requiring multiple uploads, we recommend that (a) all dataset files be uploaded only when the dataset is essentially ready for submission within just a few months or (b) data for a manuscript can also be submitted as multiple smaller datasets, all of which can then be associated with the same papers or project grants.

If you have any questions about this policy, please consult our Fair Use Policy FAQ or contact us at ccms@proteomics.ucsd.edu.