For those new around here, my day job is at a University working on genetic sequencing projects. Part of my daily TODO at work is to keep up on scientific literature, which is a tall task considering how many journals, subjects and specialities, and just sheer volume of data is being published right now. […]
Today I was asked to create a data retention policy that manages ~60TB of our generated sequencing data. This data is both in-house and customer, collaborative and cross-institutional, and generated at our facility and elsewhere. No singular policy can surely cover all of these things, but nevertheless, I have to formulate something that will work.