dataarchving

Differences

This shows you the differences between two versions of the page.

dataarchving [2017/06/22 19:02]
mgstauff [Desktop RAID]
dataarchving [2017/08/07 15:59] (current)
mgstauff [PMACS HPC:Archive System]
Line 17: Line 17:
 ===Mini FAQ===
  
-Q: We want to back up our MRI data and are expecting to collect multiple terabytes of imaging data over the next few years. Do you have a specific suggestion for us? I was looking at the DS416 option from the wiki and also saw a 2-bay system : http://www.bestbuy.com/site/synology-diskstation-2-bay-external-network-storage-nas/5706546.p?skuId=5706546
+==Q: We want to back up our MRI data and are expecting to collect multiple terabytes of imaging data over the next few years. Do you have a specific suggestion for us? I was looking at the DS416 option from the wiki and also saw a 2-bay system==
  
 A: The key is to use RAID level 1 or higher, so you have redundancy if one of the drives fails. See here: https://tierradatarecovery.co.uk/dummies-guide-to-raid/
Line 23: Line 23:
 A two-bay system will work, depending on what "multiple terabytes" means. If it means 3TB, you could put two 4TB drives in there and have a RAID 1 system with 4TB of total storage. But big drives cost more, so it might be better to get a larger bay and use smaller drives, e.g. a 4-bay system with four 2TB drives in a RAID 5 configuration will get you 6TB of storage and still allow one drive to fail without losing data. If you want more peace of mind, get a big enough bay and large enough drives to run RAID 6, so two drives can fail at the same time.
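The capacity figures in this answer follow from simple arithmetic, sketched below in Python. This is only an illustration: the function names are made up for this example, it assumes classic RAID 1, 5, and 6 with every drive limited to the size of the smallest one, and it ignores filesystem overhead and the difference between marketed and formatted capacity.

```python
# Minimum number of drives for each RAID level discussed in this FAQ.
MIN_DRIVES = {1: 2, 5: 3, 6: 4}


def usable_capacity_tb(drive_sizes_tb, raid_level):
    """Approximate usable capacity in TB (hypothetical helper, not a vendor tool)."""
    if raid_level not in MIN_DRIVES:
        raise ValueError("only RAID 1, 5 and 6 are handled in this sketch")
    n = len(drive_sizes_tb)
    if n < MIN_DRIVES[raid_level]:
        raise ValueError(f"RAID {raid_level} needs at least {MIN_DRIVES[raid_level]} drives")
    smallest = min(drive_sizes_tb)
    if raid_level == 1:
        # A mirror stores the same data on every drive.
        return smallest
    # RAID 5 gives up one drive's worth of space to parity, RAID 6 gives up two.
    parity_drives = 1 if raid_level == 5 else 2
    return (n - parity_drives) * smallest


def failures_tolerated(num_drives, raid_level):
    """Number of drives that can fail at the same time without data loss."""
    if raid_level == 1:
        return num_drives - 1
    return {5: 1, 6: 2}[raid_level]


print(usable_capacity_tb([4, 4], 1))        # two 4TB drives, RAID 1 -> 4
print(usable_capacity_tb([2, 2, 2, 2], 5))  # four 2TB drives, RAID 5 -> 6
print(usable_capacity_tb([2, 2, 2, 2], 6))  # four 2TB drives, RAID 6 -> 4
```

The RAID 6 line shows the cost of the extra peace of mind: the same four 2TB drives yield 4TB instead of 6TB.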
    
-Q: Is it possible to purchase just 1 internal hard drive for now - or would you not recommend this - and if so do you have any good brands or suggestions?
+==Q: Is it possible to purchase just 1 internal hard drive for now - or would you not recommend this - and if so do you have any good brands or suggestions?==
  
 No, you want two drives at a minimum so you can at least do RAID 1. You can start with two drives, then add more and expand the RAID volume later. At least with the Synology systems, you can start with RAID 1 and then switch to RAID 5 or 6. Also, each drive can contribute only as much capacity as the smallest drive in the array, so if you start with 2TB drives, you'll want to expand in the future with 2TB drives (or larger drives, but only 2TB of each one will get used).
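The smallest-drive rule can be made concrete with a short Python sketch. The helper name is hypothetical, and the sketch assumes a classic RAID array in which every member is truncated to the smallest drive's size; a vendor-specific hybrid scheme may behave differently.

```python
def per_drive_usage_tb(drive_sizes_tb):
    """Return a (used, unused) pair in TB for each drive in the array."""
    smallest = min(drive_sizes_tb)
    return [(smallest, size - smallest) for size in drive_sizes_tb]


# Start with two 2TB drives, then expand later with one 2TB and one 4TB drive.
drives = [2, 2, 2, 4]
for size, (used, unused) in zip(drives, per_drive_usage_tb(drives)):
    print(f"{size}TB drive: {used}TB used, {unused}TB unused")
```

In this example the 4TB drive contributes only 2TB, which is why matching the size of the original drives is usually the economical choice when expanding.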
Line 30: Line 30:
 https://www.backblaze.com/blog/hard-drive-failure-rates-q1-2017/
    
-Q: How technologically savvy to be we need to be to maintain this system. How often would we need to check our back up system? We do not plan on accessing it very often - just using it purely as a back up kept off site.
+==Q: How technologically savvy to be we need to be to maintain this system. How often would we need to check our back up system? We do not plan on accessing it very often - just using it purely as a back up kept off site.==
  
 A: A typical undergrad or grad student in the sciences should be able to set up and maintain the system with the help of the documentation and Google. We have a couple of Synology DiskStation systems, and their interface is very good overall and reasonably easy to learn while still being powerful. It's all GUI-controlled.
Line 45: Line 45:
  
 ==== PMACS HPC:Archive System ====
+
+  NOTE 8/2017
+  
+  PMACS has new options for storage that may be of use.
+  In particular the "Research Commodity Storage" may be of use to cluster users because of
+  stated ability to conform to HIPAA compliance needs. We have not had time to investigate
+  this ourselves. You are welcome to contact PMACS about this and ask our help to figure
+  out if the new services are usable by cluster users.
+  
+  http://www.med.upenn.edu/pmacsnewsletter/#PMACSStorageServices
+
 This is a service that provides very easy access to a modern, robot-controlled, high-availability tape archiving system. It provides a simple filesystem-view interface with simple file retrieval. Custom Linux commands are provided for users to make their archive copies. Note that this is an **archiving** service and is not meant to be a regular backup service. You are able to retrieve files, but such retrievals are expected to be rare.
  
dataarchving.1498158143.txt.gz · Last modified: 2017/06/22 19:02 by mgstauff