Web developer Ukiah Smith wrote a blog post about which compression format to use when archiving. Obviously the algorithm must be lossless but beyond that he sets some criteria and then evaluates how some of the more common methods line up.
After some brainstorming I have arrived with a set of criteria that I believe will help ensure my data is safe while using compression.
- The compression tool must be opensource.
- The compression format must be open.
- The tool must be popular enough to be supported by the community.
- Ideally there would be multiple implementations.
- The format must be resilient to data loss.
Some formats I am looking at are zip, 7zip, rar, xz, bzip2, tar.
He closes by mentioning error correction. That has become more important than most acknowledge due to the large size of data files, the density of storage, and the propensity for bits to flip.
(Score: 2) by inertnet on Thursday September 12 2019, @10:42PM (1 child)
One other factor to consider is time to compress, as well as decompress. Some algorithms take long to compress but are fast to decompress, others may work the other way around.
Personally I rarely use compression and I rotate backup media, so I always have at least 3 versions of files.
(Score: 2) by FatPhil on Friday September 13 2019, @06:17PM
Great minds discuss ideas; average minds discuss events; small minds discuss people; the smallest discuss themselves