PMFFRC: a large-scale genomic short reads compression optimizer via memory modeling and redundant clustering
Abstract Background Genomic sequencing reads compressors are essential for balancing high-throughput sequencing short reads generation speed, large-scale genomic data sharing, and infrastructure storage expenditure.However, most existing short reads compressors rarely utilize big-memory systems and duplicative information between diverse sequencing