deduplication is the technique of data reduction by breaking streams of data down into very granular components, and storing only the first instance of data items on the destination media and all the other similar occ...
详细信息
ISBN:
(纸本)9780769550336
deduplication is the technique of data reduction by breaking streams of data down into very granular components, and storing only the first instance of data items on the destination media and all the other similar occurrences to an index. Hash values are computed to identify the similar data items. Fixed size chunking (FSC) is a deduplication algorithm which breaks the data into fixed size chunks or blocks from the beginning of the file. But the main disadvantage of this technique is that, if new chunks are added in front or in the middle of a file, remaining chunks will get shifted from its initial position. This will yields a new hash value to the resulting chunks and thereby less deduplication ratio. But we can overcome this drawback by calculating hash values of chunks from the beginning as well as from the end of file and storing both values to metadata table. A new algorithm 'Dual Side Fixed Size Chunking' is proposed to get the high deduplication ratio over existing FSC. Without using computationally expensive Variable size chunking or content defined chunking, this algorithm can be effectively used for video or audio files to achieve a better deduplication ratio. This data reduction will provide network bandwidth savings and the ability to store more data on a given amount of disk or cloud storage. Reduced storage requirements will result in lower storage management and energy costs.
DNA-based molecular communication (DNA-MC) is a biological communication mechanism that uses DNA strands as information carriers. The longevity, stability, high information density, massive parallelism, and biological...
详细信息
DNA-based molecular communication (DNA-MC) is a biological communication mechanism that uses DNA strands as information carriers. The longevity, stability, high information density, massive parallelism, and biological compatibility of DNA offer a dramatic potential for DNA-based storage, computing, and communication. This letter extends our previous work, which used the directional and controllable molecular hopper along the track to replace the slow and random diffusion mechanisms. This letter proposes a multiple-track-hopper parallel communication mechanism to achieve high throughput by parallel transmission and sequencing. We recommend utilizing interleaved coding to mitigate the bit error rate (BER) caused by the back-stepping motion, resulting in successive symmetric errors. Additionally, we have explored the proper interleaving depth necessary to preserve the diminished DNA information density that results from the redundancy for error correction. Simulations show that interleaved coding efficiently reduces BER in parallel DNA-MC while requiring less redundancy. This letter demonstrates the feasibility and potential of high-throughput and low-error DNA-MC, which could enable novel interdisciplinary advances between DNA communication, nanotechnology, and synthetic biology.
暂无评论