visualization techniques are very useful when exploring large amount of information especially when dealing with dataflow. A pixel-oriented visualization technique based on the CGR algorithm (Chaos Game Representatio...
详细信息
ISBN:
(纸本)0819439800
visualization techniques are very useful when exploring large amount of information especially when dealing with dataflow. A pixel-oriented visualization technique based on the CGR algorithm (Chaos Game Representation) has been designed to help recognize type of flowing data on the fly. The CGR method -originally developed for the analysis of genomic sequences -and modified here to allow for coding bit sequences- is an algorithm that produces images where pixels dynamically display current frequencies of small groups of bits in the observed sequence. Qualitative and quantitative expressions of order, regularity, structure and complexity of sequences are perceptible from CGR images that consequently may be used for classification or identification purposes. The method has been applied to a wide range of files including texts of different languages (genomic sequences among others), images with different formats, and data or software of various origins. It is observed that CGR images are file-specific and may be consequently used as data signatures. Not only type of files can be easily identified, but subclasses of data (such as language - and eventually origin- for text), are also decipherable.
暂无评论