检索结果-内蒙古大学图书馆

IEEE International Conference on Image processing

作者： Yao Zhao R.L. Lagendijk Information and Communication Theory Group Faculty of Information Technology and Systems Delft University of Technnology Delft Netherlands Institute of Information Science Northern Jiaotong University Beijing China

The robustness against geometrical attacks remains one of the most challenging issues in watermarking of images and video. This paper presents several improvements of the video watermarking approach presented in Haitsma et al. (2001), namely (i) using a temporally low-pass watermark and (ii) synchronization to resist attacks along the temporal axis. In order to improve the watermark detection performance, we propose to use an amplitude-limiting filter and a whitening filter during the watermark extraction process. Experimental results show that the proposed techniques achieve good performance.

关键词： Watermarking Low pass filters Robustness Resists Frequency synchronization Filtering Video compression Humans visual system Gaussian noise

来源：评论

学校读者我要写书评

暂无评论

Integrated multimedia processing for topic segmentation and classification

Integrated multimedia processing for topic segmentation and ...

引用

IEEE International Conference on Image processing

作者： R.S. Jasinschi N. Dimitrova T. McGee L. Agnihotri J. Zimmerman D. Li Philips Research Briarcliff Manor NY USA

We describe integrated multimedia processing for Video Scout, a system that segments and indexes TV programs according to their audio, visual, and transcript information. Video Scout represents a future direction for personal video recorders. In addition to using electronic program guide metadata and a user profile, Scout allows the users to request specific topics within a program. For example, users can request the video clip of the USA president speaking from a half-hour news program. Video Scout has three modules: (i) video pre-processing, (ii) segmentation and indexing, and (iii) storage and user interface. Segmentation and indexing, the core of the system, incorporates a Bayesian framework that integrates information from the audio, visual, and transcript (closed captions) domains. This framework uses three layers to process low, mid, and high-level multimedia information. The high-level layer generates semantic information about TV program topics. This paper describes the elements of the system and presents results from running Video Scout on real TV programs.

关键词： Indexing Speech User interfaces Bayesian methods Multimedia communication TV broadcasting Data mining Video sharing Multimedia systems Engines

来源：评论

学校读者我要写书评

暂无评论

Lost motion vector recovery for digital video communication

引用

Conference on visual communications and Image processing

作者： Yu, ZH Wu, HR Yu, SY Monash Univ Sch Comp Sci & Software Engn Clayton Vic 3800 Australia

ISBN: (纸本)0819437034

For MPEG-ii and other hybrid MC/DPCM/DCT based video coding standards, it is very important to reconstruct the predicted frames based on the block motion information. In case of transmission over unreliable channels, error concealment methods are introduced to recover the lost or erroneous motion vectors. In this paper, a novel side motion estimation method is proposed to recover the lost motion vectors by selecting from a candidate motion vector set. The outer boundary of the lost block is used to perform motion estimation and the recovered motion vector is the one that minimises the squared error of the block boundary pixels between two consecutive frames. The method takes advantage of the same motion direction of most blocks and their boundaries. It releases the boundary pixel gray level continuity assumption of traditional boundary match/side match approaches so that better estimation result can be achieved. Overlapped block motion compensation is also incorporated in the proposed method to reduce the blocking artefacts. By reducing the number of motion vectors in the candidate set, the performance of the proposed algorithm can be further improved.

关键词： error concealment motion estimation motion compensation digital video coding

来源：评论

学校读者我要写书评

暂无评论

High-performance compression of visual information - A tutorial review - Part I: Still pictures

引用

PROCEEDINGS OF THE IEEE 1999年第6期87卷 976-1011页

作者： Egger, O Fleury, P Ebrahimi, T Kunt, M Oasya SA CH-1110 Morges Switzerland Swiss Fed Inst Technol Signal Proc Lab CH-1015 Lausanne Switzerland

Digital images have become an important source of information in the modern world of communication systems. In their raw form, digital images require a tremendous amount of memory. Many research efforts have been devoted to the problem of image compression in the last two decades. Two different compression categories must be distinguished: lossless and lossy. Lossless compression is achieved if no distortion is introduced in the coded image. Applications requiring this type of compression include medical imaging and satellite photography. For applications such as video telephony ol multimedia applications, some loss of information is usually tolerated in exchange for a high compression ratio. In this two-part paper, the major building blocks of image coding schemes are overviewed. Part I covets still image coding, and Parr ii covers motion picture sequences. In this first part, still image coding schemes have been classified into predictive, block transform, and multiresolution approaches. Predictive methods are suited to lossless and low-compression applications. Transform-based coding schemes achieve higher compression ratios for lossy compression but suffer from blocking artifacts at high-compression ratios. Multiresolution approaches are suited for lossy as well for lossless compression. At lossy high-compression ratios, the typical artifact visible in the reconstructed images is the ringing effect. New applications in a multimedia environment drove the need for new functionalities of the image coding schemes. For that purpose, second-generation coding techniques segment the image into semantically meaningful parts. Therefore, parts of these methods have been adapted to work for arbitrarily shaped regions. In ol-der to add another functionality, such as progressive transmission of the information, specific quantization algorithms must be defined. A final step in the compression scheme is achieved by the codeword assignment. Finally, coding results ale presented

关键词： compression image processing JPEG MPEG standards still pictures

来源：评论

学校读者我要写书评

暂无评论

Difference in visual information between face to face and telephone dialogues

Difference in visual information between face to face and te...

引用

1997 IEEE International Conference on Acoustics, Speech, and Signal processing (ICASSP 97)

作者： Iwano, Y Sugita, Y Kasahara, Y Nakazato, S Shirai, K Waseda Univ Tokyo Japan

ISBN: (纸本)0818679204

In this research, we analyzed conversations between a pair of subjects, under two conditions. One is face to face conversation that has a visual contact, and the other is conversation through telephone line that has not. From the recorded videotape we extracted the subject's actions especially focusing on the head movements. By comparing the dialogues under two conditions, it seems that there are two types of head movements, one is intended to give a response to his partner and the other is to send some signal. We are going to analyze how visual information contributes in spoken dialogue perceptions, and possibility of adopting it in a multi-modal human interface.

关键词： visual communication

来源：评论

学校读者我要写书评

暂无评论

Algorithmic representation of visual information

Algorithmic representation of visual information

引用

International Conference on Image processing

作者： Sow, D Eleftheriadis, A Columbia Univ New York United States

ISBN: (纸本)0818681837

In [6], we have introduced Complexity Distortion Theory, a mathematical framework characterizing the design of programmable communication systems. In this paper, we show how Complexity Distortion Theory fits in the MPEG-4 context and more generally in any system allowing programmability, by formalizing the concept of programmable decoders. We also show how it can be used to design intelligent encoders at two flexibility levels: the first one corresponding to the case where flexibility in the algorithm selection is allowed and the second where downloadability of new tools for representation is also allowed.

关键词： Image coding

来源：评论

学校读者我要写书评

暂无评论

About multimedia communication standardization architecture

About multimedia communication standardization architecture

引用

3rd International Conference on Signal processing (ICSP 96)

作者： Feng, YM Xu, W Lu, TX Northern Jiaotong Univ China

ISBN: (纸本)7505338900

From the early days of multimedia communication development, ITU-T has paid much attention to establish standard systems and to consummate them. This is necessary in the process of turning into the world of information Highway. In this paper, the authors conclude systematically the ITU-T Recommendation H series and T series on Audio-graphic and audio-visual multimedia services according to the research works on multimedia communication which our research lab has been doing these years as well as our experience of participating the research and discussion works of ITU-T standard files on image and multimedia communication which was held by the telecommunication department in order to establish our country's own corresponding standards.

关键词： Voice/data communication systems

来源：评论

学校读者我要写书评

暂无评论

DATA STRUCTURES FOR GIGABYTE SYSTEMS 2

DATA STRUCTURES FOR GIGABYTE SYSTEMS

引用

Conference on visual Data Exploration and Analysis ii

作者： KLINGER, A UNIV CALIF LOS ANGELES LOS ANGELESCA 90024

ISBN: (纸本)0819417572

This paper involves data structures in planning to combine engineering research areas considered as communication modes: image, outline-sketches, and speech. Images are enhanced compressed and transmitted, but in graphics solid display is central, while in speech recognition/identification dominate. Outside computing, graphics uses sketch, outline-drawing, or schematic summaries of other data (photographic images). Practical image-processing involves comparisons, features/edges, shape, and segmentation, using both transforms and other global analyses. Most speech work involves domain restriction. This limit can be deleted by focussing on data structures: they can link word and picture domains, and allow for captioning, for indexing/highlighting-domains to users. This shows data structures enable implementing useful functions, support information-handling with synergistic benefits: the paper's theme. Data structuring is also the theme of recent research literature on alternate means for visual presentation of multiple-measure numerical data. This paper briefly surveys these materials. We show how research from the data structure field enables new methods for addressing visualization issues, improves large-record data-handling, and aids greater use of visual and numerical records. (This expands on a talk presented 8 July 1994 at Argonne National Laboratory.)

关键词： DATA STRUCTURES INDEXING SPEECH DATA ANALYSIS MULTIPLE information SOURCES SCHEMATIC SUMMARIES MAPS MEDICAL IMAGES

来源：评论

学校读者我要写书评

暂无评论

MULTIPLE RESOURCE THEORY .2. EMPIRICAL-EXAMINATION OF MODALITY-SPECIFIC ATTENTION TO TELEVISION SCENES

引用

communication RESEARCH 1994年第2期21卷 208-231页

作者： BASIL, MD UNIV HAWAII HONOLULUHI 96822 USA

Multiple resource theory proposes that attention is a process of resource allocation. These resources may be shifted among different modalities and information-processing tasks. This study investigated whether selective attention to a particular television modality results in different levels of attention to the visual and auditory modalities. Two independent variables manipulated selective attention-the modality with the most information (audio or video) and viewers' instructed focus (audio or video). These variables were fully crossed in a within-subjects experimental design. Attention levels were investigated by measuring reaction times to cues in each modality (audio tones and color flashes). All five manipulation checks suggest that subjects were able to focus on a particular message channel. Reactions to cues were faster, however, when the audio channel contained the most information and when viewers focused on the audio channel. These results suggest a common pool of limited attentional resources and therefore bimodal attention.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Demonstrating image communication within open distributed environments 2nd

引用

2nd International Workshop on Advanced Teleservices and High-Speed communication Architectures, IWACA 1994

作者： Strack, Rüdiger Cordes, Ralf Sutcliffe, Dale C. Fraunhofer Institute for Computer Graphics Wilhelminenstraße 7 DarmstadtD-64283 Germany Bosch Telecom Kleyerstraße 94 Frankfurt/MainD-60326 Germany Rutherford Appleton Laboratory Chilton Didcot OxonOXI1 0QX United Kingdom

ISBN: (纸本)9783540584940

A wide variety of image interchange and communication (de facto) standards are employed today in different systems, applications and environments. Within the AMICS project a framework, called the Image communication Open Architecture(ICOA), was defined to enable the various standards and standardization activities in the broad area of imaging and image communication to be related and the necessary support tools to be identified. Based on the ICOA, software tools to support the framework were developed focusing on different requirements for image communication. One requirement was perceived to underlie all the others, that of providing uniform access to whole images and parts of images whether they are stored locally or remotely. Such uniform access is provided through the ICOA Image Handling Interface (IHI). The IHI is realized by means of the ICOA Image Handler that is modelled as an Open Distributed processing (ODP) object. The Image Handler encompasses the support of various compression schemes and (image) data formats, as well as different conversion facilities. To demonstrate the concepts of the ICOA and the ICOA Software Tools, a remote teaching scenario was chosen. The teaching scenario illustrates the accessibility of any image storage with any kind of digital image format on it within a multimedia communication environment. The access to the media is provided by a multimedia communication service. Within this service, the Image Handler is used to retrieve still and moving images and those parts of audio-visual information that are covered by the ICOA. © 1994, Springer Verlag. All rights reserved.

关键词： Digital storage

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：