ISBN (digital): 9798350368741
ISBN (print): 9798350368758
Text-to-image Person Retrieval (TIPR) aims to use natural language descriptions as queries to retrieve pedestrian images. However, existing methods concentrate only on aligning individual text-image pairs and ignore the specific self-representations within both the visible images and the textual descriptions of the same identity, neglecting the impact of intra-modal information distribution on TIPR. In this paper, a novel Relation-aware Semantic Alignment Network (RSAN) is proposed to learn reliable and comprehensive semantic visual-textual associations across modalities. Specifically, a Global Semantic Alignment Matching (GSAM) loss is introduced to enhance the coherence of inter-modality features while preserving intra-modal representations for cross-modal matching. Additionally, an Adapter-assisted Information Aggregation (AIA) module is designed to further complement contextual information fusion between image features and text embeddings. Extensive experiments on two public benchmark datasets demonstrate the superiority of the proposed RSAN.
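The abstract does not spell out the GSAM loss, but the symmetric cross-modal matching objective that such methods build on can be sketched as follows; the temperature value, the function names, and the assumption that row i of each batch forms a matched image-text pair are illustrative, not taken from the paper:

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def matching_loss(img_feats, txt_feats, temperature=0.07):
    """Symmetric image-text matching loss over a batch of paired features.

    Row i of each array is assumed to describe the same identity, so the
    diagonal of the similarity matrix holds the positive pairs.
    """
    # L2-normalize so the dot product is cosine similarity.
    img = img_feats / np.linalg.norm(img_feats, axis=1, keepdims=True)
    txt = txt_feats / np.linalg.norm(txt_feats, axis=1, keepdims=True)
    sim = img @ txt.T / temperature          # (B, B) similarity logits
    labels = np.arange(len(sim))
    # Cross-entropy in both retrieval directions (image->text, text->image).
    i2t = -np.log(softmax(sim, axis=1)[labels, labels]).mean()
    t2i = -np.log(softmax(sim, axis=0)[labels, labels]).mean()
    return (i2t + t2i) / 2
```

A trained model would minimize this over feature pairs produced by the image and text encoders; perfectly aligned features drive the loss toward zero.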
ISBN (digital): 9798350368741
ISBN (print): 9798350368758
In recent years, Transformers have achieved significant success in image fusion. These methods apply self-attention mechanisms across different spatial or channel dimensions and have demonstrated impressive performance. However, existing methods optimize along only a single dimension and struggle to simultaneously capture the complex dependencies between the spatial and channel dimensions. To address this problem, we propose a novel multi-dimensional adaptive interaction transformer network, named MAITFuse, to enhance the multilevel information expression and detail retention capabilities of images. We design a Multi-Dimensional Feature Extraction (MDFE) module to extract features across the spatial and channel dimensions in parallel, and introduce a novel weighted cross-attention fusion method to integrate multi-dimensional information effectively. Experimental results show that, compared to existing fusion methods, our proposed method achieves superior fusion performance across various datasets.
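The weighted cross-attention fusion idea can be illustrated with a minimal single-head sketch; the fixed scalar weight `w` stands in for whatever learned gating MAITFuse actually uses, and the function names are ours:

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(q_feats, kv_feats):
    # Single-head cross-attention: tokens of one branch query the other.
    d = q_feats.shape[-1]
    attn = softmax(q_feats @ kv_feats.T / np.sqrt(d), axis=-1)
    return attn @ kv_feats

def weighted_cross_fusion(feat_a, feat_b, w=0.5):
    # Each branch is enriched with information attended from the other,
    # then the two enriched branches are blended by a scalar weight.
    a2b = cross_attention(feat_a, feat_b)
    b2a = cross_attention(feat_b, feat_a)
    return w * (feat_a + a2b) + (1 - w) * (feat_b + b2a)
```

In the paper's setting the two inputs would be the spatial-dimension and channel-dimension feature streams from the MDFE module.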
ISBN (digital): 9798331518523
ISBN (print): 9798331518530
Image fusion is a method used in image processing to provide a more complete representation by amalgamating features and data from many images. Multimodal medical image fusion involves the integration of medical images from multiple imaging modalities, including computed tomography (CT) scans, positron emission tomography (PET), and magnetic resonance imaging (MRI), into a single dataset. This integration enhances the visualisation of anatomical structures and clinical situations, hence improving diagnostic accuracy by leveraging the strengths of each modality. This study employs MRI, CT, and PET scans as experimental modalities. This review aims to compare multimodal medical image fusion approaches based on the Stationary Wavelet Transform (SWT), Non-Subsampled Shearlet Transform (NSST), Convolutional Neural Networks (CNN), and Non-Subsampled Contourlet Transform (NSCT). This study examines the latest conventional and non-conventional research conducted within these disciplines. It further evaluates these approaches according to diverse image quality metrics and several quantitative assessments. According to this comparison, CNN-based fusion demonstrates superior results, as the overall visual and parametric quality of its fusion outcomes surpasses that of the other approaches evaluated.
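As a concrete point of reference for the wavelet-family methods compared here, a one-level Haar DWT fusion with the classic average/max-absolute coefficient rule might look like this (a deliberate simplification: SWT, NSST, and NSCT use richer, shift-invariant decompositions):

```python
import numpy as np

def haar2d(img):
    # One-level 2D Haar transform; image sides must be even.
    a = (img[0::2] + img[1::2]) / 2         # row-pair average
    d = (img[0::2] - img[1::2]) / 2         # row-pair detail
    LL = (a[:, 0::2] + a[:, 1::2]) / 2
    LH = (a[:, 0::2] - a[:, 1::2]) / 2
    HL = (d[:, 0::2] + d[:, 1::2]) / 2
    HH = (d[:, 0::2] - d[:, 1::2]) / 2
    return LL, LH, HL, HH

def ihaar2d(LL, LH, HL, HH):
    # Exact inverse of haar2d.
    a = np.empty((LL.shape[0], LL.shape[1] * 2))
    d = np.empty_like(a)
    a[:, 0::2], a[:, 1::2] = LL + LH, LL - LH
    d[:, 0::2], d[:, 1::2] = HL + HH, HL - HH
    out = np.empty((a.shape[0] * 2, a.shape[1]))
    out[0::2], out[1::2] = a + d, a - d
    return out

def dwt_fuse(img1, img2):
    # Classic rule: average the approximation band, keep the stronger
    # (max-absolute) coefficient in each detail band.
    b1, b2 = haar2d(img1), haar2d(img2)
    fused = [(b1[0] + b2[0]) / 2]
    for c1, c2 in zip(b1[1:], b2[1:]):
        fused.append(np.where(np.abs(c1) >= np.abs(c2), c1, c2))
    return ihaar2d(*fused)
```

Because the transform has perfect reconstruction, fusing an image with itself returns the image unchanged, which makes the rule easy to sanity-check.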
ISBN (digital): 9798350368741
ISBN (print): 9798350368758
Low-light image enhancement (LLIE) can be reformulated as an image-specific curve estimation (CE) problem. Traditional CE-based methods struggle with issues such as uniform processing across different regions, static parameter estimation, and the lack of effective global semantic enhancement. To address these limitations, we propose a novel unsupervised learning framework, Patch-wise Dynamic Curve Estimation (PDCE), which dynamically adjusts and optimizes enhancement curves according to local patch brightness and the iteration process. Specifically, we present a Vision-Language Curve Discriminator (VLCD), which dynamically determines the curve type for each patch, avoiding uniform application of a single curve to the whole image. We introduce a Curve Parameter Estimator (CPE), which dynamically updates curve parameters and adjusts enhancement effects based on the output of the previous iteration. Furthermore, we design a Visual State Space-based Semantic Enhancement Module (VSEM), which captures global receptive fields and enriches semantic features through a Mamba-based U-Net architecture. Extensive experimental results show the superiority of our PDCE over state-of-the-art methods for LLIE.
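PDCE's exact curves are not given in the abstract, but CE-based LLIE methods commonly iterate the quadratic curve LE(x) = x + a·x·(1 − x); a minimal sketch, assuming intensities in [0, 1] and one scalar alpha per iteration (per-patch alphas would be arrays broadcast over the image in the same way):

```python
import numpy as np

def enhance_curve(img, alphas):
    """Iteratively apply the quadratic enhancement curve
    LE(x) = x + a * x * (1 - x), one alpha per iteration.

    `img` holds intensities in [0, 1]; keeping each alpha in [-1, 1]
    guarantees the output stays in range, since the curve maps [0, 1]
    onto itself for those alphas.
    """
    x = img.copy()
    for a in alphas:
        x = x + a * x * (1 - x)
    return x
```

Positive alphas brighten dark regions most strongly near mid-tones, while the fixed points at 0 and 1 prevent clipping, which is why this family of curves suits low-light enhancement.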
ISBN (digital): 9791188428137
ISBN (print): 9798331507602
The visually impaired are unable to enjoy leisure activities as much as sighted people due to various limitations. To expand the scope of leisure activities for the visually impaired, we have developed a vibration glove-based system that helps with piano learning. Previous research used 88 infrared light-emitting diodes and gloves with infrared receivers to provide feedback to the user, but this method had many limitations; in particular, the inconvenient user experience and low accuracy were the biggest problems. Our method solves both problems using a camera and an image processing algorithm. In tests of the model on 20 piano images, all keys were perfectly recognized in 75% of cases, and the gloves could be used comfortably in practice without any difficulty. Thus, our method offers a simpler user experience for the visually impaired, without requiring any special modifications to the piano.
ISBN (digital): 9798331518523
ISBN (print): 9798331518530
Image fusion is a technique used in image processing to create a more comprehensive representation by combining features and data from several images. Multi-modal medical image fusion incorporates medical images from several imaging modalities, such as computed tomography (CT) scans, positron emission tomography (PET), and magnetic resonance imaging (MRI), into a single dataset. This integration yields better visualization of anatomical structures and clinical conditions, increasing diagnostic accuracy by using the strengths of each modality. In this paper, MRI, CT, and PET scans are used as experimental modalities. This review aims to compare multi-modal medical image fusion approaches based on Multi-resolution Singular Value Decomposition (MSVD), Principal Component Analysis (PCA), the Discrete Wavelet Transform (DWT), and Wavelet Packet Decomposition (WPD). This paper explores the latest conventional and non-conventional research conducted in these domains. It also compares these methods based on various image quality parameters and quantitative checks. Based on this comparison, PCA shows the best results, as the overall visual and parametric quality of its fusion results is better than that of the compared methods.
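The PCA fusion scheme evaluated here is typically implemented by weighting each source image with the components of the leading eigenvector of their joint covariance; a minimal grayscale sketch (function name ours):

```python
import numpy as np

def pca_fuse(img1, img2):
    # Treat the two source images as two variables, take the leading
    # eigenvector of their 2x2 covariance matrix, and use its normalized
    # components as fusion weights.
    data = np.stack([img1.ravel(), img2.ravel()])
    cov = np.cov(data)
    vals, vecs = np.linalg.eigh(cov)
    v = np.abs(vecs[:, np.argmax(vals)])
    w = v / v.sum()
    return w[0] * img1 + w[1] * img2
```

The result is a convex combination of the inputs, so the image carrying more variance (typically more structural detail) automatically receives the larger weight.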
In this paper, we present a novel yet intuitive unsupervised feature learning approach, referred to as Minimizing Interframe Differences (MID). The idea is the following: as long as the unsupervised features successfu...
ISBN (digital): 9798331518523
ISBN (print): 9798331518530
Visually impaired people face problems with independent navigation due to limited visual information. Facing significant challenges, they often travel with an assistant or a relative. This project introduces an approach that increases the independence of blind users through new software solutions. The proposed system employs DenseNet201 for feature extraction and Long Short-Term Memory (LSTM) networks for generating accurate, context-aware captions. These captions are converted into real-time auditory descriptions using the gTTS library, enabling users to interpret and navigate their environment confidently. Evaluated on the Flickr8k dataset, the system achieved a BLEU score of 0.721, demonstrating its ability to generate high-quality captions. The system's architecture is designed to balance accuracy, efficiency, and user accessibility, and incorporates a modular design optimized for computational efficiency and scalability. Future work includes exploring wearable technology for continuous real-time feedback, integrating advanced natural language processing (NLP) models for richer contextual understanding, and enhancing applicability in complex indoor and outdoor environments. This approach represents a significant step toward empowering visually impaired individuals with improved mobility and environmental awareness.
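The reported BLEU score can be made concrete with a toy BLEU-1 computation: clipped unigram precision scaled by a brevity penalty, here for a single candidate/reference pair (full BLEU additionally combines higher-order n-gram precisions):

```python
import math
from collections import Counter

def bleu1(candidate, reference):
    """BLEU-1 for one candidate/reference pair of token lists:
    clipped unigram precision times the brevity penalty."""
    cand, ref = Counter(candidate), Counter(reference)
    # Clip each candidate word's count by its count in the reference,
    # so repeating a correct word cannot inflate the score.
    clipped = sum(min(n, ref[w]) for w, n in cand.items())
    precision = clipped / max(len(candidate), 1)
    # Brevity penalty discourages very short captions.
    bp = 1.0 if len(candidate) >= len(reference) else \
        math.exp(1 - len(reference) / max(len(candidate), 1))
    return bp * precision
```

For example, `bleu1(["a", "dog", "runs"], ["a", "dog", "sleeps"])` yields 2/3, since two of three candidate tokens match the reference.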
ISBN (digital): 9798331508685
ISBN (print): 9798331519476
Convolutional Neural Networks (CNNs) have grown into a powerful image recognition tool and are an important part of building Internet of Things (IoT) applications. In this regard, CNNs make it possible to analyse visual data gathered from networked devices, such as cameras and sensors, effectively and accurately. With an emphasis on their potential for real-time applications such as smart surveillance, healthcare monitoring, autonomous cars, and industrial automation, this study investigates the incorporation of CNN-based image recognition into IoT contexts. We tackle important issues such as the requirement for low-latency processing, energy efficiency, and the computing limitations of edge devices. To maximise CNN performance in the resource-constrained IoT ecosystem, strategies such as model compression, edge computing, and distributed architectures are covered. The ability to interpret large volumes of visual data is improved by the integration of CNNs with IoT, providing creative solutions for automation and intelligent decision-making across numerous industries.
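Model compression, mentioned above as a strategy for resource-constrained edge devices, often begins with post-training quantization; a minimal sketch of symmetric per-tensor int8 quantization of a weight array (the simplest variant, not any specific framework's implementation):

```python
import numpy as np

def quantize_int8(weights):
    # Symmetric per-tensor quantization: one float scale maps the
    # whole tensor to int8, cutting storage 4x versus float32.
    scale = np.abs(weights).max() / 127.0
    q = np.clip(np.round(weights / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    # Recover an approximation of the original float weights.
    return q.astype(np.float32) * scale
```

The round-trip error per weight is bounded by half the scale, which is why quantizing a well-conditioned layer usually costs little accuracy.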
ISBN (digital): 9798331521394
ISBN (print): 9798331521400
Increasing electronic waste (e-waste) is a serious environmental and financial problem that calls for creative ideas for effective resource recovery and recycling. This paper proposes an IoT- and Blockchain-integrated e-waste management system based on Convolutional Neural Networks (CNNs) to boost automated waste classification and, thus, circular economy practices. While Blockchain guarantees open and safe tracking of waste transportation, IoT sensors and RFID tags allow real-time monitoring of e-waste. Trained on a collection of 50,000 e-waste images, the CNN-based image classification model attained an accuracy of 96.2% in classifying components into recyclables, reusables, and hazardous items. Using automated decision-making, the system showed a 28% reduction in processing time and a 32% gain in sorting efficiency over conventional approaches. Blockchain incorporation enhanced traceability through 100% secure transaction records, lowering fraud and illegal disposal. Experimental data show that this method improves resource recovery rates by 23% by promoting sustainable e-waste management. The proposed architecture minimizes environmental impact by encouraging ethical e-waste disposal and a closed-loop recycling system. These results show how Blockchain-secured, AI-driven IoT devices can help advance circular-economy principles in global e-waste management.
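The tamper-evidence that Blockchain contributes here rests on hash chaining; a minimal sketch of a hash-linked ledger of waste-tracking events (the event strings and field names are illustrative, not the paper's schema):

```python
import hashlib
import json

def record_hash(record):
    # Deterministic SHA-256 over the canonical JSON of a record.
    return hashlib.sha256(
        json.dumps(record, sort_keys=True).encode()).hexdigest()

def append_block(chain, event):
    # Each block stores the previous block's hash, so editing any
    # earlier event invalidates every later hash in the chain.
    prev = chain[-1]["hash"] if chain else "0" * 64
    block = {"event": event, "prev": prev}
    block["hash"] = record_hash({"event": event, "prev": prev})
    chain.append(block)
    return chain

def verify(chain):
    # Walk the chain and recompute every hash link.
    prev = "0" * 64
    for b in chain:
        if b["prev"] != prev or \
           b["hash"] != record_hash({"event": b["event"], "prev": b["prev"]}):
            return False
        prev = b["hash"]
    return True
```

A production system would add signatures and distributed consensus on top, but the chained hashes alone already make silent edits to a waste-transport record detectable.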