The creation, expression, and application of dynamic sculpture as public art in public environmental space combines traditional dynamic sculpture art with contemporary public art from a specific angle, and then tries to sort ...
Human Activity Recognition (HAR) has emerged as a promising research topic for smart healthcare owing to the rapid growth of wearable and smart devices in recent years. The significant applications of HAR in ambient as...
In the realm of image processing and pattern recognition, writing with gestures in the air has grown in popularity and difficulty over the past few years. It contributes significantly to the creation of automation pro...
ISBN (digital): 9798350353006
ISBN (print): 9798350353013
Referring Remote Sensing Image Segmentation (RRSIS) is a new challenge that combines computer vision and natural language processing. Traditional Referring Image Segmentation (RIS) approaches have been impeded by the complex spatial scales and orientations found in aerial imagery, leading to suboptimal segmentation results. To address these challenges, we introduce the Rotated Multi-Scale Interaction Network (RMSIN), an innovative approach designed for the unique demands of RRSIS. RMSIN incorporates an Intra-scale Interaction Module (IIM) to effectively address the fine-grained detail required at multiple scales and a Cross-scale Interaction Module (CIM) for integrating these details coherently across the network. Furthermore, RMSIN employs an Adaptive Rotated Convolution (ARC) to account for the diverse orientations of objects, a novel contribution that significantly enhances segmentation accuracy. To assess the efficacy of RMSIN, we have curated an expansive dataset comprising 17,402 image-caption-mask triplets, which is unparalleled in terms of scale and variety. This dataset not only presents the model with a wide range of spatial and rotational scenarios but also establishes a stringent benchmark for the RRSIS task, ensuring a rigorous evaluation of performance. Experimental evaluations demonstrate the exceptional performance of RMSIN, surpassing existing state-of-the-art models by a significant margin. Datasets and code are available at https://***/Lsan2401/RMSIN.
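To make the rotated-convolution idea concrete, the sketch below shows one generic way to implement a rotation-adaptive convolution in PyTorch: a small head predicts a rotation angle from the input, the kernel sampling grid is rotated by that angle, and a standard convolution is applied with the resampled weights. This is an illustration under our own simplifications (one angle per image, a per-sample loop), not the RMSIN implementation; all names here are ours.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class AdaptiveRotatedConv2d(nn.Module):
    """Generic sketch of a rotation-adaptive convolution (not RMSIN's ARC):
    predict an angle per input, rotate the kernel, then convolve."""

    def __init__(self, in_ch, out_ch, k=3):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(out_ch, in_ch, k, k) * 0.02)
        # Tiny routing head: pooled features -> one rotation angle per image.
        self.angle_head = nn.Sequential(
            nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(in_ch, 1)
        )
        self.k = k

    def forward(self, x):
        theta = self.angle_head(x).squeeze(-1)  # (B,) predicted angles
        outs = []
        for b in range(x.size(0)):  # per-sample kernel rotation
            cos, sin = theta[b].cos(), theta[b].sin()
            zero = torch.zeros_like(cos)
            # 2x3 affine matrix that rotates the kernel sampling grid.
            rot = torch.stack([torch.stack([cos, -sin, zero]),
                               torch.stack([sin, cos, zero])])
            rot = rot.unsqueeze(0).expand(self.weight.size(0), -1, -1)
            grid = F.affine_grid(rot, list(self.weight.size()),
                                 align_corners=False)
            w_rot = F.grid_sample(self.weight, grid, align_corners=False)
            outs.append(F.conv2d(x[b:b + 1], w_rot, padding=self.k // 2))
        return torch.cat(outs, dim=0)
```

Rotating the kernel rather than the feature map lets the same filter respond to oriented structures without enumerating a fixed set of discrete rotations.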
ISBN (print): 9781665445092
We aim at accelerating super-resolution (SR) networks on large images (2K-8K). The large images are usually decomposed into small sub-images in practical usage. Based on this processing, we found that different image regions have different restoration difficulties and can be processed by networks with different capacities. Intuitively, smooth areas are easier to super-resolve than complex textures. To utilize this property, we can adopt appropriate SR networks to process different sub-images after the decomposition. On this basis, we propose a new solution pipeline - ClassSR - that combines classification and SR in a unified framework. In particular, it first uses a Class-Module to classify the sub-images into different classes according to restoration difficulty, then applies an SR-Module to perform SR for the different classes. The Class-Module is a conventional classification network, while the SR-Module is a network container that consists of the to-be-accelerated SR network and its simplified versions. We further introduce a new classification method with two losses - Class-Loss and Average-Loss - to produce the classification results. After joint training, a majority of sub-images will pass through smaller networks, so the computational cost can be significantly reduced. Experiments show that our ClassSR can help most existing methods (e.g., FSRCNN, CARN, SRResNet, RCAN) save up to 50% FLOPs on the DIV8K dataset. This general framework can also be applied to other low-level vision tasks.
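To illustrate the routing pipeline, here is a minimal PyTorch sketch: a classifier scores each sub-image's restoration difficulty and dispatches it to one of three SR branches of increasing width. The branches are toy stand-ins (not FSRCNN/CARN/SRResNet/RCAN), the Class-Loss/Average-Loss training is omitted, and every name below is our own.

```python
import torch
import torch.nn as nn

class ClassSRSketch(nn.Module):
    """Toy version of the ClassSR pipeline: classify, then route each
    sub-image to an SR branch sized for its difficulty class."""

    def __init__(self, num_classes=3, scale=4):
        super().__init__()
        # Class-Module stand-in: a small conventional classifier.
        self.classifier = nn.Sequential(
            nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(16, num_classes),
        )
        # SR-Module stand-in: one branch per class, width grows with difficulty.
        self.branches = nn.ModuleList([
            nn.Sequential(
                nn.Conv2d(3, w, 3, padding=1), nn.ReLU(),
                nn.Conv2d(w, 3 * scale * scale, 3, padding=1),
                nn.PixelShuffle(scale),  # upsample by rearranging channels
            )
            for w in (8, 16, 32)
        ])

    def forward(self, sub_images):
        # Hard routing at inference: easy patches take the cheap branch.
        labels = self.classifier(sub_images).argmax(dim=1)
        outs = [self.branches[int(c)](img.unsqueeze(0))
                for img, c in zip(sub_images, labels)]
        return torch.cat(outs, dim=0), labels
```

Because most sub-images in smooth regions get routed to the narrow branch, the average cost per image drops even though the hardest patches still see the full-capacity network.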
Vision Transformers have proven their versatility and utility for complex computer vision tasks, such as land cover segmentation in remote sensing applications. While performing on par with or even outperforming other methods like Convolutional Neural Networks (CNNs), Transformers tend to require even larger datasets with fine-grained annotations (e.g., pixel-level labels for land cover segmentation). To overcome this limitation, we propose a weakly-supervised vision Transformer that leverages image-level labels to learn a semantic segmentation task and reduce the human annotation load. We achieve this by slightly modifying the architecture of the vision Transformer through the use of gating units in each attention head to enforce sparsity during training and thereby retain only the most meaningful heads. This allows us to directly infer pixel-level labels from image-level labels by post-processing the un-pruned attention heads of the model and refining our predictions by iteratively training a segmentation model with high fidelity. Training and evaluation on the DFC2020 dataset show that our method not only generates high-quality segmentation masks using image-level labels, but also performs on par with fully-supervised training relying on pixel-level labels. Finally, our results show that our method is able to perform weakly-supervised semantic segmentation even on small-scale datasets.
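A minimal sketch of the per-head gating idea, assuming a standard ViT attention block: each head's output is scaled by a learnable gate before the heads are mixed, and an L1 penalty on the gates pushes uninformative heads toward zero so they can be pruned. The gate placement and penalty form are our assumptions, not the paper's code.

```python
import math
import torch
import torch.nn as nn

class GatedAttention(nn.Module):
    """ViT-style multi-head self-attention with one learnable gate per
    head; sparse gates identify heads that can be pruned after training."""

    def __init__(self, dim, num_heads):
        super().__init__()
        assert dim % num_heads == 0
        self.h, self.d = num_heads, dim // num_heads
        self.qkv = nn.Linear(dim, 3 * dim)
        self.proj = nn.Linear(dim, dim)
        self.gates = nn.Parameter(torch.ones(num_heads))  # one gate per head

    def forward(self, x):  # x: (B, N, dim) token embeddings
        B, N, _ = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        q, k, v = (t.view(B, N, self.h, self.d).transpose(1, 2)
                   for t in (q, k, v))
        attn = (q @ k.transpose(-2, -1)) / math.sqrt(self.d)
        out = attn.softmax(dim=-1) @ v            # (B, h, N, d)
        out = out * self.gates.view(1, -1, 1, 1)  # gate each head's output
        return self.proj(out.transpose(1, 2).reshape(B, N, -1))

    def sparsity_loss(self):
        # L1 term added to the training objective; drives gates toward zero.
        return self.gates.abs().sum()
```

Gating before the output projection keeps each head's contribution separable, which is what makes the surviving heads' attention maps usable for inferring pixel-level labels afterwards.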
Since the low resolution of infrared focal plane arrays may degrade the performance of polarization imaging significantly, it is necessary to study the super-resolution reconstruction method for superior image resolut...
Microstate analysis of EEG signals makes full use of the spatial information of the brain topographic map and reflects the active association of different brain regions. Different from the traditional EEG feature...
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
Geospatial Copilots unlock unprecedented potential for performing Earth Observation (EO) applications through natural language instructions. However, existing agents rely on overly simplified single tasks and template-based prompts, creating a disconnect with real-world scenarios. In this work, we present GeoLLM-Engine, an environment for tool-augmented agents with intricate tasks routinely executed by analysts on remote sensing platforms. We enrich our environment with geospatial API tools, dynamic maps/UIs, and external multimodal knowledge bases to properly gauge an agent’s proficiency in interpreting realistic high-level natural language commands and its functional correctness in task completions. By alleviating overheads typically associated with human-in-the-loop benchmark curation, we harness our massively parallel engine across 100 GPT-4-Turbo nodes, scaling to over half a million diverse multi-tool tasks across 1.1 million satellite images. By moving beyond traditional single-task image-caption paradigms, we investigate state-of-the-art agents and prompting techniques against long-horizon prompts.
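To make "tool-augmented" concrete, the sketch below shows a bare-bones tool-dispatch loop in Python over hypothetical geospatial tools. The tool names, arguments, and canned outputs are invented for illustration; a real GeoLLM-Engine episode would interleave LLM calls that choose each next tool from the observations so far.

```python
from typing import Callable

# Hypothetical tool registry standing in for geospatial API tools;
# both the tools and their outputs are invented for this sketch.
TOOLS: dict[str, Callable[..., str]] = {
    "search_imagery": lambda bbox, date: f"found 12 scenes in {bbox} on {date}",
    "detect_objects": lambda scene_id, cls: f"detected 3 '{cls}' in {scene_id}",
}

def run_plan(plan: list[dict]) -> list[str]:
    """Execute a multi-step tool plan (as an agent would emit it) and
    collect the observations fed back to the model at each step."""
    observations = []
    for step in plan:
        tool = TOOLS[step["tool"]]  # look up the requested tool
        observations.append(tool(**step["args"]))
    return observations

# A toy long-horizon task: locate imagery, then count objects in a scene.
print(run_plan([
    {"tool": "search_imagery",
     "args": {"bbox": "36.7,-119.8,36.9,-119.6", "date": "2023-06-01"}},
    {"tool": "detect_objects",
     "args": {"scene_id": "scene_001", "cls": "aircraft"}},
]))
```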
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
Vision Transformers have demonstrated outstanding performance in computer vision tasks. Nevertheless, this superior performance for large models comes at the expense of increasing memory usage for storing the parameters and intermediate activations. To accelerate model inference, in this work we develop and evaluate integer and mixed-precision kernels in Triton for the efficient execution of two fundamental building blocks of transformers – the linear layer and attention – on graphics processing units (GPUs). On an NVIDIA A100 GPU, our kernel implementations of Vision Transformers achieve a throughput speedup of up to 7x compared with reference kernels in PyTorch floating-point single precision (FP32). Additionally, the top-1 accuracy of the ViT Large model drops by less than one percent on the ImageNet-1K classification task. We also observe up to 6x increased throughput by applying our kernels to the Segment Anything Model image encoder while keeping the mIoU close to the FP32 reference on the COCO2017 dataset for static and dynamic quantization. Furthermore, our kernels demonstrate improved speed compared to the TensorRT INT8 linear layer, and we improve the throughput of the base FP16 (half-precision) Triton attention on average by up to 19 ± 4.01%. We have open-sourced the QAttn framework, which is tightly integrated with the PyTorch quantization workflow: https://***/IBM/qattn.
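As a flavor of what an integer Triton kernel looks like, here is a minimal per-tensor INT8 GEMM sketch: tiles of A and B are loaded as int8, accumulated in int32 with tl.dot, and dequantized once at the end with the two tensor scales. This is a generic illustration, not one of the QAttn kernels; the block sizes and per-tensor quantization scheme are our assumptions.

```python
import torch
import triton
import triton.language as tl

@triton.jit
def int8_gemm_kernel(a_ptr, b_ptr, c_ptr, M, N, K, scale_a, scale_b,
                     BLOCK_M: tl.constexpr, BLOCK_N: tl.constexpr,
                     BLOCK_K: tl.constexpr):
    # Each program instance computes one BLOCK_M x BLOCK_N tile of C.
    pid_m = tl.program_id(0)
    pid_n = tl.program_id(1)
    rm = pid_m * BLOCK_M + tl.arange(0, BLOCK_M)
    rn = pid_n * BLOCK_N + tl.arange(0, BLOCK_N)
    acc = tl.zeros((BLOCK_M, BLOCK_N), dtype=tl.int32)  # int32 accumulator
    for k in range(0, K, BLOCK_K):
        rk = k + tl.arange(0, BLOCK_K)
        a = tl.load(a_ptr + rm[:, None] * K + rk[None, :],
                    mask=(rm[:, None] < M) & (rk[None, :] < K), other=0)
        b = tl.load(b_ptr + rk[:, None] * N + rn[None, :],
                    mask=(rk[:, None] < K) & (rn[None, :] < N), other=0)
        acc = tl.dot(a, b, acc)  # int8 x int8 accumulated into int32
    # Dequantize once with the two per-tensor scales and store as FP32.
    c = acc.to(tl.float32) * scale_a * scale_b
    tl.store(c_ptr + rm[:, None] * N + rn[None, :], c,
             mask=(rm[:, None] < M) & (rn[None, :] < N))

def int8_gemm(a_int8, b_int8, scale_a, scale_b):
    """Launch the kernel over a 2D grid of output tiles."""
    M, K = a_int8.shape
    _, N = b_int8.shape
    c = torch.empty((M, N), device=a_int8.device, dtype=torch.float32)
    grid = (triton.cdiv(M, 64), triton.cdiv(N, 64))
    int8_gemm_kernel[grid](a_int8, b_int8, c, M, N, K, scale_a, scale_b,
                           BLOCK_M=64, BLOCK_N=64, BLOCK_K=32)
    return c
```

Keeping the accumulation in int32 and deferring the multiply by the scales to a single epilogue step is what lets integer tensor cores do the bulk of the work while preserving accuracy.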