检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

时间限定

出版年份：

文献类型

图书期刊文献学位论文多媒体

馆藏选择

电子馆藏纸本馆藏

核心期刊

全部期刊 SCI 收录期刊 SSCI 收录期刊 EI 收录期刊 CSCD 收录期刊 CSSCI 收录期刊

语言

中文英文

文献类型

期刊文献图书学位论文标准纸本馆藏

帮助

文字说明：

T=题名（书名、题名），A=作者（责任者），K=主题词，P=出版物名称，PU=出版社名称，O=机构（作者单位、学位授予单位、专利申请人），L=中图分类号，C=学科分类号，U=全部字段，Y=年（出版发行年、学位年度、标准发布年）

检索规则说明：

AND代表“并且”；OR代表“或者”；NOT代表“不包含”；(注意必须大写,运算符两边需空一格)

检索范例：

范例一：(K=图书馆学 OR K=情报学) AND A=范并思 AND Y=1982-2016
范例二：P=计算机应用与软件 AND (U=C++ OR U=Basic) NOT K=Visual AND Y=2011-2016

分类表

所选分类

>> <<

限定检索结果

文献类型

15,243 篇 会议
187 篇 期刊文献
64 册 图书

馆藏范围

15,494 篇 电子文献
0 种 纸本馆藏

日期分布

学科分类号

7,638 篇 工学
- 5,960 篇 计算机科学与技术...
- 4,112 篇 软件工程
- 2,075 篇 信息与通信工程
- 1,704 篇 光学工程
- 1,225 篇 控制科学与工程
- 1,122 篇 电气工程
- 1,029 篇 生物工程
- 848 篇 生物医学工程（可授...
- 629 篇 电子科学与技术（可...
- 554 篇 安全科学与工程
- 470 篇 网络空间安全
- 449 篇 机械工程
- 442 篇 交通运输工程
- 375 篇 化学工程与技术
- 319 篇 材料科学与工程（可...
- 301 篇 仪器科学与技术
- 278 篇 建筑学
- 262 篇 土木工程
3,290 篇 理学
- 1,527 篇 物理学
- 1,124 篇 数学
- 1,121 篇 生物学
- 369 篇 化学
- 324 篇 统计学（可授理学、...
1,649 篇 管理学
- 1,014 篇 管理科学与工程(可...
- 773 篇 图书情报与档案管...
- 294 篇 工商管理
953 篇 医学
- 798 篇 临床医学
- 480 篇 基础医学(可授医学...
- 366 篇 公共卫生与预防医...
- 258 篇 药学(可授医学、理...
228 篇 法学
220 篇 农学
118 篇 教育学
86 篇 经济学
85 篇 文学
37 篇 军事学

主题

2,356 篇 accuracy
2,048 篇 computer vision
1,736 篇 deep learning
1,354 篇 computational mo...
1,346 篇 feature extracti...
1,307 篇 training
1,216 篇 convolutional ne...
1,093 篇 image segmentati...
982 篇 visualization
760 篇 image processing
746 篇 transformers
689 篇 real-time system...
567 篇 computer archite...
532 篇 object detection
438 篇 three-dimensiona...
424 篇 image recognitio...
405 篇 neural networks
342 篇 image edge detec...
333 篇 machine learning
332 篇 data models

机构

72 篇 chitkara univers...
35 篇 university of sc...
34 篇 school of comput...
34 篇 university of ch...
29 篇 school of comput...
26 篇 chitkara centre ...
26 篇 department of co...
25 篇 centre of resear...
24 篇 department of co...
23 篇 school of comput...
22 篇 shanghai jiao to...
21 篇 tsinghua univers...
21 篇 computer vision ...
21 篇 computer science...
20 篇 computer science...
20 篇 university of el...
20 篇 school of comput...
18 篇 school of comput...
18 篇 school of electr...
18 篇 computer science...

作者

16 篇 chen chen
14 篇 gill kanwarparta...
13 篇 liu jun
13 篇 yang yang
12 篇 chen li
12 篇 wang wei
11 篇 ahmad jalal
11 篇 jia zhenhong
11 篇 li xin
11 篇 li yang
11 篇 li chen
11 篇 deepak upadhyay
10 篇 sharma vikrant
10 篇 roy partha prati...
10 篇 satvik vats
10 篇 li xiaoli
10 篇 kukreja vinay
10 篇 vikrant sharma
9 篇 wei li
9 篇 zhou gang

语言

14,510 篇 英文
975 篇 其他
160 篇 中文
1 篇 土耳其文

检索条件"任意字段=2024 International Conference on Computer Vision and Image Processing, CVIP 2024"

共 15494 条记录，以下是51-60 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

A vision Transformer with Adaptive Cross-image and Cross-Resolution Attention 9th

A Vision Transformer with Adaptive Cross-Image and Cross-Res...

引用

27th international conference on Medical image Computing and computer Assisted Intervention (MICCAI)

作者： Murray, Benjamin A. K. Tan, Wei R. Canas, Liane S. Smith, Catherine H. Mahil, Satveer K. Ourselin, Sebastien Modat, Marc Kings Coll London Biomed Engn & Image Sci London SE1 7EH England Guys & St Thomas NHS Fdn Trust St Johns Inst Dermatol London England Kings Coll London London England

ISBN: (纸本)9783031776090;9783031776106

vision Transformers (ViTs) are the current state-of-the-art in deep learning for computer vision tasks. They are trained on vast datasets and are capable of useful downstream tasks through clever use of the attention mechanism. The biggest limiting factor for ViTs is the number of pixels and tokens that can be processed in a given pass. Memory constraints on both patch size and the number of patches mean that ViTs are most effective at processing relatively low-resolution images. Whilst ViTs can attend very flexibly across an image, attending across images in a naive fashion requires memory proportional to the square of the number of images. This is a further limiting factor. Given the task of automated assessment of psoriasis severity, a chronic skin condition that can affect large portions of a person's skin, it is necessary to look across multiple images and at fine detail in large images. We present a method that adapts ViTs to a two-stage design that allows for the regression of a patient's psoriasis score across multiple images and resolutions and shows its effectiveness relative to a baseline ViT. The implementation of our method is available at https://***/KCL-BMEIS/***.

关键词： vision Transformer Multi-image Multi-resolution Psoriasis

来源：评论

学校读者我要写书评

暂无评论

Real time processing and analysis of truck blind spot images based on computer vision

Real time processing and analysis of truck blind spot images...

引用

2024 international conference on Physics, Photonics, and Optical Engineering, ICPPOE 2024

作者： Dong, Jinsong Research Institute of Highway Ministry Transport Beijing100088 China

ISBN: (数字)9781510689121

ISBN: (纸本)9781510689114

Due to factors such as the large body volume, high position of the driver's cabin, and limited range of rearview mirror reflection, trucks have blind spots when making right turns and reversing. This prevents drivers from fully assessing the surrounding environment, leading to safety incidents. This paper proposes a computer vision-based method for detecting obstacles in truck blind spots. Deep learning-based detection techniques, particularly the YOLO series algorithms, are commonly used due to their simplicity and speed in real-time detection. However, inaccuracies in output bounding boxes, especially in detecting small-scale and overlapping objects, lead to low detection accuracy. This paper improves upon the YOLOv5s framework by incorporating attention mechanisms to enhance the feature extraction capability for small-scale and overlapping objects. Additionally, to prevent image feature loss and resolution reduction caused by pooling operations, multiple parallel dilated convolutions with different dilation rates are used instead of pooling layers, forming an atrous pyramid pooling structure. This structure is added after different hierarchical feature outputs and fused to increase the network's receptive field. © 2025 SPIE.

关键词： Trucks

来源：评论

学校读者我要写书评

暂无评论

The fusion strategy of multimodal learning in image and text recognition

The fusion strategy of multimodal learning in image and text...

引用

2024 international conference on Physics, Photonics, and Optical Engineering, ICPPOE 2024

作者： Cheng, Sihao Hou, Ruibo Li, Mingxi University of Illinois Urbana-Champaign ChampaignIL61820 United States

ISBN: (数字)9781510689121

ISBN: (纸本)9781510689114

Multimodal data such as text and image play an important role in various fields. Traditional machine learning methods often only deal with the data of a single modality, while ignoring the relevance between different modalities, which limits the ability to fully understand and analyze the data. Therefore, multimodal learning emerges as the times require, which aims to improve the effectiveness of data analysis and decision-making by fusing the information of different modal data. In this paper, the basic idea, fusion strategy and specific steps of multimodal learning are described in detail, including data fusion, feature extraction, model training and evaluation. Through a case study, it shows how to combine natural language processing and computer vision technology to solve the joint recognition task of image and text. The importance of selecting a suitable data fusion method is emphasized, and a method for evaluating the effect of multimodal data processing is proposed. © 2025 SPIE.

关键词： Adversarial machine learning

来源：评论

学校读者我要写书评

暂无评论

Application of conditional DDPM on the MNIST dataset 5

Application of conditional DDPM on the MNIST dataset

引用

5th international conference on Signal processing and computer Science, SPCS 2024

作者： Wang, Xin Beijing University of Posts and Telecommunications Beijing China

ISBN: (数字)9781510686731

ISBN: (纸本)9781510686724

Since its introduction, Denoising Diffusion Probabilistic Models (DDPM) have received widespread attention for their exceptional performance in image generation. They generate new samples by simulating the denoising process of data, a method that is not only simple and efficient but also capable of producing highly realistic samples. This paper explores the application of Conditional Denoising Diffusion Probabilistic Models (Conditional DDPM) on the MNIST dataset. MNIST is a classic dataset containing handwritten digit images, widely used in computer vision and machine learning fields. The paper first introduces the basic principles and model structure of Conditional DDPM, then elaborately explains how to train and apply the Conditional DDPM on the MNIST dataset, and analyzes the experimental results. The experimental results show that the Conditional DDPM can generate high-quality handwritten digit images that meet specific conditions on the MNIST dataset. © 2025 SPIE.

关键词： image denoising

来源：评论

学校读者我要写书评

暂无评论

Fourth international conference on computer vision, Application, and Algorithm, CVAA 2024

Fourth International Conference on Computer Vision, Applicat...

引用

4th international conference on computer vision, Application, and Algorithm, CVAA 2024

ISBN: (纸本)9781510687615

The proceedings contain 122 papers. The topics discussed include: DFrFT-ES model for emotion recognition based on fractional Fourier transform of EEG signals;research on traffic sign recognition under complex meteorological conditions;diffusion-augmented learning for long-tail recognition;apple leaf scab recognition using CNN and transfer learning;container image management in cloud-edge environments: an image deletion method based on layer affinity;computer graphics and image processing techniques based on visual communication design;dynamic fusion and non-negative matrix factorization-based multi-view clustering method;convolutional recurrent neural network-based EEG signal classification in motor imagery;and sentiment classification of MOOC courses by merging local context focus and bi-directional gated recurrent unit.

关键词：

来源：评论

学校读者我要写书评

暂无评论

vision Transformer for Audio-Based Depression Detection on Multi-Lingual Audio Data 24

Vision Transformer for Audio-Based Depression Detection on M...

引用

7th international conference on Digital Medicine and image processing, DMIP 2024

作者： Pratiwi, Monica Sanjaya, Samuel Ady Department of Computer Engineering Faculty of Engineering and Informatics Universitas Multimedia Nusantara Tangerang Indonesia Department of Information System Faculty of Engineering and Informatics Universitas Multimedia Nusantara Tangerang Indonesia

ISBN: (纸本)9798400709586

Depression has the potential to impact death rates, particularly when it comes to death by suicide. Inadequate diagnosis may result in a delay or unsuitable therapy, which can worsen symptoms of depression. Unaddressed or insufficiently addressed depression can result in deteriorating mental well-being, which includes a higher risk of experiencing suicidal ideation and engaging in self-destructive actions. Voice analysis can be employed to distinguish between individuals with depression and those without depression. However, the research that has been conducted on voice recognition to distinguish between depressed and non-depressed individuals keeps focusing on the use of a single dataset source as input for the classification model that is being developed. This work utilizes the vision Transformer model and various pre-trained transformers, such as DeiT (Data-Efficient image Transformer) and Swin Transformer, to detect depression based on voice. The objective was to create a more generic model not restricted to a specific language. Integrating Mel-spectrogram characteristics with a vision transformer-based model can enhance the efficacy of voice recognition models when dealing with multi-language data. The result shows 21% higher accuracy than the previous study that also implemented a cross-dataset test. Copyright © 2024 held by the owner/author(s). Publication rights licensed to ACM.

关键词： Spectrographs

来源：评论

学校读者我要写书评

暂无评论

Blur Patch Classification Approach to Single-image Depth Estimation 17

Blur Patch Classification Approach to Single-Image Depth Est...

引用

17th international conference on Machine vision, ICMV 2024

作者： Kim, Huijun Lee, Deokwoo Keimyung University Dalgubeol-daero Dalseo-gu Daegu1095 Korea Republic of

ISBN: (纸本)9781510688278

Depth information is useful in many image processing and computer vision applications, but in photography, depth information is lost in the process of projecting a real-world scene onto a 2D plane. Extracting depth information from such images is a challenging task. In this paper, we propose a method to train a deep neural network to classify an image patch (16x16 in size) into 15 levels based on the level of blur. Blur is related to the distance between the focal plane and the object. The input image is shifted using a sliding window technique at 8 pixel intervals and the trained blur classifier evaluates each blur level. The obtained blur maps are subjected to a refinement process to quantitatively assess their accuracy and impact on the final result, and the final blur maps are compared with the labels of the actual input data to estimate the depth map. The proposed method demonstrates that depth information can be successfully extracted from a single image by classifying the focus levels. © 2025 SPIE.

关键词： image classification

来源：评论

学校读者我要写书评

暂无评论

Predicting relevant captions using image caption generator in social media platforms 4th

Predicting relevant captions using image caption generator i...

引用

4th international conference on Computational Methods in Science and Technology, ICCMST 2024

作者： Gupta, Aryan Bhadauria, Devansh Singh Atray, Manav Kaur, Inderjeet Department of Computer Science & Engineering Galgotias College of Engineering & Technology Uttar Pradesh Greater Noida India

ISBN: (纸本)9781032911571

Creating natural language descriptions or captions for images is a formidable task that requires a combination of computer vision techniques to understand image content and natural language processing models to express this understanding in coherent sentences. This paper delves into the development of a web-accessible image Caption Generator, aiming to improve user experience by automatically generating captions for uploaded images. The versatility of this generator extends its applicability to various uses, such as image indexing, assisting visually impaired individuals, and enhancing interactions on social media platforms. This paper results in caption generator without the need for complex data pre-processing or a specialized model pipeline. Notably, a singular end-to-end model seamlessly predicts captions, streamlining the entire process. The paper integrates social media platforms with the proposed image Caption Generator giving users automated and contextually relevant captions, fostering a more immersive and inclusive digital experience. © 2025 the Author(s).

关键词： Long short-term memory

来源：评论

学校读者我要写书评

暂无评论

Machine vision and image processing for intelligent online monitoring and control of pharmaceutical fluidized bed granulation processes 4

Machine vision and image processing for intelligent online m...

引用

4th international conference on computer vision, Application, and Algorithm, CVAA 2024

作者： Liu, Yan Wu, Tao Wang, Xue Zhong Beijing Key Laboratory of Enze Biomass Fine Chemicals College of New Materials and Chemical Engineering Beijing Institute of Petrochemical Technology Beijing102617 China

ISBN: (数字)9781510687622

ISBN: (纸本)9781510687615

Fluidized bed granulation is a unit operation widely used in the pharmaceutical, chemical and food processing industries. It is a manufacturing technology that by suspending lose powders using hot air and transforms the powders into granules of uniform sizes to improve compaction and flow characteristics. The granule size distribution and moisture content are important quality indicators that are currently characterized by sampling and offline analysis in the laboratory, leading to time delay in measurement. This work reports an investigation of machine vision combined with deep learning image segmentation for on-line real-time monitoring. A non-invasive microscopic imaging probe with an integrated light source is designed and mounted on the granulator’s sight glass to monitoring the granule dynamic changes in particle morphology and size. In addition, a near-infrared spectrometer combined with chemometric modeling is used for real-time monitoring of moisture content. © 2025 SPIE.

关键词： Near infrared spectroscopy

来源：评论

学校读者我要写书评

暂无评论

MamTrack: vision-Language Tracking with Mamba Fusion 24

MamTrack: Vision-Language Tracking with Mamba Fusion

引用

8th international conference on computer Science and Artificial Intelligence, CSAI 2024

作者： Chen, Donghua Zhang, Hong Song, Jianbo Feng, Yachun Yang, Yifan Image Processing Center Beihang University Beijing China School of Mechanical Engineering & Automation Beihang University Beijing China Institute of Artificial Intelligence Beihang University Beijing China

ISBN: (纸本)9798400718182

vision-language tracking models aim to improve target tracking performance by fusing visual features and language description of the target, making it more useful and robust for a wider range of applications. Transformer excel at learning global information but are hindered by their quadratic complexity. The recently proposed State Space Model (SSM), Mamba, offers a promising solution to this issue by enabling global awareness with linear complexity. We consider the vision-language tracking task as a sequence generation task of multimodal tokens and propose an efficient vision-language tracking method, MamTrack. We extend the Mamba block and design an adaptive fusion strategy for Mamba’s different scanning directions to reduce channel redundancy. We introduce the CLIP language model to update text descriptions in real-time, enhancing the robustness of language descriptions in the tracking task. We compare our method with other state-of-the-art approaches on the TNL2K, LaSOT, and OTB99-Lang datasets, where our method achieves leading performance. © 2024 Copyright held by the owner/author(s).

关键词： Visual languages

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共500页 << < 2 3 4 5 6 7 8 9 10 11 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：