检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

时间限定

出版年份：

文献类型

图书期刊文献学位论文多媒体

馆藏选择

电子馆藏纸本馆藏

核心期刊

全部期刊 SCI 收录期刊 SSCI 收录期刊 EI 收录期刊 CSCD 收录期刊 CSSCI 收录期刊

语言

中文英文

文献类型

期刊文献图书学位论文标准纸本馆藏

帮助

文字说明：

T=题名（书名、题名），A=作者（责任者），K=主题词，P=出版物名称，PU=出版社名称，O=机构（作者单位、学位授予单位、专利申请人），L=中图分类号，C=学科分类号，U=全部字段，Y=年（出版发行年、学位年度、标准发布年）

检索规则说明：

AND代表“并且”；OR代表“或者”；NOT代表“不包含”；(注意必须大写,运算符两边需空一格)

检索范例：

范例一：(K=图书馆学 OR K=情报学) AND A=范并思 AND Y=1982-2016
范例二：P=计算机应用与软件 AND (U=C++ OR U=Basic) NOT K=Visual AND Y=2011-2016

分类表

所选分类

>> <<

限定检索结果

文献类型

17,702 篇 会议
260 册 图书
190 篇 期刊文献
1 篇 学位论文

馆藏范围

18,152 篇 电子文献
2 种 纸本馆藏

日期分布

学科分类号

10,553 篇 工学
- 6,243 篇 计算机科学与技术...
- 4,016 篇 电气工程
- 3,837 篇 控制科学与工程
- 2,913 篇 软件工程
- 1,929 篇 信息与通信工程
- 1,556 篇 光学工程
- 1,409 篇 机械工程
- 1,000 篇 仪器科学与技术
- 583 篇 电子科学与技术（可...
- 550 篇 生物医学工程（可授...
- 434 篇 生物工程
- 234 篇 材料科学与工程（可...
- 199 篇 交通运输工程
- 166 篇 安全科学与工程
- 155 篇 化学工程与技术
- 140 篇 力学（可授工学、理...
- 117 篇 建筑学
- 112 篇 土木工程
- 106 篇 航空宇航科学与技...
3,405 篇 理学
- 2,549 篇 物理学
- 806 篇 数学
- 487 篇 生物学
- 295 篇 系统科学
- 210 篇 统计学（可授理学、...
- 136 篇 化学
1,654 篇 医学
- 1,577 篇 临床医学
- 185 篇 基础医学(可授医学...
765 篇 管理学
- 585 篇 管理科学与工程(可...
- 191 篇 图书情报与档案管...
- 121 篇 工商管理
107 篇 农学
79 篇 法学
44 篇 经济学
44 篇 教育学
39 篇 艺术学
37 篇 军事学
18 篇 文学

主题

2,739 篇 computer vision
1,686 篇 cameras
1,490 篇 signal processin...
1,444 篇 robot vision sys...
1,357 篇 image processing
1,176 篇 robot sensing sy...
912 篇 signal processin...
876 篇 mobile robots
840 篇 feature extracti...
769 篇 machine vision
548 篇 image segmentati...
505 篇 object detection
444 篇 visualization
426 篇 deep learning
409 篇 robustness
393 篇 estimation
367 篇 stereo vision
358 篇 navigation
343 篇 training
321 篇 robot kinematics

机构

83 篇 centre for visio...
63 篇 xi an jiao tong ...
54 篇 centre for visio...
37 篇 school of electr...
36 篇 centre for visio...
29 篇 carnegie mellon ...
28 篇 chinese acad sci...
27 篇 shanghai jiao to...
27 篇 center for machi...
27 篇 university of ch...
23 篇 centre for visio...
23 篇 harbin inst tech...
21 篇 univ chinese aca...
21 篇 nanyang technol ...
17 篇 centre for visio...
16 篇 university of sc...
16 篇 tsinghua univers...
13 篇 chinese acad sci...
13 篇 univ sci & techn...
13 篇 chinese univ hon...

作者

52 篇 j. kittler
40 篇 josef kittler
28 篇 nakadai kazuhiro
19 篇 anil fernando
18 篇 wang wei
15 篇 chen chen
14 篇 yang yang
13 篇 jing zhang
13 篇 liu yang
13 篇 sun fuchun
13 篇 nascimento jacin...
12 篇 sun lining
12 篇 hansung kim
11 篇 zhang lei
11 篇 bartolozzi chiar...
11 篇 hong liu
10 篇 wang lei
10 篇 li yang
10 篇 aguiar pedro m. ...
10 篇 qiuqiang kong

语言

17,895 篇 英文
158 篇 其他
88 篇 中文
12 篇 土耳其文
3 篇 俄文
2 篇 西班牙文

检索条件"任意字段=International Conference on Robot Vision and Signal Processing"

共 18153 条记录，以下是861-870 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

Research on Target Detection Algorithm Based on Vehicle Detection

Research on Target Detection Algorithm Based on Vehicle Dete...

引用

2023 international conference on Algorithm, Imaging processing, and Machine vision, AIPMV 2023

作者： Huang, Yanguo Rao, Zehao Li, Luo College of Electrical and Automation Jiangxi University of Science and Technology GanZhou34100 China

ISBN: (纸本)9781510672444

Aiming at the current problem of unsatisfactory vehicle detection in complex scenes, an improved vehicle target detection network model is proposed. First, Res2Net residual network is fused in SCP, and the CSP_R structure is proposed, so that the model can extract deeper feature information and strengthen the ability to characterize small-scale targets;the attention mechanism is introduced, and the C3_CBAM module is designed to strengthen the attention to the detection targets while avoiding the increase of the model's computational volume;the loss function of the MPDIoU regression optimization is introduced, and the loss function is optimized by combining the prediction frame with the real frame length, width and area loss, and quantitative indicators to improve the convergence speed and robustness of the model. Finally, the model is validated on the SODA10M dataset, and the experimental results show that the model detection speed reaches 32 frames per second. The average detection accuracy reaches 83.7%, which is an improvement of 7.8 percentage points compared with YOLOV5s. © 2024 SPIE.

关键词： signal detection

来源：评论

学校读者我要写书评

暂无评论

Research on multimodal human-computer interaction technology based on audiovisual fusion 7

Research on multimodal human-computer interaction technology...

引用

7th international conference on Intelligent Computing and signal processing, ICSP 2022

作者： Jiao, Zhinan School of Art and Design Hubei University of Technology China

ISBN: (纸本)9781665478571

With the improvement of information technology, service robots are becoming more and more deeply involved in our work and life, and provide a wider variety of services. How to make robots communicate with humans more intelligently and understand and accomplish people's task needs has become a current research hotspot. In this paper, we propose a multimodal human-robot interaction technique with D-S evidence theory for fusion of audiovisual information, and a multimodal interaction system with speech and gesture interaction to improve the performance of human-robot interaction. © 2022 IEEE.

关键词： Human robot interaction

来源：评论

学校读者我要写书评

暂无评论

Exploring the Role of CLIP Global Visual Features in Multimodal Large Language Models

Exploring the Role of CLIP Global Visual Features in Multimo...

引用

2025 IEEE international conference on Acoustics, Speech, and signal processing, ICASSP 2025

作者： Bai, Zixing Bai, Yuting School of Computer Science and Technology Fudan University Shanghai China

ISBN: (纸本)9798350368741

The next recognized development direction of large language models (LLMs) is to integrate and enhance multimodal capability. Although current multimodal large language models (MLLMs) have achieved impressive performance by combining the pre-trained visual encoder CLIP and LLM, these works mainly focus on using the CLIP patch visual features. In practice, we find that CLIP global visual features are more efficient than patch visual features in some scenarios, especially in multimodal reasoning tasks. Therefore, we explore the role of CLIP global visual features and propose a new MLLM with the usage of full global visual features in this paper. Our model adopts the parameter-efficient transfer learning (PETL) method Adapter to fine-tune the pre-trained models and a simple MLP-based network to connect the visual encoder and LLM. To validate the performance, we evaluate our model on the first large-scale multimodal science question dataset, ScienceQA. Our model achieves a new state-of-the-art (SoTA) result of 93.96% on ScienceQA, which is higher than the previous SoTA result of 92.53%. © 2025 IEEE.

关键词： Language Multimodal Large Language Models vision

来源：评论

学校读者我要写书评

暂无评论

2023 3rd international conference on robotics, Electrical and signal processing Techniques, ICREST 2023

2023 3rd International Conference on Robotics, Electrical an...

引用

3rd international conference on robotics, Electrical and signal processing Techniques, ICREST 2023

ISBN: (纸本)9798350346435

The proceedings contain 69 papers. The topics discussed include: predicting mushroom edibility with effective classification and efficient feature selection techniques;performance enhancement of conventional design of 4-bit carry look-ahead adder;two-bit magnitude comparator design using gate diffusion input technique and static CMOS logic;a comprehensive study of camouflaged object detection using deep learning;fuzzy logic-based design optimization and economic planning of a microgrid for a residential community in Bangladesh;performance analysis of the AVR using an artificial neural network and genetic algorithm optimization technique;electrical impedance measurement technique to determine the impedance of a volume conductor with embedded object;design and implementation of embedded sensor network for an automated radio telescope;and development of a facial recognition pantograph drawing robot.

关键词：

来源：评论

学校读者我要写书评

暂无评论

When Does Visual Prompting Outperform Linear Probing for vision-Language Models? A Likelihood Perspective

When Does Visual Prompting Outperform Linear Probing for Vis...

引用

2025 IEEE international conference on Acoustics, Speech, and signal processing, ICASSP 2025

作者： Tsao, Hsi-Ai Hsiung, Lei Chen, Pin-Yu Ho, Tsung-Yi National Tsing Hua University Hsinchu Taiwan Dartmouth College HanoverNH United States IBM Research Yorktown HeightsNY United States The Chinese University of Hong Kong Shatin Hong Kong

ISBN: (纸本)9798350368741

Adapting pre-trained models to new tasks can exhibit varying effectiveness across datasets. Visual prompting, a state-of-the-art parameter-efficient transfer learning method, can significantly improve the performance of out-of-distribution tasks. On the other hand, linear probing, a standard transfer learning method, can sometimes become the best approach. We propose a log-likelihood ratio (LLR) approach to analyze the comparative benefits of visual prompting and linear probing. By employing the LLR score alongside resource-efficient visual prompts approximations, our cost-effective measure attains up to a 100-fold reduction in run time compared to full training, while achieving prediction accuracies up to 91%. The source code is available at VP-LLR. © 2025 IEEE.

关键词： transfer learning visual prompting

来源：评论

学校读者我要写书评

暂无评论

Leveraging Multimodal Methods and Spontaneous Speech for Alzheimer's Disease Identification

Leveraging Multimodal Methods and Spontaneous Speech for Alz...

引用

2025 IEEE international conference on Acoustics, Speech, and signal processing, ICASSP 2025

作者： Gao, Yifan Guo, Long Liu, Hong College of Computer Science National Key Laboratory of Fundamental Science on Synthetic Vision Sichuan University Chengdu China

ISBN: (纸本)9798350368741

Cognitive impairment detection through spontaneous speech is a promising avenue for early diagnosis of Alzheimer's disease (AD) and mild cognitive impairment (MCI), where timely intervention can significantly improve patient outcomes. The PROCESS Grand Challenge at ICASSP 2025 addresses these tasks by promoting innovative classification and regression methods for detecting cognitive decline. In this paper, we propose a multimodal fusion strategy that combines interpretable linguistic features with temporal embeddings extracted from pre-trained models. Our approach achieves an F1-score of 0.649 for the classification task (predicting healthy, MCI, dementia) and an RMSE of 2.628 for the regression task (MMSE score prediction), securing the top overall ranking in the competition. © 2025 IEEE.

关键词： Interpretable Features Multimodal Temporal Features

来源：评论

学校读者我要写书评

暂无评论

Automatic Fruit Detection Based on robotic vision 5

Automatic Fruit Detection Based on Robotic Vision

引用

5th international conference on Big Data and Artificial Intelligence and Software Engineering, ICBASE 2024

作者： Chen, Yiran Yao, Rui Liu, Haoran Liu, Bingqi Liu, Mingzhe Wenzhou University of Technology School of Data Science and Artificial Intelligence Wenzhou China Chengdu University of Technology College of Nuclear Technology and Automation Engineering Chengdu China Norla Institute of Technical Physics Chengdu China

ISBN: (纸本)9798331506612

Automatic fruit detection has greatly reduced labor costs and crop damage rates, contributing to the progress of agricultural modernization. It involves real-time assessment of the surrounding environment and recognition of target fruit categories. However, fruit detection faces challenges due to the similarities in shape, color, and other features among different types of fruits, as well as the impact of environmental factors such as inadequate lighting, occlusion, and background variations on recognition accuracy. This project focuses on the simulation of the robotic fruit harvesting process, which includes environmental image acquisition, image processing, and robot control. The robot utilizes its integrated camera to capture real-time images of the surrounding environment. These images are then processed on a computer, employing color recognition based on traditional algorithms. The images are segmented based on the identified color regions and subjected to object detection using YOLO v5. By discerning the presence of the target objects, the results are transmitted to the robot, which responds accordingly. The real-time performance is enhanced by filtering and narrowing down the target regions, reducing the time required for object recognition. © 2024 IEEE.

关键词： Fruits

来源：评论

学校读者我要写书评

暂无评论

FACE PHOTO-SKETCH SYNTHESIS VIA DOMAIN-INVARIANT FEATURE EMBEDDING 30

FACE PHOTO-SKETCH SYNTHESIS VIA DOMAIN-INVARIANT FEATURE EMB...

引用

30th IEEE international conference on Image processing (ICIP)

作者： Choi, Yeji Sohn, Kwanghoon Kim, Ig-Jae Yonsei Univ Sch Elect & Elect Engn Seoul South Korea Korea Inst Sci & Technol AI & Robot Inst Seoul South Korea Yonsei Univ Yonsei KIST Convergence Res Inst Seoul South Korea

ISBN: (纸本)9781728198354

Face photo-sketch synthesis involves transforming photos into sketches and vice versa. A well-transformed image should preserve its original identity characteristics and naturalness. However, identity preservation remains a challenge because of the large discrepancy between the photo and sketch domains. To this end, we propose a novel face photo-sketch synthesis framework that uses domain-invariant feature embedding (DIFE). The DIFE framework generates images assuming the domain-invariant feature of an image pair for the same person to be the identity information. A joint feature embedding module considers latent features from two different domains as input and transfers them into the domain-invariant latent space. Subsequently, a semantic-aware decoder completes the desired image guided by multiscale facial parsing masks. Experimental results demonstrate that the DIFE method outperforms state-of-the-art approaches visually and perceptually.

关键词： face photo-sketch synthesis face recognition domain-invariant feature identity preservation

来源：评论

学校读者我要写书评

暂无评论

AI-based Algorithm for GNSS Spoofing Detection 14

AI-based Algorithm for GNSS Spoofing Detection

引用

14th international conference on Information and Communication Technology Convergence, ICTC 2023

作者： Bong, Jae Hwan Kim, Doyoung Jeong, Seongkyun Sangmyung University Department of Human Intelligence Robot Engineering Cheonan Korea Republic of

ISBN: (纸本)9798350313277

GNSS is extensively employed for applications requiring high reliability. However, GNSS inherently encompasses varying error factors and is susceptible to malicious attacks such as spoofing. To ensure stable GNSS utilization, it is imperative to incorporate GNSS spoofing signal detection mechanisms into GNSS signal processing. In this paper, artificial intelligence (AI) to detect the spoofing in GNSS signal is proposed. The developed AI model effectively detects the spoofing within GNSS signals while the satellite navigation solutions changing over time. The AI was trained using simulated GNSS data, confirming the feasibility of employing AI techniques for GNSS signal processing. © 2023 IEEE.

关键词： Anomaly Detection Artificial Intelligence GNSS

来源：评论

学校读者我要写书评

暂无评论

C3D-VIT: CONSISTENCY-AWARE 3D vision TRANSFORMER FOR FACE FORGERY DETECTION

C3D-VIT: CONSISTENCY-AWARE 3D VISION TRANSFORMER FOR FACE FO...

引用

2025 IEEE international conference on Acoustics, Speech, and signal processing, ICASSP 2025

作者： Zhang, Jingyi Zhang, Peng Wang, Jingjing Hikvision Research Institute

ISBN: (纸本)9798350368741

Recently, spatial and temporal inconsistencies have been shown to effectively enhance the generalization performance of face forgery detection, as common forgery strategies create inconsistencies among face regions and across frames. However, current methods often focus on either spatial or temporal modeling and involve complex modules that are difficult to integrate into other frameworks. Some approaches require extra training data or forgery masks to learn consistency features, limiting their applicability and performance. Additionally, most current spatio-temporal methods rely on CNNs, with few designed for transformers, making it challenging to adapt them for transformer use. To address these issues, we propose an efficient consistency modeling block that unifies spatial and temporal consistency modeling within a transformer framework. Specifically, we calculate the reconstruction error between features and predictions via neighboring spatial and temporal dimensions to explicitly model consistency without extra training data or forgery masks. This block can be seamlessly integrated into a transformer framework, adding less than 1% to computing costs. © 2025 IEEE.

关键词： consistency Face forgery detection

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共500页 << < 83 84 85 86 87 88 89 90 91 92 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：