Search Results - Inner Mongolia University Library

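Help

Field codes:

T = title (book title, article title), A = author (responsible party), K = subject term, P = publication name, PU = publisher name, O = institution (author affiliation, degree-granting institution, patent applicant), L = Chinese Library Classification number, C = subject classification number, U = all fields, Y = year (year of publication, degree year, standard release year)

Search rules:

AND means "and"; OR means "or"; NOT means "does not contain". (Operators must be uppercase, with one space on each side.)

Search examples:

Example 1: (K=图书馆学 OR K=情报学) AND A=范并思 AND Y=1982-2016
Example 2: P=计算机应用与软件 AND (U=C++ OR U=Basic) NOT K=Visual AND Y=2011-2016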
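
The field codes and operators in the help text can also be assembled programmatically. The following is a minimal sketch that rebuilds the first search example as a string. The helper names (`term`, `combine`, `group`) are hypothetical and are not part of the library's system. Only the field codes, the uppercase operators, and the single-space padding come from the help text.

```python
# Field codes and Boolean operators are taken from the help text;
# the helper names (term, combine, group) are hypothetical.
FIELD_CODES = {"T", "A", "K", "P", "PU", "O", "L", "C", "U", "Y"}
OPERATORS = {"AND", "OR", "NOT"}


def term(field: str, value: str) -> str:
    """Return a single FIELD=value term, rejecting unknown field codes."""
    if field not in FIELD_CODES:
        raise ValueError(f"unknown field code: {field}")
    return f"{field}={value}"


def combine(operator: str, *parts: str) -> str:
    """Join terms with an uppercase operator padded by one space on each side."""
    if operator not in OPERATORS:
        raise ValueError(f"unknown operator: {operator}")
    return f" {operator} ".join(parts)


def group(expression: str) -> str:
    """Wrap an expression in parentheses so it is evaluated as one unit."""
    return f"({expression})"


# Rebuild the first search example from the help text.
query = combine(
    "AND",
    group(combine("OR", term("K", "图书馆学"), term("K", "情报学"))),
    term("A", "范并思"),
    term("Y", "1982-2016"),
)
print(query)
```

Validating field codes and operators before joining catches lowercase operators or mistyped codes, which the search rules say are not accepted.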



Search query: "Any field = IEEE Conference on Computer Vision and Pattern Recognition"

52,938 records in total; showing records 4281-4290


Multi-Digit Handwritten Recognition: A CNN-LSTM Hybrid Approach with Wavelet Transforms

14th International Conference on Computer and Knowledge Engineering, ICCKE 2024

Authors: Kazempour, Amin; Tanha, Jafar
Affiliation: Department of Electrical and Computer Engineering, University of Tabriz, Tabriz, Iran

ISBN: (print) 9798331511272

Handwritten digit recognition remains a pivotal area in machine learning and computer vision, essential for applications like license plate identification, form processing, and historical document reading. Addressing the challenges of multi-digit and multi-language recognition, including variations in handwriting styles across different languages, we propose a novel model integrating convolutional and recurrent neural networks with an attention mechanism. Unlike conventional methods, our model employs wavelet transforms instead of max pooling to preserve image texture and edges. We created a comprehensive dataset containing both English and Persian digits, featuring 80,000 training and 20,000 test images with 1–5 digit numbers. To demonstrate the superiority of the proposed model, we conducted extensive experiments and compared it to some state-of-the-art models. Our model demonstrated remarkable accuracy, achieving 99.58% for single digits and 98.03% for sequences. Extensive experiments validated the efficacy of our approach, highlighting its potential for future research in multi-digit recognition systems across various languages. © 2024 IEEE.

Keywords: Convolutional neural networks


Single-Shot Freestyle Dance Reenactment

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Gafni, Oran; Ashual, Oron; Wolf, Lior
Affiliations: Facebook AI Res, Menlo Pk, CA 94025, USA; Tel Aviv Univ, Tel Aviv, Israel

ISBN: (print) 9781665445092

The task of motion transfer between a source dancer and a target person is a special case of the pose transfer problem, in which the target person changes their pose in accordance with the motions of the dancer. In this work, we propose a novel method that can reanimate a single image by arbitrary video sequences, unseen during training. The method combines three networks: (i) a segmentation-mapping network, (ii) a realistic frame-rendering network, and (iii) a face refinement network. By separating this task into three stages, we are able to attain a novel sequence of realistic frames, capturing natural motion and appearance. Our method obtains significantly better visual quality than previous methods and is able to animate diverse body types and appearances, which are captured in challenging poses.

Keywords: Training; Visualization; Image segmentation; Computer vision; Motion segmentation; Face recognition; Biological system modeling


WiMix: A Lightweight Multimodal Human Activity Recognition System based on WiFi and Vision

20th IEEE International Conference on Mobile Ad Hoc and Smart Systems, MASS 2023

Authors: Chen, Jiajing; Yang, Kun; Zheng, Xiaolong; Dong, Shengbo; Liu, Liang; Ma, Huadong
Affiliations: Beijing University of Posts and Telecommunications, China; Beijing Institute of Remote Sensing Equipment, China

ISBN: (print) 9798350324334

Human activity recognition is important for a wide range of applications such as surveillance systems and human-computer interaction. Computer vision-based human activity recognition suffers from performance degradation in many real-world scenarios where the illumination is poor. On the other hand, recently proposed WiFi sensing, which leverages ubiquitous WiFi signals for activity recognition, is not affected by illumination but has low accuracy in dynamic environments. In this paper, we propose WiMix, a lightweight and robust multimodal system that leverages both WiFi and vision for human activity recognition. To deal with complex real-world environments, we design a lightweight mix cross attention module for automatic WiFi and video weight distribution. To reduce the system response time while ensuring the sensing accuracy, we design an end-to-end framework together with an efficient classifier to extract spatial and temporal features of two modalities. Extensive experiments are conducted in real-world scenarios and the results demonstrate that WiMix achieves 98.5% activity recognition accuracy in 3 scenarios, which outperforms the state-of-the-art 89.6% sensing accuracy using WiFi and video modalities. WiMix can also reduce the inference latency from 1268.25ms to 217.36ms, significantly improving the response time. © 2023 IEEE.

Keywords: Wireless local area networks (WLAN)


CE-PeopleSeg: Real-time people segmentation with 10% CPU usage for video conference

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Jiang, Ziyu; He, Zhenhua; Huang, Xueqin; Yang, Zibin; Tan, Pearl
Affiliations: Texas A&M Univ, College Stn, TX 77843, USA; EEO, Washington, DC, USA

ISBN: (print) 9781665448994

Nowadays, video conference solutions are widely adopted for companies, education, and government. People segmentation is crucial for supporting virtual background, an essential video conference function to protect users' privacy. This paper demonstrated a people segmentation framework called CE-PeopleSeg, which employed an efficient segmentation method, structural pruning, and dynamic frame skipping techniques, leading to a fast inference speed on CPU. Our extensive experiments show that the proposed CE-PeopleSeg can achieve a high prediction mIoU of 87.9% on Supervised People Dataset while reaching a real-time inference speed of 32.40 fps on CPU with very low usage of 10%. Our code would be released at https://***/geekJZY/***.

Keywords: Visualization; Privacy; Government; Education; Graphics processing units; Switches; Streaming media


Interactive Sign Language Learning System Using Computer Vision and Deep Learning

International Conference on Advances in Computing, Communication and Applied Informatics (ACCAI)

Authors: Murugan, Suganiya; Ali, Mir Kasif; Singari, Dhanvanth; Kumar, S. Pradeep
Affiliations: SRM Inst Sci & Technol, Dept Comp Technol, Chennai, Tamil Nadu, India; Vels Inst Sci Technol & Adv Studies, Elect & Elect Engn, Chennai, Tamil Nadu, India

ISBN: (print) 9798350389432; 9798350389449; 9798350389456

In an era where communication is key, the gap in accessible tools for those with hearing impairments or speech disabilities is significant. These individuals often face obstacles in education and social interaction due to a heavy reliance on spoken language and a lack of sign language resources. The Interactive Sign Language Learning System (ISLLS) addresses this gap by providing an innovative platform for learning sign language, enhanced with voice output to assist individuals with speech disabilities. This feature allows for auditory feedback alongside visual sign learning, enriching the educational experience. The ISLLS employs advanced technologies like computer vision and deep learning to facilitate sign recognition and text-to-sign conversion. With the new voice output, it further aids those with speech impairments, expanding its inclusivity. This system offers a comprehensive learning tool that caters to a diverse user base, enabling people with speech difficulties to engage more fully with the world. The ISLLS is a significant step towards a more inclusive society, offering a user-friendly platform that not only improves the learning of sign language but also empowers people with speech disabilities to connect and thrive, representing progress in both technology and social inclusivity.

Keywords: Sign Language; CNN Algorithm; NLP Techniques; Gesture recognition; Pose Estimation; YoloV7 Algorithm


A Review on Multimodal Fusion Method for Gesture Recognition

14th International Conference on Ubiquitous and Future Networks, ICUFN 2023

Authors: Lee, Dong Jae; Choi, Sunwoong
Affiliation: Kookmin University, Department of Electronics Engineering, Seoul, Republic of Korea

ISBN: (print) 9798350335385

Recent research using deep learning has been actively conducted in various fields, including computer vision, reinforcement learning, classifiers, and more. AlphaGo, which learned to play Go and beat professional players, was developed based on reinforcement learning research. This paper focuses on computer vision in particular, which also has multiple subfields such as image restoration and image compression. This paper examines the use of deep learning with video data in computer vision. Video data can be divided into RGB and Depth, and the fusion of these two types of data will be used, referred to as multimodal fusion. By reviewing several papers, this method will be applied to gesture recognition research for potential improvements. © 2023 IEEE.

Keywords: Computer vision


Comparative Analysis of Deep Learning Architectures for Multilingual Digit Recognition

2nd IEEE International Conference on Data Science and Network Security, ICDSNS 2024

Authors: Mamatha Bai, B.G.; Sarangi, Akruti; Kashyap, Akanksha; Yadav, Himanshi
Affiliation: Nitte Meenakshi Institute of Technology, Department of Computer Science & Engineering, Bangalore, India

ISBN: (print) 9798350373110

Digit recognition is foundational in pattern recognition and machine learning, with applications in document processing and optical character recognition. Current research often targets English digits, overlooking language-independent systems. This paper bridges this gap by analyzing deep-learning architectures and activation functions for Hindi, English, and Kannada digits. We evaluate CNN, LeNet, ResNet, MobileNet, and Inception (GoogleNet) models, and activation functions like sigmoid, tanh, linear, ELU, Leaky ReLU, and ReLU. The results indicate ResNet achieves 99.498% accuracy in English, 99.43% in Hindi, and 97.67% in Kannada datasets. This study enhances the design and optimization of robust digit recognition systems across diverse languages and scripts, promoting efficient models for practical applications. Despite advancements with CNNs, research predominantly focuses on English digits, highlighting the need for language-independent recognition systems. © 2024 IEEE.

Keywords: Convolutional neural networks


Multi-Label Activity Recognition using Activity-specific Features and Activity Correlations

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Zhang, Yanyi; Li, Xinyu; Marsic, Ivan
Affiliations: Rutgers Univ New Brunswick, Elect & Comp Engn Dept, New Brunswick, NJ 08901, USA; Amazon Web Serv, Seattle, WA, USA

ISBN: (print) 9781665445092

Multi-label activity recognition is designed for recognizing multiple activities that are performed simultaneously or sequentially in each video. Most recent activity recognition networks focus on single activities and assume only one activity in each video. These networks extract shared features for all the activities, which are not designed for multi-label activities. We introduce an approach to multi-label activity recognition that extracts independent feature descriptors for each activity and learns activity correlations. This structure can be trained end-to-end and plugged into any existing network structures for video classification. Our method outperformed state-of-the-art approaches on four multi-label activity recognition datasets. To better understand the activity-specific features that the system generated, we visualized these activity-specific features in the Charades dataset. The code will be released later.

Keywords: Visualization; Computer vision; Correlation; Codes; Activity recognition; Feature extraction; IEEE activities


Topological Planning with Transformers for Vision-and-Language Navigation

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Chen, Kevin; Chen, Junshen K.; Chuang, Jo; Vazquez, Marynel; Savarese, Silvio
Affiliations: Stanford Univ, Stanford, CA 94305, USA; Yale Univ, New Haven, CT 06520, USA

ISBN: (print) 9781665445092

Conventional approaches to vision-and-language navigation (VLN) are trained end-to-end but struggle to perform well in freely traversable environments. Inspired by the robotics community, we propose a modular approach to VLN using topological maps. Given a natural language instruction and topological map, our approach leverages attention mechanisms to predict a navigation plan in the map. The plan is then executed with low-level actions (e.g. FORWARD, ROTATE) using a robust controller. Experiments show that our method outperforms previous end-to-end approaches, generates interpretable navigation plans, and exhibits intelligent behaviors such as backtracking.

Keywords: Computer vision; Backtracking; Navigation; Natural languages; Buildings; Transformers; Planning


Unlocking a New Era in Human-Computer Interaction with Transformative AI Virtual Mice for Unparalleled Accessibility, Hygiene, and User Experience

2024 IEEE International Conference on Recent Advances in Science and Engineering Technology, ICRASET 2024

Authors: Prabakaran, Manjula; Kusuma, S.; Aman, S.; Dinesh, P.; Faliha, S.
Affiliation: Andhra Pradesh, Madanapalle, India

ISBN: (print) 9798350388602

The ubiquitous physical mouse, despite its years of service, presents challenges in accessibility, hygiene, and user experience. This paper explores the transformative potential of AI Virtual Mouse (AI VM) systems, powered by deep learning, to revolutionize human-computer interaction (HCI). We delve into the intricate interplay between computer vision and deep learning architectures, dissecting various approaches to hand segmentation, gesture recognition, and model training. Through rigorous evaluation based on accuracy, speed, and robustness, we analyze the strengths and weaknesses of different configurations, paving the way for optimal performance. This example of the advancement of touchless HCI interfaces also opens doors to exciting future directions, including personalized control, integration with AR/VR technologies, and broader applications across diverse domains. For decades, the humble physical mouse has reigned supreme as our gateway to the digital world. Yet, despite its familiarity, it comes with limitations - accessibility hurdles, hygiene concerns, and a user experience that has not fundamentally evolved. This paper heralds a transformative era, where AI Virtual Mouse (AI VM) systems, powered by the magic of deep learning, are poised to revolutionize human-computer interaction (HCI). © 2024 IEEE.

Keywords: Gesture recognition



No. 235 Daxue West Street, Saihan District, Hohhot, Inner Mongolia Autonomous Region. Postal code: 010021
