ISBN: 9798350301298 (print)
Video understanding tasks have traditionally been modeled by two separate architectures, each tailored for a distinct type of task. Sequence-based video tasks, such as action recognition, use a video backbone to directly extract spatiotemporal features, while frame-based video tasks, such as multiple object tracking (MOT), rely on a single fixed image backbone to extract spatial features. In contrast, we propose to unify video understanding tasks into one novel streaming video architecture, referred to as Streaming Vision Transformer (S-ViT). S-ViT first produces frame-level features with a memory-enabled, temporally-aware spatial encoder to serve frame-based video tasks. The frame features are then fed into a task-related temporal decoder to obtain spatiotemporal features for sequence-based tasks. The efficiency and efficacy of S-ViT are demonstrated by state-of-the-art accuracy on the sequence-based action recognition task and a competitive advantage over conventional architectures on the frame-based MOT task. We believe that the concept of a streaming video model and the implementation of S-ViT are solid steps towards a unified deep learning architecture for video understanding. Code will be available at https://***/yuzhms/Streaming-Video-Model.
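To make the two-stage design concrete, here is a minimal, hedged sketch in PyTorch of the streaming idea: a spatial encoder that attends over a small memory of past-frame tokens (usable per frame, e.g., for MOT), followed by a temporal decoder that aggregates frame features for a sequence-level task. All module names, sizes, and the memory scheme are illustrative assumptions, not the authors' implementation.

```python
# Illustrative sketch of the streaming two-stage architecture (assumptions, not S-ViT's code).
import torch
import torch.nn as nn

class MemorySpatialEncoder(nn.Module):
    """Encodes one frame at a time, attending to a memory of recent frame tokens."""
    def __init__(self, dim=256, heads=8, mem_len=4):
        super().__init__()
        self.proj = nn.Conv2d(3, dim, kernel_size=16, stride=16)  # patchify the frame
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.mem_len = mem_len
        self.memory = []  # token maps from recent frames

    def forward(self, frame):                                     # frame: (B, 3, H, W)
        tokens = self.proj(frame).flatten(2).transpose(1, 2)      # (B, N, dim)
        ctx = torch.cat(self.memory + [tokens], dim=1) if self.memory else tokens
        out, _ = self.attn(tokens, ctx, ctx)                      # temporally-aware spatial features
        self.memory = (self.memory + [tokens.detach()])[-self.mem_len:]
        return out                                                # frame-level features (frame-based tasks)

class TemporalDecoder(nn.Module):
    """Aggregates a sequence of frame features for sequence-based tasks."""
    def __init__(self, dim=256, num_classes=400):
        super().__init__()
        layer = nn.TransformerEncoderLayer(dim, nhead=8, batch_first=True)
        self.temporal = nn.TransformerEncoder(layer, num_layers=2)
        self.head = nn.Linear(dim, num_classes)

    def forward(self, frame_feats):                               # (B, T, N, dim)
        clip = frame_feats.mean(dim=2)                            # pool spatial tokens -> (B, T, dim)
        return self.head(self.temporal(clip).mean(dim=1))

encoder, decoder = MemorySpatialEncoder(), TemporalDecoder()
frames = torch.randn(8, 2, 3, 224, 224)                           # toy clip: (B, T, C, H, W)
feats = torch.stack([encoder(frames[:, t]) for t in range(frames.shape[1])], dim=1)
logits = decoder(feats)                                           # sequence-level (action) logits
```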
ISBN: 9798350301298 (print)
Vision Transformers (ViTs) have achieved overwhelming success, yet they suffer from vulnerable resolution scalability, i.e., the performance drops drastically when presented with input resolutions that are unseen during training. We introduce ResFormer, a framework built upon the seminal idea of multi-resolution training for improved performance on a wide spectrum of, mostly unseen, testing resolutions. In particular, ResFormer operates on replicated images of different resolutions and enforces a scale consistency loss to engage interactive information across different scales. More importantly, to alternate among varying resolutions effectively, especially novel ones at test time, we propose a global-local positional embedding strategy that changes smoothly conditioned on input sizes. We conduct extensive experiments for image classification on ImageNet. The results provide strong quantitative evidence that ResFormer has promising scaling abilities across a wide range of resolutions. For instance, ResFormer-B-MR achieves Top-1 accuracy of 75.86% and 81.72% when evaluated on relatively low and high resolutions respectively (i.e., 96 and 640), which is 48% and 7.49% better than DeiT-B. We also demonstrate that ResFormer is flexible and can be easily extended to semantic segmentation, object detection, and video action recognition.
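As a rough illustration of multi-resolution training with a scale consistency term, the following hedged sketch replicates a batch at several resolutions and pulls lower-resolution predictions toward the highest-resolution ones. The loss form, the weighting `beta`, and the assumption that `model` accepts variable input sizes are ours for illustration, not the paper's exact formulation.

```python
# Hedged sketch of a multi-resolution training step with a scale consistency loss.
import torch
import torch.nn.functional as F

def multi_resolution_step(model, images, labels, sizes=(96, 160, 224), beta=1.0):
    """Replicate each batch at several resolutions and tie their predictions."""
    logits = []
    for s in sizes:
        x = F.interpolate(images, size=(s, s), mode="bilinear", align_corners=False)
        logits.append(model(x))  # assumes the model handles variable input sizes
    # Supervised term averaged over all resolutions.
    ce = sum(F.cross_entropy(l, labels) for l in logits) / len(logits)
    # Scale consistency: pull lower-resolution predictions toward the
    # highest-resolution prediction (assumed the most reliable target here).
    target = logits[-1].detach().log_softmax(dim=-1)
    sc = sum(F.kl_div(l.log_softmax(dim=-1), target, log_target=True,
                      reduction="batchmean") for l in logits[:-1]) / (len(logits) - 1)
    return ce + beta * sc
```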
ISBN: 9798350301298 (print)
Semi-supervised action recognition is a challenging but critical task due to the high cost of video annotations. Existing approaches mainly use convolutional neural networks, while the recent revolutionary vision transformer models have been less explored. In this paper, we investigate the use of transformer models under the SSL setting for action recognition. To this end, we introduce SVFormer, which adopts a steady pseudo-labeling framework (i.e., EMA-Teacher) to cope with unlabeled video samples. While a wide range of data augmentations have been shown effective for semi-supervised image classification, they generally produce limited results for video recognition. We therefore introduce a novel augmentation strategy, Tube TokenMix, tailored for video data, where video clips are mixed via a mask with consistent masked tokens over the temporal axis. In addition, we propose a temporal warping augmentation to cover the complex temporal variation in videos, which stretches selected frames to various temporal durations in the clip. Extensive experiments on three datasets, Kinetics-400, UCF-101, and HMDB-51, verify the advantage of SVFormer. In particular, SVFormer outperforms the state of the art by 31.5% with fewer training epochs under the 1% labeling rate of Kinetics-400. Our method can hopefully serve as a strong benchmark and encourage future research on semi-supervised action recognition with Transformer networks. Code is released at https://***/ChenHsing/SVFormer.
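The Tube TokenMix idea, a token mask held constant along the temporal axis so the mixed regions form tubes through the clip, can be sketched as follows. The patch size, keep ratio, and label-mixing convention are assumptions for illustration, not the paper's exact recipe.

```python
# Illustrative sketch of tube-style token mixing for video clips.
import torch

def tube_token_mix(clip_a, clip_b, patch=16, keep_ratio=0.5):
    """clip_a, clip_b: (B, T, C, H, W). Returns the mixed clip and mixing lambda."""
    B, T, C, H, W = clip_a.shape
    gh, gw = H // patch, W // patch
    # One binary mask per sample over the token grid, reused for every frame,
    # so masked tokens are consistent over the temporal axis (a "tube").
    mask = (torch.rand(B, 1, 1, gh, gw) < keep_ratio).float()
    mask = mask.repeat_interleave(patch, dim=-2).repeat_interleave(patch, dim=-1)
    mixed = mask * clip_a + (1.0 - mask) * clip_b
    lam = mask.mean().item()  # fraction of clip_a in the mix (for mixing the labels)
    return mixed, lam

clips = torch.randn(2, 4, 8, 3, 224, 224)  # two toy batches of clips
mixed, lam = tube_token_mix(clips[0], clips[1])
```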
This paper expounds an automatic recognition method for parts based on computer vision. The feature database of the processed parts is constructed using a machine learning method. Image preprocessing, threshold segme...
Binocular vision-based target detection is one of the hot topics in computer vision, where the technique aims to detect and localize target objects in images. The technology has applications in fields such as autonomo...
ISBN: 9789819985364; 9789819985371 (print)
Sign language is a crucial communication carrier among deaf people to express and exchange their thoughts and emotions. However, ordinary individuals cannot acquire proficiency in sign language in the short term, which leaves deaf people facing huge barriers with the hearing community. Regarding this conundrum, it is valuable to investigate Sign Language Recognition (SLR) equipped with sensors that collect data for subsequent computer vision processing. This study reviews sensor-based SLR methods, which transform heterogeneous signals from various underlying sensors into high-level motion representations. Specifically, we summarize current developments in sensor-based SLR techniques from the perspective of modalities. Additionally, we distill the sensor-based SLR paradigm and compare state-of-the-art works, including computer vision approaches. Following that, we conclude with research opportunities and expectations for future work.
ISBN: 9798350301298 (print)
Neuroimage processing tasks like segmentation, reconstruction, and registration are central to the study of neuroscience. Robust deep learning strategies and architectures used to solve these tasks are often similar. Yet, when presented with a new task or a dataset with different visual characteristics, practitioners most often need to train a new model, or fine-tune an existing one. This is a time-consuming process that poses a substantial barrier for the thousands of neuroscientists and clinical researchers who often lack the resources or machine-learning expertise to train deep learning models. In practice, this leads to a lack of adoption of deep learning and to neuroscience tools being dominated by classical frameworks. We introduce Neuralizer, a single model that generalizes to previously unseen neuroimaging tasks and modalities without the need for re-training or fine-tuning. Tasks do not have to be known a priori, and generalization happens in a single forward pass during inference. The model can solve processing tasks across multiple image modalities, acquisition methods, and datasets, and generalize to tasks and modalities it has not been trained on. Our experiments on coronal slices show that when few annotated subjects are available, our multi-task network outperforms task-specific baselines without training on the task.
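The abstract does not spell out the conditioning mechanism, but generalization in a single forward pass is commonly realized by conditioning on a small context set of annotated (input, output) examples that defines the task at inference time. The toy sketch below illustrates only that interface; the architecture and all names are hypothetical, not Neuralizer's.

```python
# Toy sketch of task conditioning via a context set (all details are assumptions).
import torch
import torch.nn as nn

class ContextConditionedNet(nn.Module):
    def __init__(self, ch=32):
        super().__init__()
        self.ctx_enc = nn.Conv2d(2, ch, 3, padding=1)   # encodes one (input, output) pair
        self.backbone = nn.Sequential(
            nn.Conv2d(1 + ch, ch, 3, padding=1), nn.ReLU(),
            nn.Conv2d(ch, 1, 3, padding=1),
        )

    def forward(self, image, ctx_in, ctx_out):
        # image: (B, 1, H, W); ctx_in/ctx_out: (B, K, 1, H, W) context pairs.
        B, K, _, H, W = ctx_in.shape
        pairs = torch.cat([ctx_in, ctx_out], dim=2).flatten(0, 1)   # (B*K, 2, H, W)
        ctx = self.ctx_enc(pairs).view(B, K, -1, H, W).mean(dim=1)  # average over the set
        return self.backbone(torch.cat([image, ctx], dim=1))        # task-conditioned output

net = ContextConditionedNet()
pred = net(torch.randn(2, 1, 64, 64),          # query slice
           torch.randn(2, 3, 1, 64, 64),       # 3 annotated example inputs
           torch.randn(2, 3, 1, 64, 64))       # matching example outputs
```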
The watermark detection method under the Android system combines computer vision, image processing, and pattern matching technologies, aiming to provide an effective and automatic watermark detection solution. Through...
Bokeh effect transformation is a novel task in computer vision and computational photography. It aims to convert bokeh effects from one camera lens to another. To this end, we introduce a new concept of blur ratio, wh...
ISBN: 9798350353006 (print)
Templates serve as a good starting point to implement a design (e.g., a banner or a slide), but it takes great effort from designers to create them manually. In this paper, we present Desigen, an automatic template creation pipeline which generates background images as well as harmonious layout elements over the background. Different from natural images, a background image should preserve enough non-salient space for the overlaid layout elements. To equip existing advanced diffusion-based models with stronger spatial control, we propose two simple but effective techniques to constrain the saliency distribution and reduce the attention weights in desired regions during the background generation process. Then, conditioned on the background, we synthesize the layout with a Transformer-based autoregressive generator. To achieve a more harmonious composition, we propose an iterative inference strategy that adjusts the synthesized background and layout in multiple rounds. We construct a design dataset with more than 40k advertisement banners to verify our approach. Extensive experiments demonstrate that the proposed pipeline generates high-quality templates comparable to those of human designers. Beyond single-page designs, we further show an application of presentation generation that outputs a set of theme-consistent slides. The data and code are available at https://***/desigen.
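The iterative inference strategy can be pictured as an alternating loop: regenerate the background while suppressing saliency where the layout wants space, then re-synthesize the layout on the updated background. The toy below shows only that control flow; `generate_background` and `generate_layout` are hypothetical stand-ins over a coarse saliency grid, not the paper's diffusion and Transformer models.

```python
# Toy sketch of alternating background/layout refinement (stand-ins, not Desigen's models).
import numpy as np

rng = np.random.default_rng(0)
GRID = 8  # page discretized into an 8x8 grid of candidate regions

def generate_background(avoid):
    """Stand-in for the diffusion generator: a saliency map pushed low inside
    regions the current layout occupies (the saliency/attention constraint)."""
    saliency = rng.random((GRID, GRID))
    for (r, c) in avoid:
        saliency[r, c] *= 0.1
    return saliency

def generate_layout(saliency, k=3):
    """Stand-in for the autoregressive layout model: place k elements on the
    least salient cells of the current background."""
    flat = np.argsort(saliency, axis=None)[:k]
    return [tuple(np.unravel_index(i, saliency.shape)) for i in flat]

boxes = []
for _ in range(3):                       # iterative refinement rounds
    background = generate_background(boxes)
    boxes = generate_layout(background)
print(boxes)                             # final element positions on the grid
```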