ISBN: 9781665448994 (Print)
Labeling videos at scale is impractical. Consequently, self-supervised visual representation learning is key for efficient video analysis. Recent success in learning image representations suggests contrastive learning is a promising framework to tackle this challenge. However, when applied to real-world videos, contrastive learning may unknowingly lead to separation of instances that contain semantically similar events. In our work, we introduce a cooperative variant of contrastive learning to utilize complementary information across views and address this issue. We use data-driven sampling to leverage implicit relationships between multiple input video views, whether observed (e.g. RGB) or inferred (e.g. flow, segmentation masks, poses). We are among the first to explore exploiting inter-instance relationships to drive learning. We experimentally evaluate our representations on the downstream task of action recognition. Our method achieves competitive performance on standard benchmarks (UCF101, HMDB51, Kinetics400). Furthermore, qualitative experiments illustrate that our models can capture higher-order class relationships. The code is available at http://***/nishantrai18/CoCon.
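As a concrete reference for the contrastive framework this abstract builds on, the sketch below computes a standard InfoNCE loss between embeddings of two views (e.g. RGB and flow) of the same clips. It is a minimal illustration assuming PyTorch tensors; the cooperative, inter-instance sampling that distinguishes CoCon is not reproduced here, and the function name multiview_infonce is ours.

```python
import torch
import torch.nn.functional as F

def multiview_infonce(z1: torch.Tensor, z2: torch.Tensor, tau: float = 0.1) -> torch.Tensor:
    # z1, z2: (N, D) embeddings of two views (e.g. RGB and flow) of the same N clips
    z1 = F.normalize(z1, dim=1)
    z2 = F.normalize(z2, dim=1)
    logits = z1 @ z2.t() / tau            # (N, N) cross-view cosine similarities
    targets = torch.arange(z1.size(0), device=z1.device)
    # the matching clip in the other view is the positive; all other clips are negatives
    return F.cross_entropy(logits, targets)
```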
ISBN: 9798350365474 (Digital), 9798350365481 (Print)
Foggy-scene semantic segmentation (FSSS) is highly challenging due to the diverse effects of fog on scene properties and the limited training data. Existing research has mainly focused on domain adaptation for FSSS, which has practical limitations when dealing with new scenes. In our paper, we introduce domain-generalized FSSS, which can work effectively on unknown distributions without extensive training. To address domain gaps, we propose a frequency decoupling (FreD) approach that separates fog-related effects (amplitude) from scene semantics (phase) in feature representations. Our method is compatible with both CNN and vision Transformer backbones and outperforms existing approaches in various scenarios.
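To make the frequency-decoupling idea concrete, the sketch below splits a feature map into its amplitude and phase spectra with a 2D FFT, following the abstract's association of amplitude with fog effects and phase with scene semantics. It assumes PyTorch feature tensors; the function names and the choice to operate directly on raw features are illustrative, not the paper's exact FreD module.

```python
import torch

def frequency_decouple(feat: torch.Tensor):
    # feat: (N, C, H, W) real-valued feature map
    spec = torch.fft.fft2(feat, norm="ortho")   # complex spectrum over the spatial dims
    amplitude = spec.abs()                      # fog-related appearance cues (per the abstract)
    phase = spec.angle()                        # scene semantics / structure
    return amplitude, phase

def recompose(amplitude: torch.Tensor, phase: torch.Tensor) -> torch.Tensor:
    # rebuild a real-valued feature map from (possibly processed) amplitude and phase
    spec = torch.polar(amplitude, phase)        # amplitude * exp(i * phase)
    return torch.fft.ifft2(spec, norm="ortho").real
```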
ISBN: 9781665448994 (Print)
One of the most fundamental and information-laden actions humans do is to look at objects. However, a survey of current works reveals that existing gaze-related datasets annotate only the pixel being looked at, and not the boundaries of a specific object of interest. This lack of object annotation presents an opportunity for further advancing gaze estimation research. To this end, we present a challenging new task called gaze object prediction, where the goal is to predict a bounding box for a person's gazed-at object. To train and evaluate gaze networks on this task, we present the Gaze On Objects (GOO) dataset. GOO is composed of a large set of synthetic images (GOO-Synth) supplemented by a smaller subset of real images (GOO-Real) of people looking at objects in a retail environment. Our work establishes extensive baselines on GOO by re-implementing and evaluating selected state-of-the-art models on the tasks of gaze following and domain adaptation. Code is available on GitHub.
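For a sense of the task's inputs and outputs, the snippet below implements a naive baseline for gaze object prediction: given a predicted gaze point and candidate object boxes, it returns the box whose center is closest to the gaze point. This only illustrates the task format under assumed tensor layouts; it is not a model from the paper.

```python
import torch

def gaze_to_object_box(gaze_xy: torch.Tensor, boxes: torch.Tensor) -> torch.Tensor:
    # gaze_xy: (2,) predicted gaze point; boxes: (K, 4) candidates as (x1, y1, x2, y2)
    centers = torch.stack([(boxes[:, 0] + boxes[:, 2]) / 2,
                           (boxes[:, 1] + boxes[:, 3]) / 2], dim=1)   # (K, 2) box centers
    dists = torch.linalg.norm(centers - gaze_xy[None, :], dim=1)      # distance to the gaze point
    return boxes[dists.argmin()]                                      # closest candidate box
```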
This paper proposes a novel model for predicting body mass index and various body part sizes using front, side, and back body images. The model is trained on a large dataset of labeled images. The results show that th...
ISBN: 9781665448994 (Print)
Anticipating human actions is an important task that needs to be addressed for the development of reliable intelligent agents, such as self-driving cars or robot assistants. While the ability to make future predictions with high accuracy is crucial for designing anticipation approaches, the speed at which inference is performed is no less important. Methods that are accurate but not sufficiently fast introduce high latency into the decision process and thus increase the reaction time of the underlying system. This poses a problem for domains such as autonomous driving, where reaction time is crucial. In this work, we propose a simple and effective multi-modal architecture based on temporal convolutions. Our approach stacks a hierarchy of temporal convolutional layers and does not rely on recurrent layers, ensuring fast prediction. We further introduce a multi-modal fusion mechanism that captures the pairwise interactions between RGB, flow, and object modalities. Results on two large-scale datasets of egocentric videos, EPIC-Kitchens-55 and EPIC-Kitchens-100, show that our approach achieves performance comparable to state-of-the-art approaches while being significantly faster.
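The following sketch shows the kind of recurrence-free temporal model the abstract describes: a stack of dilated 1D convolutions over per-frame features. Layer widths, depth, and the dilation schedule are illustrative assumptions rather than the paper's exact architecture, and the multi-modal pairwise fusion is omitted.

```python
import torch
import torch.nn as nn

class TemporalConvStack(nn.Module):
    """A minimal stack of dilated temporal convolutions over per-frame features,
    illustrating a recurrence-free anticipation backbone (sizes are illustrative)."""

    def __init__(self, in_dim: int, hidden: int = 256, layers: int = 4):
        super().__init__()
        blocks, dim = [], in_dim
        for i in range(layers):
            dilation = 2 ** i  # exponentially growing temporal receptive field
            blocks += [
                nn.Conv1d(dim, hidden, kernel_size=3, padding=dilation, dilation=dilation),
                nn.ReLU(inplace=True),
            ]
            dim = hidden
        self.net = nn.Sequential(*blocks)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, feature_dim, time), e.g. per-frame RGB/flow/object features
        return self.net(x)  # (batch, hidden, time), fed to an anticipation head

# usage: out = TemporalConvStack(1024)(torch.randn(2, 1024, 32))
```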
In recent years, Multi-Camera Multiple Object Tracking (MCMT) has gained significant attention as a crucial computer vision application. Research focuses on data association and track detection. However, accurately se...
ISBN: 9781665448994 (Print)
In this paper, we explore the role of Instance Normalization in low-level vision tasks. Specifically, we present a novel block, the Half Instance Normalization Block (HIN Block), to boost the performance of image restoration networks. Based on the HIN Block, we design a simple and powerful multi-stage network named HINet, which consists of two subnetworks. With the help of the HIN Block, HINet surpasses the state-of-the-art (SOTA) on various image restoration tasks. For image denoising, we exceed the SOTA by 0.11 dB and 0.28 dB in PSNR on the SIDD dataset, with only 7.5% and 30% of its multiplier-accumulator operations (MACs) and 6.8× and 2.9× speedups, respectively. For image deblurring, we achieve comparable performance with 22.5% of its MACs and a 3.3× speedup on the REDS and GoPro datasets. For image deraining, we exceed it by 0.3 dB in PSNR on the average result over multiple datasets, with a 1.4× speedup. With HINet, we won 1st place in the NTIRE 2021 Image Deblurring Challenge, Track 2: JPEG Artifacts, with a PSNR of 29.70.
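A minimal sketch of the Half Instance Normalization idea follows: instance-normalize only half of a block's channels, keep the other half untouched, concatenate, and add a residual path. The exact layer layout here is an assumption based on the description above, not a line-by-line copy of the official HINet block.

```python
import torch
import torch.nn as nn

class HalfInstanceNormBlock(nn.Module):
    """Sketch of a Half Instance Normalization block (out_ch assumed even)."""

    def __init__(self, in_ch: int, out_ch: int):
        super().__init__()
        self.conv1 = nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1)
        self.norm = nn.InstanceNorm2d(out_ch // 2, affine=True)
        self.conv2 = nn.Conv2d(out_ch, out_ch, kernel_size=3, padding=1)
        self.act = nn.LeakyReLU(0.2, inplace=True)
        self.skip = nn.Conv2d(in_ch, out_ch, kernel_size=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        y = self.conv1(x)
        a, b = torch.chunk(y, 2, dim=1)           # split channels in half
        y = torch.cat([self.norm(a), b], dim=1)   # instance-normalize only one half
        y = self.act(y)
        y = self.act(self.conv2(y))
        return y + self.skip(x)                   # residual connection
```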
ISBN: 9781665448994 (Print)
Understanding broadcast videos is a challenging task in computer vision, as it requires generic reasoning capabilities to appreciate the content offered by the video editing. In this work, we propose SoccerNet-v2, a novel large-scale corpus of manual annotations for the SoccerNet [24] video dataset, along with open challenges to encourage more research in soccer understanding and broadcast production. Specifically, we release around 300k annotations within SoccerNet's 500 untrimmed broadcast soccer videos. We extend current tasks in the realm of soccer to include action spotting and camera shot segmentation with boundary detection, and we define a novel replay grounding task. For each task, we provide and discuss benchmark results, reproducible with our open-source adapted implementations of the most relevant works in the field. SoccerNet-v2 is presented to the broader research community to help push computer vision closer to automatic solutions for more general video understanding and production purposes.
ISBN: 9781665448994 (Print)
We present SrvfNet, a generative deep learning framework for the joint multiple alignment of large collections of functional data comprising square-root velocity functions (SRVF) to their templates. Our proposed framework is fully unsupervised and is capable of aligning to a predefined template as well as jointly predicting an optimal template from data while simultaneously achieving alignment. Our network is constructed as a generative encoder-decoder architecture comprising fully-connected layers capable of producing a distribution space of the warping functions. We demonstrate the strength of our framework by validating it on synthetic data as well as diffusion profiles from magnetic resonance imaging (MRI) data.
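For readers unfamiliar with the representation, the snippet below computes the standard square-root velocity function q(t) = f'(t) / sqrt(|f'(t)|) for a sampled 1-D function and applies a warping function by re-parameterization. The finite-difference sampling and epsilon handling are illustrative choices, not part of SrvfNet itself.

```python
import numpy as np

def srvf(f: np.ndarray, t: np.ndarray, eps: float = 1e-8) -> np.ndarray:
    """Square-root velocity function of a sampled function f(t):
    q(t) = f'(t) / sqrt(|f'(t)|), with eps guarding against division by zero."""
    df = np.gradient(f, t)                  # finite-difference derivative
    return df / np.sqrt(np.abs(df) + eps)   # square-root velocity

def warp(f: np.ndarray, t: np.ndarray, gamma: np.ndarray) -> np.ndarray:
    """Re-parameterize f by a warping function gamma sampled on t: f(gamma(t))."""
    return np.interp(gamma, t, f)
```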
ISBN: 9781665448994 (Print)
To help meet the increasing need for dynamic vision sensor (DVS) event camera data, this paper proposes the v2e toolbox, which generates realistic synthetic DVS events from intensity frames. It also clarifies incorrect claims about DVS motion blur and latency characteristics in recent literature. Unlike other toolboxes, v2e includes pixel-level Gaussian event threshold mismatch, finite intensity-dependent bandwidth, and intensity-dependent noise. Realistic DVS events are useful in training networks for uncontrolled lighting conditions. The use of v2e synthetic events is demonstrated in two experiments. The first experiment is object recognition with the N-Caltech 101 dataset. Results show that pretraining on various v2e lighting conditions improves generalization when transferring to real DVS data for a ResNet model. The second experiment shows that for night driving, a car detector trained with v2e events achieves an average accuracy improvement of 40% compared to a YOLOv3 detector trained on intensity frames.
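The sketch below shows the idealized core of intensity-frame-to-event conversion: emit an ON or OFF event whenever the log intensity at a pixel moves past a contrast threshold since its last event. v2e layers per-pixel threshold mismatch, finite intensity-dependent bandwidth, and noise on top of this; those effects, as well as the variable and function names here, are simplifications and assumptions.

```python
import numpy as np

def frames_to_events(frames, timestamps, theta_on=0.2, theta_off=0.2):
    """Idealized DVS event generation from a sequence of grayscale frames.
    Emits at most one (t, x, y, polarity) event per pixel per frame; real sensors
    (and v2e) additionally model threshold mismatch, bandwidth, and noise."""
    log_mem = np.log(frames[0].astype(np.float64) + 1e-3)  # memorized log intensity per pixel
    events = []
    for frame, t in zip(frames[1:], timestamps[1:]):
        diff = np.log(frame.astype(np.float64) + 1e-3) - log_mem
        on = diff >= theta_on
        off = diff <= -theta_off
        for y, x in zip(*np.nonzero(on | off)):
            events.append((t, int(x), int(y), 1 if on[y, x] else -1))
        # move the memorized intensity by one threshold step where events fired
        log_mem[on] += theta_on
        log_mem[off] -= theta_off
    return events
```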