ISBN (digital): 9798350365474
ISBN (print): 9798350365481
Event-based data are commonly encountered in edge computing environments where efficiency and low latency are critical. To interface with such data and leverage their rich temporal features, we propose a causal spatiotemporal convolutional network. This solution targets efficient implementation on edge-appropriate hardware with limited resources in three ways: 1) it deliberately targets a simple architecture and set of operations (convolutions, ReLU activations); 2) it can be configured to perform online inference efficiently via buffering of layer outputs; and 3) it can achieve more than 90% activation sparsity through regularization during training, enabling very significant efficiency gains on event-based processors. In addition, we propose a general affine augmentation strategy acting directly on the events, which alleviates the problem of dataset scarcity for event-based systems. We apply our model to the AIS 2024 event-based eye tracking challenge, reaching a score of 0.9916 p10 accuracy on the Kaggle private test set.
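A minimal sketch of the output-buffering idea for online inference, assuming a hypothetical single causal layer; this illustrates the general technique, not the authors' network:

```python
import torch
import torch.nn as nn

class BufferedCausalConv1d(nn.Module):
    """Causal temporal convolution that caches its receptive field so
    online inference processes one new time step per call (illustrative)."""

    def __init__(self, channels: int, kernel_size: int = 3):
        super().__init__()
        self.conv = nn.Conv1d(channels, channels, kernel_size)
        # Buffer holds the last (kernel_size - 1) inputs between calls.
        self.register_buffer("cache", torch.zeros(1, channels, kernel_size - 1))

    def forward(self, x_t: torch.Tensor) -> torch.Tensor:
        # x_t: (batch=1, channels, 1) -- a single new time step.
        window = torch.cat([self.cache, x_t], dim=2)  # full receptive field
        self.cache = window[:, :, 1:].detach()        # slide the buffer forward
        return torch.relu(self.conv(window))          # (1, channels, 1)

layer = BufferedCausalConv1d(channels=8)
for _ in range(5):                                    # streaming, step by step
    out = layer(torch.randn(1, 8, 1))
print(out.shape)  # torch.Size([1, 8, 1])
```

For the sparsity objective, an L1 penalty on the ReLU outputs (e.g., `loss = task_loss + lam * act.abs().mean()`) is one common regularizer that pushes activations toward zero; the abstract does not specify the authors' exact formulation.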
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
Trauma is a leading cause of mortality worldwide, with about 20% of these deaths being preventable. Most of these preventable deaths result from errors during the initial resuscitation of injured patients. Decision support has been evaluated as an approach to assist teams during this phase and reduce errors. Existing systems require manual data entry and monitoring, which makes these tasks challenging to accomplish in a time-critical setting. This paper identifies the specific challenges of achieving effective decision support in trauma resuscitation with computer-vision techniques: complex backgrounds, crowded scenes, fine-grained activities, and a scarcity of labeled data. To address the first three challenges, the proposed system uses an actor tracker that identifies individuals, allowing the system to focus on actor-specific features. A Video Masked Autoencoder (Video-MAE) was used to overcome the issue of insufficient labeled data; this approach enables self-supervised learning on unlabeled video content, improving feature representations for medical activities. For more reliable performance, an ensemble fusion method was introduced that combines predictions from consecutive video clips and different actors. Our method outperformed existing approaches in identifying fine-grained activities, providing a solution for activity recognition in trauma resuscitation and similar complex domains.
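As a rough illustration of the clip-and-actor ensemble fusion step; the combination rule here (simple averaging) is an assumption, since the abstract does not give the exact scheme:

```python
import numpy as np

def fuse_predictions(clip_probs: np.ndarray) -> int:
    """Fuse per-clip, per-actor class probabilities into one activity label.

    clip_probs: array of shape (num_clips, num_actors, num_classes)
    holding softmax outputs from the recognition model.
    """
    fused = clip_probs.mean(axis=(0, 1))  # average over clips and actors
    return int(fused.argmax())            # most probable activity class

# Example: 4 consecutive clips, 3 tracked actors, 5 activity classes.
probs = np.random.dirichlet(np.ones(5), size=(4, 3))
print(fuse_predictions(probs))
```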
ISBN (print): 9781665448994
Learning continually from non-stationary data streams is a long-standing goal and a challenging problem in machine learning. Recently, we have witnessed a renewed and fast-growing interest in continual learning, especially within the deep learning community. However, algorithmic solutions are often difficult to re-implement, evaluate, and port across different settings, and even results on standard benchmarks are hard to reproduce. In this work, we propose Avalanche, an open-source end-to-end library for continual learning research based on PyTorch. Avalanche is designed to provide a shared and collaborative codebase for fast prototyping, training, and reproducible evaluation of continual learning algorithms.
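To give a flavour of the library's intent, here is a minimal training loop in the style of Avalanche's documented examples; module paths and signatures have shifted across Avalanche releases (e.g., `avalanche.training.strategies` became `avalanche.training.supervised`), so treat this as a sketch rather than version-exact code:

```python
import torch
from avalanche.benchmarks.classic import SplitMNIST
from avalanche.models import SimpleMLP
from avalanche.training.strategies import Naive  # avalanche.training.supervised in newer releases

benchmark = SplitMNIST(n_experiences=5)          # MNIST split into 5 sequential tasks
model = SimpleMLP(num_classes=10)
strategy = Naive(
    model,
    torch.optim.SGD(model.parameters(), lr=0.001),
    torch.nn.CrossEntropyLoss(),
    train_mb_size=32, train_epochs=1, eval_mb_size=32,
)

for experience in benchmark.train_stream:        # train on each task in turn
    strategy.train(experience)
    strategy.eval(benchmark.test_stream)         # evaluate on the full test stream
```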
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
Deepfake detection aims to counter the spread of deep-generated media that undermines trust in online content. While existing methods focus on large and complex models, the need for real-time detection demands greater efficiency. With this in mind, and unlike previous work, we introduce a novel deepfake detection approach for images that uses Binary Neural Networks (BNNs) for fast inference with minimal accuracy loss. Moreover, our method incorporates Fast Fourier Transform (FFT) and Local Binary Pattern (LBP) features as additional channels to uncover manipulation traces in the frequency and texture domains. Evaluations on the COCOFake, DFFD, and CIFAKE datasets demonstrate our method's state-of-the-art performance in most scenarios, with a significant efficiency gain of up to a 20× reduction in FLOPs during inference. Finally, by exploring BNNs for deepfake detection to balance accuracy and efficiency, this work paves the way for future research on efficient deepfake detection.
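A sketch of how FFT and LBP maps can be stacked as extra input channels, using NumPy and scikit-image; the normalization and LBP settings are assumptions, as the abstract does not specify them:

```python
import numpy as np
from skimage.color import rgb2gray
from skimage.feature import local_binary_pattern

def add_fft_lbp_channels(rgb: np.ndarray) -> np.ndarray:
    """Stack FFT-magnitude and LBP maps onto an RGB image: (H, W, 3) -> (H, W, 5)."""
    gray = rgb2gray(rgb)  # (H, W) in [0, 1]
    # Centered log-magnitude spectrum; periodic generation artifacts show up here.
    fft_mag = np.log1p(np.abs(np.fft.fftshift(np.fft.fft2(gray))))
    fft_mag /= fft_mag.max() + 1e-8
    # Uniform LBP with 8 neighbors at radius 1 encodes local texture patterns.
    lbp = local_binary_pattern(gray, P=8, R=1, method="uniform")
    lbp /= lbp.max() + 1e-8
    return np.dstack([rgb, fft_mag, lbp]).astype(np.float32)

image = np.random.rand(224, 224, 3).astype(np.float32)
print(add_fft_lbp_channels(image).shape)  # (224, 224, 5)
```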
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
We present the Recognize Anything Model (RAM): a strong foundation model for image tagging. RAM makes a substantial step for foundation models in computer vision, demonstrating the zero-shot ability to recognize any common category with high accuracy. By leveraging large-scale image-text pairs for training instead of manual annotations, RAM introduces a new paradigm for image tagging. The development of RAM comprises four key steps. Firstly, annotation-free image tags are obtained at scale through automatic text semantic parsing. Subsequently, a preliminary model is trained for automatic annotation by unifying the captioning and tagging tasks, supervised by the original texts and parsed tags, respectively. Thirdly, a data engine is employed to generate additional annotations and clean incorrect ones. Lastly, the model is retrained with the processed data and fine-tuned using a smaller but higher-quality subset. We evaluate the tagging capability of RAM on numerous benchmarks and observe an impressive zero-shot performance, which significantly outperforms CLIP and BLIP. Remarkably, RAM even surpasses fully supervised models and exhibits competitive performance compared with the Google tagging API. We have released RAM at https://***/ to foster the advancement of foundation models in computer vision.
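The first step, obtaining annotation-free tags by parsing caption text, can be illustrated with a toy parser; RAM uses a dedicated semantic parser and a curated tag vocabulary, so this spaCy noun-chunk sketch is only indicative:

```python
import spacy

# Requires: python -m spacy download en_core_web_sm
nlp = spacy.load("en_core_web_sm")

def caption_to_tags(caption: str) -> list[str]:
    """Extract candidate tags (head nouns and verbs) from a free-text caption."""
    doc = nlp(caption.lower())
    tags = {chunk.root.lemma_ for chunk in doc.noun_chunks}       # object tags
    tags.update(tok.lemma_ for tok in doc if tok.pos_ == "VERB")  # action tags
    return sorted(tags)

print(caption_to_tags("A dog catches a frisbee on the grassy park lawn"))
# e.g. ['catch', 'dog', 'frisbee', 'lawn']
```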
ISBN (print): 9781665448994
The Image Signal Processor (ISP) is a customized device that restores RGB images from the pixel signals of a CMOS image sensor. To realize this function, a series of processing units is used to tackle the various artifacts introduced by the capture device, such as color shifts, signal noise, and moiré effects. However, tuning each processing unit is highly complicated and requires substantial experience and effort from imaging experts. In this paper, a novel network architecture, CSANet, with an emphasis on inference speed and high PSNR, is proposed for the end-to-end learned ISP task. The proposed CSANet applies a double attention module employing both channel and spatial attention. In particular, its spatial attention is simplified to a lightweight dilated depth-wise convolution and still performs as well as alternatives. As proof of performance, CSANet won 2nd place in the Mobile AI 2021 Learned Smartphone ISP Challenge, with the 1st place PSNR score.
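One plausible reading of the double attention module, with SE-style channel attention and a dilated depth-wise spatial gate; both specifics are assumptions beyond what the abstract states:

```python
import torch
import torch.nn as nn

class DoubleAttention(nn.Module):
    """Channel attention plus spatial attention via a dilated depth-wise
    convolution -- an illustrative reading, not CSANet's exact block."""

    def __init__(self, channels: int, reduction: int = 4):
        super().__init__()
        self.channel_att = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),                        # (B, C, 1, 1)
            nn.Conv2d(channels, channels // reduction, 1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels // reduction, channels, 1),
            nn.Sigmoid(),
        )
        # Depth-wise (groups=channels) dilated conv yields a per-pixel gate.
        self.spatial_att = nn.Sequential(
            nn.Conv2d(channels, channels, 3, padding=2, dilation=2, groups=channels),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = x * self.channel_att(x)     # reweight channels
        return x * self.spatial_att(x)  # reweight spatial positions

out = DoubleAttention(16)(torch.randn(1, 16, 64, 64))
print(out.shape)  # torch.Size([1, 16, 64, 64])
```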
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
This paper reviews the NTIRE 2024 Portrait Quality Assessment Challenge, highlighting the proposed solutions and results. The challenge aims to obtain an efficient deep neural network capable of estimating the perceptual quality of real portrait photos. The methods must generalize to diverse scenes, diverse lighting conditions (indoor, outdoor, low-light), movement, blur, and other challenging conditions. In the challenge, 140 participants registered and 35 submitted results during the challenge period. The performance of the top 5 submissions is reviewed here as a gauge of the current state of the art in Portrait Quality Assessment.
ISBN (print): 9781665448994
Event cameras are sensors whose pixels respond independently and asynchronously to changes in scene illumination. Event cameras have a number of advantages over conventional cameras: low latency, high temporal resolution, high dynamic range, low power consumption, and sparse data output. However, existing event cameras also suffer from comparatively low spatial resolution and are sensitive to noise. Recently, it has been shown that it is possible to reconstruct an intensity frame stream from an event stream. These reconstructions preserve the high temporal rate of the event stream but tend to suffer from significant artifacts and low image quality due to the shortcomings of event cameras. In this work we demonstrate that it is possible to combine the best of both worlds by fusing a color frame stream at low temporal resolution and high spatial resolution with an event stream at high temporal resolution and low spatial resolution, generating a video stream with both high temporal and high spatial resolution while preserving the original color information. We utilize a novel event frame interpolation network (EFI-Net), a multi-phase convolutional neural network that fuses the frame and event streams. EFI-Net is trained using only simulated data and generalizes exceptionally well to real-world experimental data. We show that our method is able to interpolate frames where traditional video interpolation approaches fail, while also outperforming event-only reconstructions. We further contribute a new dataset containing event camera data synchronized with high-speed video. This work opens the door to a new application for event cameras, enabling high-fidelity fusion with frame-based image streams for the generation of high-quality, high-speed video. The dataset is available at https://***/file/d/1UIGVBqNER_5KguYPAu5y7TVg-JlNhz3-/view?usp=sharing.
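Fusion networks of this kind typically consume events as a spatio-temporal tensor; a common voxel-grid encoding is sketched below (the abstract does not state EFI-Net's exact input representation):

```python
import numpy as np

def events_to_voxel_grid(events: np.ndarray, num_bins: int,
                         height: int, width: int) -> np.ndarray:
    """Accumulate an event stream into a (num_bins, H, W) voxel grid.

    events: rows of (t, x, y, polarity) with polarity in {-1, +1}.
    """
    grid = np.zeros((num_bins, height, width), dtype=np.float32)
    t = events[:, 0]
    # Normalize timestamps into [0, num_bins) and assign each event to a bin.
    t_norm = (t - t.min()) / max(t.max() - t.min(), 1e-9) * (num_bins - 1e-6)
    bins = t_norm.astype(np.int64)
    x = events[:, 1].astype(np.int64)
    y = events[:, 2].astype(np.int64)
    np.add.at(grid, (bins, y, x), events[:, 3])  # signed accumulation
    return grid

events = np.array([[0.00, 5, 7, +1], [0.01, 5, 7, -1], [0.02, 9, 3, +1]])
print(events_to_voxel_grid(events, num_bins=3, height=16, width=16).shape)
# (3, 16, 16)
```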
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
Performing hyperparameter tuning in federated learning is often prohibitively expensive due to the substantial communication overhead associated with training a single configuration, especially with a large hyperparameter search space. To overcome this challenge, recent works have explored reward-based approaches that learn a policy distribution over a set of hyperparameter configurations. These approaches enable the concurrent exploration of multiple hyperparameter configurations within a single communication round, thereby accelerating the search process. In this paper, we take a deeper look at reward-based strategies and systematically analyze them, uncovering several issues and challenges associated with their adoption in practice. Furthermore, motivated by the insights from our analysis, we propose an in-depth evaluation of the policy distribution with metrics that capture the rankings of standalone configurations. We contribute this critical examination and the proposed evaluation metrics in order to raise awareness of the challenges and hidden issues that reward-based federated hyperparameter optimization might face, and to enable a more rigorous evaluation and therefore faster progress in this research area. We expect that the identified challenges will serve as inspiration for the development of more robust and hyperparameter-free federated hyperparameter tuning approaches.
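The reward-based strategy under analysis can be caricatured as a bandit-style policy over configurations; the REINFORCE-style update and reward model below are illustrative assumptions, not any specific published algorithm:

```python
import numpy as np

# Toy reward-based federated HPO: keep a softmax policy over candidate
# configurations and update it with noisy client rewards (e.g., accuracy).
rng = np.random.default_rng(0)
true_quality = np.array([0.60, 0.72, 0.68, 0.55])  # hidden config quality
logits = np.zeros(len(true_quality))
baseline, lr = 0.0, 0.5

for communication_round in range(200):
    policy = np.exp(logits - logits.max())
    policy /= policy.sum()                            # softmax over configs
    arm = rng.choice(len(policy), p=policy)           # config tried this round
    reward = true_quality[arm] + rng.normal(0, 0.02)  # noisy client feedback
    baseline = 0.9 * baseline + 0.1 * reward          # running-mean baseline
    # REINFORCE update: grad of log pi(arm) is (one_hot(arm) - policy).
    one_hot = np.eye(len(policy))[arm]
    logits += lr * (reward - baseline) * (one_hot - policy)

# Probability mass should concentrate on the best configuration.
print(np.round(np.exp(logits) / np.exp(logits).sum(), 3))
```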
We propose a novel depth-aware joint attention target estimation framework that estimates the attention target in 3D space. Our goal is to mimic the human ability to understand where each person is looking in their proximity. In this work, we tackle the previously unexplored problem of utilising a depth prior along with a 3D joint field-of-view (FOV) probability map to estimate the joint attention target of people in the scene. We leverage the insight that, besides the 2D image content, strong gaze-related constraints exist in the depth order of the scene and in different subject-specific attributes. Extensive experiments show that our method performs favourably against existing joint attention target estimation methods on the VideoCoAtt benchmark dataset. Although the proposed framework is designed for joint attention target estimation, we show that it also outperforms single attention target estimation methods on both the GazeFollow image and the VideoAttentionTarget video benchmark datasets.
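A toy version of a 3D FOV probability map: candidate 3D points are scored against each person's viewing cone, and two people's maps are intersected for joint attention. The exponential-cosine falloff and its parameters are assumptions, not the paper's formulation:

```python
import numpy as np

def fov_probability(points: np.ndarray, eye: np.ndarray, gaze_dir: np.ndarray,
                    kappa: float = 6.0) -> np.ndarray:
    """Score 3D points by how well they fall inside a person's viewing cone.

    points: (N, 3) candidate locations (e.g., back-projected from depth);
    eye: (3,) eye position; gaze_dir: (3,) unit gaze direction.
    """
    rays = points - eye
    rays /= np.linalg.norm(rays, axis=1, keepdims=True) + 1e-9
    cos_angle = rays @ gaze_dir                  # 1 along the gaze axis
    return np.exp(kappa * (cos_angle - 1.0))     # peaks at cos_angle = 1

# Joint attention: multiply per-person FOV maps so only targets inside
# everyone's cone score highly.
pts = np.random.randn(1000, 3) * 2.0
p1 = fov_probability(pts, np.array([0.0, 0.0, 0.0]), np.array([0.0, 0.0, 1.0]))
p2 = fov_probability(pts, np.array([1.0, 0.0, 0.0]), np.array([-0.6, 0.0, 0.8]))
joint = p1 * p2
print(pts[joint.argmax()])                       # most likely joint target
```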