检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

时间限定

出版年份：

文献类型

图书期刊文献学位论文多媒体

馆藏选择

电子馆藏纸本馆藏

核心期刊

全部期刊 SCI 收录期刊 SSCI 收录期刊 EI 收录期刊 CSCD 收录期刊 CSSCI 收录期刊

语言

中文英文

文献类型

期刊文献图书学位论文标准纸本馆藏

帮助

文字说明：

T=题名（书名、题名），A=作者（责任者），K=主题词，P=出版物名称，PU=出版社名称，O=机构（作者单位、学位授予单位、专利申请人），L=中图分类号，C=学科分类号，U=全部字段，Y=年（出版发行年、学位年度、标准发布年）

检索规则说明：

AND代表“并且”；OR代表“或者”；NOT代表“不包含”；(注意必须大写,运算符两边需空一格)

检索范例：

范例一：(K=图书馆学 OR K=情报学) AND A=范并思 AND Y=1982-2016
范例二：P=计算机应用与软件 AND (U=C++ OR U=Basic) NOT K=Visual AND Y=2011-2016

分类表

所选分类

>> <<

限定检索结果

文献类型

8,901 篇 会议
43 篇 期刊文献
18 册 图书

馆藏范围

8,961 篇 电子文献
1 种 纸本馆藏

日期分布

学科分类号

4,560 篇 工学
- 4,020 篇 计算机科学与技术...
- 2,178 篇 软件工程
- 1,241 篇 光学工程
- 555 篇 控制科学与工程
- 431 篇 信息与通信工程
- 430 篇 机械工程
- 294 篇 电气工程
- 287 篇 仪器科学与技术
- 179 篇 生物工程
- 159 篇 生物医学工程（可授...
- 119 篇 电子科学与技术（可...
- 61 篇 安全科学与工程
- 58 篇 建筑学
- 58 篇 化学工程与技术
- 52 篇 土木工程
- 49 篇 交通运输工程
- 40 篇 力学（可授工学、理...
2,065 篇 理学
- 1,382 篇 物理学
- 1,198 篇 数学
- 420 篇 统计学（可授理学、...
- 238 篇 生物学
- 54 篇 化学
- 36 篇 系统科学
263 篇 管理学
- 180 篇 图书情报与档案管...
- 89 篇 管理科学与工程(可...
- 47 篇 工商管理
223 篇 医学
- 222 篇 临床医学
- 39 篇 基础医学(可授医学...
205 篇 艺术学
- 205 篇 设计学（可授艺术学...
45 篇 法学
- 43 篇 社会学
21 篇 农学
14 篇 教育学
9 篇 经济学
6 篇 军事学

主题

3,412 篇 computer vision
1,216 篇 pattern recognit...
946 篇 cameras
908 篇 conferences
765 篇 computer science
674 篇 image segmentati...
618 篇 layout
598 篇 training
548 篇 shape
518 篇 robustness
451 篇 feature extracti...
448 篇 humans
445 篇 face recognition
405 篇 computational mo...
402 篇 object detection
365 篇 visualization
356 篇 computer archite...
336 篇 application soft...
304 篇 lighting
259 篇 image reconstruc...

机构

41 篇 microsoft resear...
30 篇 department of co...
25 篇 department of co...
23 篇 institute for co...
22 篇 department of co...
22 篇 school of comput...
20 篇 university of sc...
20 篇 swiss fed inst t...
19 篇 tsinghua univers...
19 篇 institute of com...
18 篇 swiss fed inst t...
17 篇 the robotics ins...
17 篇 carnegie mellon ...
17 篇 computer vision ...
17 篇 department of co...
16 篇 institute of inf...
16 篇 school of comput...
15 篇 school of comput...
15 篇 carnegie mellon ...
14 篇 national laborat...

作者

57 篇 timofte radu
25 篇 huang thomas s.
24 篇 van gool luc
23 篇 s.k. nayar
22 篇 nayar shree k.
22 篇 t. kanade
21 篇 jain anil k.
20 篇 luc van gool
19 篇 t.s. huang
18 篇 xiaoou tang
18 篇 murino vittorio
18 篇 horst bischof
17 篇 a.k. jain
17 篇 t. darrell
16 篇 g. healey
16 篇 bowyer kevin w.
16 篇 bischof horst
15 篇 m.j. black
15 篇 li stan z.
15 篇 m. shah

语言

8,932 篇 英文
21 篇 其他
8 篇 中文
1 篇 土耳其文

检索条件"任意字段=IEEE-Computer-Society Conference on Computer Vision and Pattern Recognition Workshops"

共 8962 条记录，以下是241-250 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

recognition of Freely Selected Keypoints on Human Limbs

Recognition of Freely Selected Keypoints on Human Limbs

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Ludwig, Katja Kienzle, Daniel Lienhart, Rainer Univ Augsburg Machine Learning & Comp Vis Lab Augsburg Germany

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

Nearly all Human Pose Estimation (HPE) datasets consist of a fixed set of keypoints. Standard HPE models trained on such datasets can only detect these keypoints. If more points are desired, they have to be manually annotated and the model needs to be retrained. Our approach leverages the vision Transformer architecture to extend the capability of the model to detect arbitrary keypoints on the limbs of persons. We propose two different approaches to encode the desired keypoints. (1) Each keypoint is defined by its position along the line between the two enclosing keypoints from the fixed set and its relative distance between this line and the edge of the limb. (2) Keypoints are defined as coordinates on a norm pose. Both approaches are based on the TokenPose [12] architecture, while the keypoint tokens that correspond to the fixed keypoints are replaced with our novel module. Experiments show that our approaches achieve similar results to TokenPose on the fixed keypoints and are capable of detecting arbitrary keypoints on the limbs.

关键词： Measurement computer vision Image edge detection conferences Computational modeling Biological system modeling Pose estimation

来源：评论

学校读者我要写书评

暂无评论

ScanpathNet: A Recurrent Mixture Density Network for Scanpath Prediction

ScanpathNet: A Recurrent Mixture Density Network for Scanpat...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： de Belen, Ryan Anthony Jalova Bednarz, Tomasz Sowmya, Arcot Univ New South Wales Sydney NSW Australia

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

Understanding the mechanisms underlying human visual attention is an important research problem in cognitive neuroscience and computer vision. While existing models predict salient regions (i.e., saliency maps) and temporal sequences of eye fixations (i.e., scanpaths) in images, their designs often partially follow theoretical frameworks. Here, we introduce ScanpathNet, a deep learning model inspired by the latest theoretical model in neuroscience. It is 'guided' by a dynamic priority map influenced by semantic content and fixation history. The model leverages convolutional neural networks to extract rich semantic features, convolutional long short-term memory networks to model the inhibition of return mechanism and sequential dependencies of fixations, and mixture density networks to predict probability distributions of fixations for each pixel. Simulated human scanpaths can then be generated by sequentially sampling the output of the proposed model. Despite its simplicity, ScanpathNet showed promising qualitative and quantitative scanpath prediction performance in extensive experiments on numerous eye-tracking benchmark datasets.

关键词： Deep learning Visualization computer vision Computational modeling Semantics Predictive models Visual systems

来源：评论

学校读者我要写书评

暂无评论

A New Non-central Model for Fisheye Calibration

A New Non-central Model for Fisheye Calibration

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Tezaur, Radka Kumar, Avinash Nestares, Oscar Intel Corp Santa Clara CA 95054 USA

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

A new non-central model suitable for calibrating fisheye cameras is proposed. It is a direct extension of the popular central model developed by Scaramuzza et al., used by Matlab computer vision Toolbox fisheye calibration tool. It allows adapting existing applications that are using this central model to a non-central projection that is more accurate, especially when objects captured in the images are close to the camera, and it makes it possible to switch easily between the more accurate non-central characterization of the fisheye camera and the more convenient central approximation, as needed. It is shown that the algorithms proposed by Scaramuzza et al. for their central model can be modified to accommodate the angle dependent axial viewpoint shift. This means, besides other, that a similar process can be used for calibration involving the viewpoint shift characterization and a user-friendly calibration tool can be produced with this new non-central model that does not require the user to provide detailed lens design specifications or an educated guess for the initial parameter values. Several other improvements to the Scaramuzza's central model are also introduced, helping to improve the performance of both the central model, and its non-central extension.

关键词： Adaptation models computer vision Three-dimensional displays Computational modeling Switches Cameras Mathematical models

来源：评论

学校读者我要写书评

暂无评论

EKILA: Synthetic Media Provenance and Attribution for Generative Art

EKILA: Synthetic Media Provenance and Attribution for Genera...

引用

2023 ieee/CVF conference on computer vision and pattern recognition workshops, CVPRW 2023

作者： Balan, Kar Agarwal, Shruti Jenni, Simon Parsons, Andy Gilbert, Andrew Collomosse, John University of Surrey United Kingdom Adobe Inc.

ISBN: (纸本)9798350302493

We present EKILA;a decentralized framework that enables creatives to receive recognition and reward for their contributions to generative AI (GenAI). EKILA proposes a robust visual attribution technique and combines this with an emerging content provenance standard (C2PA) to address the problem of synthetic image provenance - determining the generative model and training data responsible for an AI-generated image. Furthermore, EKILA extends the non-fungible token (NFT) ecosystem to introduce a tokenized representation for rights, enabling a triangular relationship between the asset's Ownership, Rights, and Attribution (ORA). Leveraging the ORA relationship enables creators to express agency over training consent and, through our attribution model, to receive apportioned credit, including royalty payments for the use of their assets in GenAI. © 2023 ieee.

关键词： computer vision

来源：评论

学校读者我要写书评

暂无评论

Leveraging Unlabeled Data for Sketch-based Understanding

Leveraging Unlabeled Data for Sketch-based Understanding

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Morales, Javier Murrugarra-Llerena, Nils Saavedra, Jose M. Univ Chile Dept Comp Sci Santiago Chile Weber State Univ Dept Comp Sci Ogden UT 84408 USA Univ Los Andes Santiago Chile

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

Sketch-based understanding is a critical component of human cognitive learning and is a primitive communication means between humans. This topic has recently attracted the interest of the computer vision community as sketching represents a powerful tool to express static objects and dynamic scenes. Unfortunately, despite its broad application domains, the current sketch-based models strongly rely on labels for supervised training, ignoring knowledge from unlabeled data, thus limiting the underlying generalization and the applicability. Therefore, we present a study about the use of unlabeled data to improve a sketch-based model. To this end, we evaluate variations of VAE and semi-supervised VAE, and present an extension of BYOL to deal with sketches. Our results show the superiority of sketch-BYOL, which outperforms other self-supervised approaches increasing the retrieval performance for known and unknown categories. Furthermore, we show how other tasks can benefit from our proposal.

关键词： Training computer vision TV Limiting Image edge detection Image retrieval Data models

来源：评论

学校读者我要写书评

暂无评论

Scene Representation in Bird's-Eye View from Surrounding Cameras with Transformers

Scene Representation in Bird's-Eye View from Surrounding Cam...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Zhao, Yun Zhang, Yu Gong, Zhan Zhu, Hong Inspur Elect Informat Ind Co Ltd Dept AI & HPC Beijing Peoples R China

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

Scene representation in the bird's-eye-view (BEV) coordinate frame provides a succinct and effective way to understand surrounding environments for autonomous vehicles and robotics. In this work, we present an end-to-end architecture to generate the BEV representation from surrounding cameras. To generate the BEV representation, we propose a transformer-based encoder-decoder structure to translate the image features from different cameras into the BEV frame, which takes advantage of the context information in the individual image and the relationship between images in different views. We perform multiple semantic segmentation tasks using the BEV features. Experimental results show that our model outperforms the competitive baseline [20], which demonstrates the effectiveness and efficiency of our method.

关键词： Image segmentation Image coding Computational modeling Semantics Robot vision systems Predictive models Cameras

来源：评论

学校读者我要写书评

暂无评论

Guiding Attention using Partial-Order Relationships for Image Captioning

Guiding Attention using Partial-Order Relationships for Imag...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Popattia, Murad Rafi, Muhammad Qureshi, Rizwan Nawaz, Shah Natl Univ Comp & Emerging Sci Karachi Pakistan Hamad Bin Khalifa Univ Doha Qatar Ist Italiano Tecnol IIT Pattern Anal & Comp Vis PAVIS Genoa Italy DESY Hamburg Germany

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

The use of attention models for automated image captioning has enabled many systems to produce accurate and meaningful descriptions for images. Over the years, many novel approaches have been proposed to enhance the attention process using different feature representations. In this paper, we extend this approach by creating a guided attention network mechanism, that exploits the relationship between the visual scene and text-descriptions using spatial features from the image, high-level information from the topics, and temporal context from caption generation, which are embedded together in an ordered embedding space. A pairwise ranking objective is used for training this embedding space which allows similar images, topics and captions in the shared semantic space to maintain a partial order in the visual-semantic hierarchy and hence, helps the model to produce more visually accurate captions. The experimental results based on MSCOCO dataset shows the competitiveness of our approach, with many state-of-the-art models on various evaluation metrics.

关键词： Training Measurement Visualization computer vision conferences Semantics computer architecture

来源：评论

学校读者我要写书评

暂无评论

An Attention-based Method for Multi-label Facial Action Unit Detection

An Attention-based Method for Multi-label Facial Action Unit...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Le Hoai, Duy Lim, Eunchae Choi, Eunbin Kim, Sieun Pant, Sudarshan Lee, Guee-Sang Kim, Soo-Huyng Yang, Hyung-Jeong Chonnam Natl Univ Gwangju South Korea

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

Facial Action Coding System is an approach for modeling the complexity of human emotional expression. Automatic action unit (AU) detection is a crucial research area in human-computer interaction. This paper describes our submission to the third Affective Behavior Analysis in-the-wild (ABAW) competition 2022. We proposed a method for detecting facial action units in the video. In the first stage, a lightweight CNN-based feature extractor is employed to extract the feature map from each video frame. Then, an attention module is applied to refine the attention map. The attention encoded vector is derived using a weighted sum of the feature map and the attention scores later. Finally, the sigmoid function is used at the output layer to make the prediction suitable for multi-label AUs detection. We achieved a macro F1 score of 0.48 on the validation set and 0.4206 on the test set compared to 0.39 and 0.3650 from the ABAW challenge baseline model.

关键词： Human computer interaction Gold computer vision conferences Computational modeling Feature extraction Encoding

来源：评论

学校读者我要写书评

暂无评论

Can we trust bounding box annotations for object detection?

Can we trust bounding box annotations for object detection?

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Murrugarra-Llerena, Jeffri Kirsten, L. N. Jung, Claudio R. Univ Fed Rio Grande do Sul Inst Informat Porto Alegre RS Brazil

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

Object detection is a classical problem in computer vision, and the vast majority of approaches require large annotated datasets for training and evaluation purposes. The most popular representations are bounding boxes (BBs), usually defined as the minimal-area rectangle that encompasses the whole object region. However, the annotation process presents some subjectiveness (particularly when occlusions are present), and its quality might get degraded when the annotators get tired. Comparing BBs is crucial for evaluation purposes, and the Intersection-over-Union (IoU) is the standard similarity metric. In this paper, we provide theoretical and experimental results indicating that the IoU can be strongly affected even by small annotation discrepancies in popular datasets used for object detection. As a consequence, the Average Precision (AP) value commonly used to evaluate object detectors is also influenced by annotation bias or noise, particularly for small objects and tighter IoU thresholds.

关键词： Degradation Training computer vision Annotations Object detection Detectors Size measurement

来源：评论

学校读者我要写书评

暂无评论

PhoneDepth: A Dataset for Monocular Depth Estimation on Mobile Devices

PhoneDepth: A Dataset for Monocular Depth Estimation on Mobi...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Benavides, Fausto Tapia Ignatov, Andrey Timofte, Radu Swiss Fed Inst Technol Zurich Switzerland JMU Wurzburg Wurzburg Germany

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

Monocular depth estimation has been studied as a classic and learning based computer vision problem for decades. However, little attention received the efficiency and the deployment of methods on mobile hardware. All publicly available datasets have severe limitations related to their applicability to camera data captured with real mobile devices. For instance, the main issues with current datasets include (but not exhaustively) low quality of images due the cameras or collection methods, domain specifically generated datasets as is the case for autonomous driving, small number of samples, sparse depthmaps, etc. In response, we introduce PhoneDepth, a novel dataset that aims to take advantage of modern phones hardware and professional stereo cameras. Depthmaps are acquired from three sources: Time of Flight sensor, Dual Pixel sensor and stereo camera;while the images correspond to mobile phone photos. We prove its high value by training neural networks with multiple depth supervision, fine-tuning on other datasets and for depth refinement. Along with the dataset we present benchmark models and a toolbox to facilitate the dataset usage.

关键词： Training computer vision Three-dimensional displays Estimation Cameras Mobile handsets Hardware

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共500页 << < 21 22 23 24 25 26 27 28 29 30 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：