Search condition: Institution = "Key Lab of Language Engineering and Computing of Guangdong Province"
24 records; showing 1-10

Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models
38th Conference on Neural Information Processing Systems, NeurIPS 2024
Authors: Fu, Shenghao; Yan, Junkai; Yang, Qize; Wei, Xihan; Xie, Xiaohua; Zheng, Wei-Shi
Affiliations: School of Computer Science and Engineering, Sun Yat-sen University, China; Peng Cheng Laboratory, Shenzhen 518055, China; Tongyi Lab, Alibaba Group, China; Key Laboratory of Machine Intelligence and Advanced Computing, Ministry of Education, China; Guangdong Province Key Laboratory of Information Security Technology, China; Guangdong, Guangzhou 510555, China
Recent vision foundation models can extract universal representations and show impressive abilities in various tasks. However, their application on object detection is largely overlooked, especially without fine-tunin...

View-decoupled Transformer for Person Re-identification under Aerial-ground Camera Network
Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Quan Zhang; Lei Wang; Vishal M. Patel; Xiaohua Xie; Jianhuang Lai
Affiliations: School of Computer Science and Engineering, Sun Yat-sen University, China; Department of Electrical and Computer Engineering, Johns Hopkins University, USA; Pazhou Lab (HuangPu), Guangdong, China; Guangdong Province Key Laboratory of Information Security Technology, Guangdong, China; Key Laboratory of Machine Intelligence and Advanced Computing, Ministry of Education, China
Existing person re-identification methods have achieved remarkable advances in appearance-based identity association across homogeneous cameras, such as ground-ground matching. However, as a more practical scenario, a...

Rotation Exploration Transformer for Aerial Person Re-identification
IEEE International Conference on Multimedia and Expo (ICME)
Authors: Lei Wang; Quan Zhang; Junyang Qiu; Jianhuang Lai
Affiliations: School of Computer Science and Engineering, Sun Yat-sen University, China; Guangdong Province Key Laboratory of Information Security Technology, China; Key Laboratory of Machine Intelligence and Advanced Computing, Ministry of Education, China; Pazhou Lab (HuangPu), Guangzhou, China
Aerial person re-identification (AReID) focuses on accurately matching target person images within a UAV camera network. Challenges arise due to the broad field of view and arbitrary movement of UAVs, leading to foreg...

Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis
Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Yanzuo Lu; Manlin Zhang; Andy J Ma; Xiaohua Xie; Jianhuang Lai
Affiliations: School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China; Guangdong Province Key Laboratory of Information Security Technology, China; Key Laboratory of Machine Intelligence and Advanced Computing, Ministry of Education, China; Pazhou Lab (HuangPu), Guangzhou, China
Diffusion model is a promising approach to image generation and has been employed for Pose-Guided Person Image Synthesis (PGPIS) with competitive performance. While existing methods simply align the person appearance ...

LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models
arXiv, 2025
Authors: Fu, Shenghao; Yang, Qize; Mo, Qijie; Yan, Junkai; Wei, Xihan; Meng, Jingke; Xie, Xiaohua; Zheng, Wei-Shi
Affiliations: School of Computer Science and Engineering, Sun Yat-sen University, China; Tongyi Lab, Alibaba Group, China; Peng Cheng Laboratory, China; Key Laboratory of Machine Intelligence and Advanced Computing, Ministry of Education, China; Guangdong Province Key Laboratory of Information Security Technology, China
Recent open-vocabulary detectors achieve promising performance with abundant region-level annotated data. In this work, we show that an open-vocabulary detector co-training with a large language model by generating im...

Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models
arXiv, 2024
Authors: Fu, Shenghao; Yan, Junkai; Yang, Qize; Wei, Xihan; Xie, Xiaohua; Zheng, Wei-Shi
Affiliations: School of Computer Science and Engineering, Sun Yat-sen University, China; Peng Cheng Laboratory, Shenzhen 518055, China; Tongyi Lab, Alibaba Group, China; Key Laboratory of Machine Intelligence and Advanced Computing, Ministry of Education, China; Guangdong Province Key Laboratory of Information Security Technology, China; Guangdong, Guangzhou 510555, China
Recent vision foundation models can extract universal representations and show impressive abilities in various tasks. However, their application on object detection is largely overlooked, especially without fine-tunin...

ViSpeak: Visual Instruction Feedback in Streaming Videos
arXiv, 2025
Authors: Fu, Shenghao; Yang, Qize; Li, Yuan-Ming; Peng, Yi-Xing; Lin, Kun-Yu; Wei, Xihan; Hu, Jian-Fang; Xie, Xiaohua; Zheng, Wei-Shi
Affiliations: School of Computer Science and Engineering, Sun Yat-sen University, China; Tongyi Lab, Alibaba Group, China; Peng Cheng Laboratory, China; Key Laboratory of Machine Intelligence and Advanced Computing, Ministry of Education, China; Guangdong Province Key Laboratory of Information Security Technology, China
Recent advances in Large Multi-modal Models (LMMs) are primarily focused on offline video understanding. Instead, streaming video understanding poses great challenges to recent models due to its time-sensitive, omni-m...

Frozen-DETR: enhancing DETR with image understanding from frozen foundation models
Proceedings of the 38th International Conference on Neural Information Processing Systems
Authors: Shenghao Fu; Junkai Yan; Qize Yang; Xihan Wei; Xiaohua Xie; Wei-Shi Zheng
Affiliations: School of Computer Science and Engineering, Sun Yat-sen University, China and Tongyi Lab, Alibaba Group and Key Laboratory of Machine Intelligence and Advanced Computing, Ministry of Education, China; Tongyi Lab, Alibaba Group; School of Computer Science and Engineering, Sun Yat-sen University, China and Key Laboratory of Machine Intelligence and Advanced Computing, Ministry of Education, China and Guangdong Province Key Laboratory of Information Security Technology, China; School of Computer Science and Engineering, Sun Yat-sen University, China and Peng Cheng Laboratory, Shenzhen, China and Key Laboratory of Machine Intelligence and Advanced Computing, Ministry of Education, China and Pazhou Laboratory (Huangpu), Guangzhou, Guangdong, China
Recent vision foundation models can extract universal representations and show impressive abilities in various tasks. However, their application on object detection is largely overlooked, especially without fine-tunin...

ASAG: Building Strong One-Decoder-Layer Sparse Detectors via Adaptive Sparse Anchor Generation
arXiv, 2023
Authors: Fu, Shenghao; Yan, Junkai; Gao, Yipeng; Xie, Xiaohua; Zheng, Wei-Shi
Affiliations: School of Computer Science and Engineering, Sun Yat-sen University, China; Pengcheng Lab, China; Guangdong Province Key Laboratory of Information Security Technology, China; Key Laboratory of Machine Intelligence and Advanced Computing, Ministry of Education, China
Recent sparse detectors with multiple, e.g. six, decoder layers achieve promising performance but much inference time due to complex heads. Previous works have explored using dense priors as initialization and built o...

ASAG: Building Strong One-Decoder-Layer Sparse Detectors via Adaptive Sparse Anchor Generation
International Conference on Computer Vision (ICCV)
Authors: Shenghao Fu; Junkai Yan; Yipeng Gao; Xiaohua Xie; Wei-Shi Zheng
Affiliations: School of Computer Science and Engineering, Sun Yat-sen University, China; Guangdong Province Key Laboratory of Information Security Technology, China; Key Laboratory of Machine Intelligence and Advanced Computing, Ministry of Education, China; Pengcheng Lab, China
Recent sparse detectors with multiple, e.g. six, decoder layers achieve promising performance but much inference time due to complex heads. Previous works have explored using dense priors as initialization and built o...