Since the preparation of labeled datafor training semantic segmentation networks of pointclouds is a time-consuming process, weakly supervisedapproaches have been introduced to learn fromonly a small fraction of data....
详细信息
Since the preparation of labeled datafor training semantic segmentation networks of pointclouds is a time-consuming process, weakly supervisedapproaches have been introduced to learn fromonly a small fraction of data. These methods aretypically based on learning with contrastive losses whileautomatically deriving per-point pseudo-labels from asparse set of user-annotated labels. In this paper, ourkey observation is that the selection of which samplesto annotate is as important as how these samplesare used for training. Thus, we introduce a methodfor weakly supervised segmentation of 3D scenes thatcombines self-training with active learning. Activelearning selects points for annotation that are likelyto result in improvements to the trained model, whileself-training makes efficient use of the user-providedlabels for learning the model. We demonstrate thatour approach leads to an effective method that providesimprovements in scene segmentation over previouswork and baselines, while requiring only a few userannotations.
Currently,the video captioning models based on an encoder-decoder mainly rely on a single video input *** contents of video captioning are limited since few studies employed external corpus information to guide the ge...
详细信息
Currently,the video captioning models based on an encoder-decoder mainly rely on a single video input *** contents of video captioning are limited since few studies employed external corpus information to guide the generation of video captioning,which is not conducive to the accurate descrip-tion and understanding of video *** address this issue,a novel video captioning method guided by a sentence retrieval generation network(ED-SRG)is proposed in this ***,a ResNeXt network model,an efficient convolutional network for online video understanding(ECO)model,and a long short-term memory(LSTM)network model are integrated to construct an encoder-decoder,which is utilized to extract the 2D features,3D features,and object features of video data *** features are decoded to generate textual sentences that conform to video content for sentence ***,a sentence-transformer network model is employed to retrieve different sentences in an external corpus that are semantically similar to the above textual *** candidate sentences are screened out through similarity ***,a novel GPT-2 network model is constructed based on GPT-2 network *** model introduces a designed random selector to randomly select predicted words with a high probability in the corpus,which is used to guide and generate textual sentences that are more in line with human natural language *** proposed method in this paper is compared with several existing works by *** results show that the indicators BLEU-4,CIDEr,ROUGE_L,and METEOR are improved by 3.1%,1.3%,0.3%,and 1.5%on a public dataset MSVD and 1.3%,0.5%,0.2%,1.9%on a public dataset MSR-VTT *** can be seen that the proposed method in this paper can generate video captioning with richer semantics than several state-of-the-art approaches.
IoT devices rely on authentication mechanisms to render secure message *** data transmission,scalability,data integrity,and processing time have been considered challenging aspects for a system constituted by IoT *** ...
详细信息
IoT devices rely on authentication mechanisms to render secure message *** data transmission,scalability,data integrity,and processing time have been considered challenging aspects for a system constituted by IoT *** application of physical unclonable functions(PUFs)ensures secure data transmission among the internet of things(IoT)devices in a simplified network with an efficient time-stamped *** paper proposes a secure,lightweight,cost-efficient reinforcement machine learning framework(SLCR-MLF)to achieve decentralization and security,thus enabling scalability,data integrity,and optimized processing time in IoT *** has been integrated into SLCR-MLF to improve the security of the cluster head node in the IoT platform during transmission by providing the authentication service for device-to-device *** IoT network gathers information of interest from multiple cluster members selected by the proposed *** addition,the software-defined secured(SDS)technique is integrated with SLCR-MLF to improve data integrity and optimize processing time in the IoT *** analysis shows that the proposed framework outperforms conventional methods regarding the network’s lifetime,energy,secured data retrieval rate,and performance *** enabling the proposed framework,number of residual nodes is reduced to 16%,energy consumption is reduced by up to 50%,almost 30%improvement in data retrieval rate,and network lifetime is improved by up to 1000 msec.
As a deep learning network with an encoder-decoder architecture,UNet and its series of improved versions have been widely used in medical image segmentation with great ***,when used to segment targets in 3D medical im...
详细信息
As a deep learning network with an encoder-decoder architecture,UNet and its series of improved versions have been widely used in medical image segmentation with great ***,when used to segment targets in 3D medical images such as magnetic resonance imaging(MRI),computed tomography(CT),these models do not model the relevance of images in vertical space,resulting in poor accurate analysis of consecutive slices of the same *** the other hand,the large amount of detail lost during the encoding process makes these models incapable of segmenting small-scale tumor *** at the scene of small-scale target segmentation in 3D medical images,a fully new neural network model SUNet++is proposed on the basis of UNet and UNet++.SUNet++improves the existing models mainly in three aspects:1)the modeling strategy of slice superposition is used to thoroughly excavate the three dimensional information of the data;2)by adding an attention mechanism during the decoding process,small scale targets in the picture are retained and amplified;3)in the up-sampling process,the transposed convolution operation is used to further enhance the effect of the *** order to verify the effect of the model,we collected and produced a dataset of hyperintensity MRI liver-stage images containing over 400 cases of liver *** results on both public and proprietary datasets demonstrate the superiority of SUNet++in small-scale target segmentation of three-dimensional medical images.
Automatically detecting and locating remote occlusion small objects from the images of complex traffic environments is a valuable and challenging *** the boundary box location is not sufficiently accurate and it is di...
详细信息
Automatically detecting and locating remote occlusion small objects from the images of complex traffic environments is a valuable and challenging *** the boundary box location is not sufficiently accurate and it is difficult to distinguish overlapping and occluded objects,the authors propose a network model with a second-order term attention mechanism and occlusion ***,the backbone network is built on *** a method is designed for the feature extraction network based on an item-wise attention mechanism,which uses the filtered weighted feature vector to replace the original residual fusion and adds a second-order term to reduce the information loss in the process of fusion and accelerate the convergence of the ***,an objected occlusion regression loss function is studied to reduce the problems of missed detections caused by dense *** experimental results demonstrate that the authors’method achieved state-of-the-art performance without reducing the detection *** mAP@.5 of the method is 85.8%on the Foggy_cityscapes dataset and the mAP@.5 of the method is 97.8%on the KITTI dataset.
In recent years,aquaculture has developed rapidly,especially in coastal and open ocean *** practice,water quality prediction is of critical ***,traditional water quality prediction models face limitations in handling ...
详细信息
In recent years,aquaculture has developed rapidly,especially in coastal and open ocean *** practice,water quality prediction is of critical ***,traditional water quality prediction models face limitations in handling complex spatiotemporal *** address this challenge,a prediction model was proposed for water quality,namely an adaptive multi-channel temporal graph convolutional network(AMTGCN).The AMTGCN integrates adaptive graph construction,multi-channel spatiotemporal graph convolutional network,and fusion layers,and can comprehensively capture the spatial relationships and spatiotemporal patterns in aquaculture water quality *** aquaculture water quality data and the metrics MAE,RMSE,MAPE,and R^(2) were collected to validate the *** results show that the AMTGCN presents an average improvement of 34.01%,34.59%,36.05%,and 17.71%compared to LSTM,respectively;an average improvement of 64.84%,56.78%,64.82%,and 153.16%compared to the STGCN,respectively;an average improvement of 55.25%,48.67%,57.01%,and 209.00%compared to GCN-LSTM,respectively;and an average improvement of 7.05%,5.66%,7.42%,and 2.47%compared to TCN,*** indicates that the AMTGCN,integrating the innovative structure of adaptive graph construction and multi-channel spatiotemporal graph convolutional network,could provide an efficient solution for water quality prediction in aquaculture.
Advancements in smart applications highlight the need for increased processing and storage capacity at Smart Devices (SDs). To tackle this, Edge computing (EC) is enabled to offload SD workloads to distant edge server...
详细信息
The flying foxes optimization(FFO)algorithm,as a newly introduced metaheuristic algorithm,is inspired by the survival tactics of flying foxes in heat wave *** preferentially selects the best-performing *** tendency wi...
详细信息
The flying foxes optimization(FFO)algorithm,as a newly introduced metaheuristic algorithm,is inspired by the survival tactics of flying foxes in heat wave *** preferentially selects the best-performing *** tendency will cause the newly generated solution to remain closely tied to the candidate optimal in the search *** address this issue,the paper introduces an opposition-based learning-based search mechanism for FFO algorithm(IFFO).Firstly,this paper introduces niching techniques to improve the survival list method,which not only focuses on the adaptability of individuals but also considers the population’s crowding degree to enhance the global search ***,an initialization strategy of opposition-based learning is used to perturb the initial population and elevate its ***,to verify the superiority of the improved search mechanism,IFFO,FFO and the cutting-edge metaheuristic algorithms are compared and analyzed using a set of test *** results prove that compared with other algorithms,IFFO is characterized by its rapid convergence,precise results and robust stability.
The medical domain faces unique challenges in Information Retrieval (IR) due to the complexity of medical language and terminology discrepancies between user queries and documents. While traditional Keyword-Based Meth...
详细信息
Recommender systems aim to filter information effectively and recommend useful sources to match users' requirements. However, the exponential growth of information in recent social networks may cause low predictio...
详细信息
暂无评论