With the rapid development of intelligent video surveillance technology,pedestrian re-identification has become increasingly important inmulti-camera surveillance *** technology plays a critical role in enhancing publ...
详细信息
With the rapid development of intelligent video surveillance technology,pedestrian re-identification has become increasingly important inmulti-camera surveillance *** technology plays a critical role in enhancing public ***,traditional methods typically process images and text separately,applying upstream models directly to downstream *** approach significantly increases the complexity ofmodel training and computational ***,the common class imbalance in existing training datasets limitsmodel performance *** address these challenges,we propose an innovative framework named Person Re-ID Network Based on Visual Prompt technology andMulti-Instance Negative Pooling(VPM-Net).First,we incorporate the Contrastive Language-Image Pre-training(CLIP)pre-trained model to accurately map visual and textual features into a unified embedding space,effectively mitigating inconsistencies in data distribution and the training *** enhancemodel adaptability and generalization,we introduce an efficient and task-specific Visual Prompt Tuning(VPT)technique,which improves the model’s relevance to specific ***,we design two key modules:the Knowledge-Aware Network(KAN)and theMulti-Instance Negative Pooling(MINP)*** KAN module significantly enhances the model’s understanding of complex scenarios through deep contextual semantic *** module handles samples,effectively improving the model’s ability to distinguish fine-grained *** experimental outcomes across diverse datasets underscore the remarkable performance of *** results vividly demonstrate the unique advantages and robust reliability of VPM-Net in fine-grained retrieval tasks.
Learning the accurate dynamics of robotic systems directly from the trajectory data is currently a prominent research *** physics-enforced networks,exemplified by Hamiltonian neural networks and Lagrangian neural netw...
详细信息
Learning the accurate dynamics of robotic systems directly from the trajectory data is currently a prominent research *** physics-enforced networks,exemplified by Hamiltonian neural networks and Lagrangian neural networks,demonstrate proficiency in modeling ideal physical systems,but face limitations when applied to systems with uncertain non-conservative dynamics due to the inherent constraints of the conservation laws *** this paper,we present a novel augmented deep Lagrangian network,which seamlessly integrates a deep Lagrangian network with a standard deep *** fusion aims to effectively model uncertainties that surpass the limitations of conventional Lagrangian *** proposed network is applied to learn inverse dynamics model of two multi-degree manipulators including a 6-dof UR-5 robot and a 7-dof SARCOS manipulator under *** experimental results clearly demonstrate that our approach exhibits superior modeling precision and enhanced physical credibility.
Low earth orbit(LEO) satellite edge computing can overcome communication difficulties in harsh environments, which lack the support of terrestrial communication infrastructure. It is an indispensable option for achiev...
详细信息
Low earth orbit(LEO) satellite edge computing can overcome communication difficulties in harsh environments, which lack the support of terrestrial communication infrastructure. It is an indispensable option for achieving worldwide wireless communication coverage in the future. To improve the quality-of-service(QoS) for Internet-of-things(IoT) devices, we combine LEO satellite edge computing and ground communication systems to provide network services for IoT devices in harsh environments. We study the QoS-aware computation offloading(QCO) problem for IoT devices in LEO satellite edge computing. Then we investigate the computation offloading strategy for IoT devices that can minimize the total QoS cost of all devices while satisfying multiple constraints, such as the computing resource constraint, delay constraint, and energy consumption constraint. We formulate the QoSaware computation offloading problem as a game model named QCO game based on the non-cooperative competition game among IoT devices. We analyze the finite improvement property of the QCO game and prove that there is a Nash equilibrium for the QCO game. We propose a distributed QoS-aware computation offloading(DQCO) algorithm for the QCO game. Experimental results show that the DQCO algorithm can effectively reduce the total QoS cost of IoT devices.
Breast mass identification is of great significance for early screening of breast cancer,while the existing detection methods have high missed and misdiagnosis rate for small *** propose a small target breast mass det...
详细信息
Breast mass identification is of great significance for early screening of breast cancer,while the existing detection methods have high missed and misdiagnosis rate for small *** propose a small target breast mass detection network named Residual asymmetric dilated convolution-Cross layer attention-Mean standard deviation adaptive selection-You Only Look Once(RCM-YOLO),which improves the identifiability of small masses by increasing the resolution of feature maps,adopts residual asymmetric dilated convolution to expand the receptive field and optimize the amount of parameters,and proposes the cross-layer attention that transfers the deep semantic information to the shallow layer as auxiliary information to obtain key feature *** the training process,we propose an adaptive positive sample selection algorithm to automatically select positive samples,which considers the statistical features of the intersection over union sets to ensure the validity of the training set and the detection accuracy of the *** verify the performance of our model,we used public datasets to carry out the *** results showed that the mean Average Precision(mAP)of RCM-YOLO reached 90.34%,compared with YOLOv5,the missed detection rate for small masses of RCM-YOLO was reduced to 11%,and the single detection time was reduced to 28 *** detection accuracy and speed can be effectively improved by strengthening the feature expression of small masses and the relationship between *** method can help doctors in batch screening of breast images,and significantly promote the detection rate of small masses and reduce misdiagnosis.
A two-stage algorithm based on deep learning for the detection and recognition of can bottom spray codes and numbers is proposed to address the problems of small character areas and fast production line speeds in can ...
详细信息
A two-stage algorithm based on deep learning for the detection and recognition of can bottom spray codes and numbers is proposed to address the problems of small character areas and fast production line speeds in can bottom spray code number *** the coding number detection stage,Differentiable Binarization Network is used as the backbone network,combined with the Attention and Dilation Convolutions Path Aggregation Network feature fusion structure to enhance the model detection *** terms of text recognition,using the Scene Visual Text Recognition coding number recognition network for end-to-end training can alleviate the problem of coding recognition errors caused by image color distortion due to variations in lighting and background *** addition,model pruning and quantization are used to reduce the number ofmodel parameters to meet deployment requirements in resource-constrained environments.A comparative experiment was conducted using the dataset of tank bottom spray code numbers collected on-site,and a transfer experiment was conducted using the dataset of packaging box production *** experimental results show that the algorithm proposed in this study can effectively locate the coding of cans at different positions on the roller conveyor,and can accurately identify the coding numbers at high production line *** Hmean value of the coding number detection is 97.32%,and the accuracy of the coding number recognition is 98.21%.This verifies that the algorithm proposed in this paper has high accuracy in coding number detection and recognition.
Time series data plays a crucial role in intelligent transportation *** flow forecasting represents a precise estimation of future traffic flow within a specific region and time *** approaches,including sequence perio...
详细信息
Time series data plays a crucial role in intelligent transportation *** flow forecasting represents a precise estimation of future traffic flow within a specific region and time *** approaches,including sequence periodic,regression,and deep learning models,have shown promising results in short-term series ***,forecasting scenarios specifically focused on holiday traffic flow present unique challenges,such as distinct traffic patterns during vacations and the increased demand for long-term ***,the effectiveness of existing methods diminishes in such ***,we propose a novel longterm forecasting model based on scene matching and embedding fusion representation to forecast long-term holiday traffic *** model comprises three components:the similar scene matching module,responsible for extracting Similar Scene Features;the long-short term representation fusion module,which integrates scenario embeddings;and a simple fully connected layer at the head for making the final *** results on real datasets demonstrate that our model outperforms other methods,particularly in medium and long-term forecasting scenarios.
Dialogue-based relation extraction(DialogRE) aims to predict relationships between two entities in dialogue. Current approaches to dialogue relationship extraction grapple with long-distance entity relationships in di...
详细信息
Dialogue-based relation extraction(DialogRE) aims to predict relationships between two entities in dialogue. Current approaches to dialogue relationship extraction grapple with long-distance entity relationships in dialogue data as well as complex entity relationships, such as a single entity with multiple types of connections. To address these issues, this paper presents a novel approach for dialogue relationship extraction termed the hypergraphs and heterogeneous graphs model(HG2G). This model introduces a two-tiered structure, comprising dialogue hypergraphs and dialogue heterogeneous graphs, to address the shortcomings of existing methods. The dialogue hypergraph establishes connections between similar nodes using hyper-edges and utilizes hypergraph convolution to capture multi-level features. Simultaneously, the dialogue heterogeneous graph connects nodes and edges of different types, employing heterogeneous graph convolution to aggregate cross-sentence information. Ultimately, the integrated nodes from both graphs capture the semantic nuances inherent in dialogue. Experimental results on the DialogRE dataset demonstrate that the HG2G model outperforms existing state-of-the-art methods.
Instance co-segmentation aims to segment the co-occurrent instances among two *** task heavily relies on instance-related cues provided by co-peaks,which are generally estimated by exhaustively exploiting all paired c...
详细信息
Instance co-segmentation aims to segment the co-occurrent instances among two *** task heavily relies on instance-related cues provided by co-peaks,which are generally estimated by exhaustively exploiting all paired candidates in point-to-point ***,such patterns could yield a high number of false-positive co-peaks,resulting in over-segmentation whenever there are mutual *** tackle with this issue,this paper proposes an instance co-segmentation method via tensor-based salient co-peak search(TSCPS-ICS).The proposed method explores high-order correlations via triple-to-triple matching among feature maps to find reliable co-peaks with the help of co-saliency *** proposed method is shown to capture more accurate intra-peaks and inter-peaks among feature maps,reducing the false-positive rate of co-peak *** having accurate co-peaks,one can efficiently infer responses of the targeted *** on four benchmark datasets validate the superior performance of the proposed method.
We derive the transport equations from the Vlasov–Fokker–Planck equation when the velocity space is spherically *** Shkarofsky's form of Fokker–Planck–Rosenbluth collision operator is employed in the Vlasov–F...
详细信息
We derive the transport equations from the Vlasov–Fokker–Planck equation when the velocity space is spherically *** Shkarofsky's form of Fokker–Planck–Rosenbluth collision operator is employed in the Vlasov–Fokker–Planck equation.A closed-form relaxation model for homogeneous plasmas could be presented in terms of Gauss *** has been accomplished based on the Maxwellian mixture ***,we demonstrate that classic models such as two-temperature thermal equilibrium model and thermodynamic equilibrium model are special cases of our relaxation model and the zeroth-order Braginskii heat transfer model can also be *** present relaxation model is a nonequilibrium model based on the hypothesis that the plasmas system possesses finitely distinguishable independent features,without relying on the conventional near-equilibrium assumption.
The slow development of traditional computing has prompted the search for new materials to replace silicon-based computers. Bio-computers, which use molecules as the basis of computation, are highly parallel and infor...
详细信息
The slow development of traditional computing has prompted the search for new materials to replace silicon-based computers. Bio-computers, which use molecules as the basis of computation, are highly parallel and information capable, attracting a lot of attention. In this study, we designed a NAND logic gate based on the DNA strand displacement mechanism. We assembled a molecular calculation model, a 4-wire-2-wire priority encoder logic circuit, by cascading the proposed NAND gates. Different concentrations of input DNA chains were added into the system, resulting in corresponding output, through DNA hybridization and strand displacement. Therefore, it achieved the function of a priority encoder. Simulation results verify the effectiveness and accuracy of the molecular NAND logic gate and the priority coding system presented in this study. The unique point of this proposed circuit is that we cascaded only one kind of logic gate, which provides a beneficial exploration for the subsequent development of complex DNA cascade circuits and the realization of the logical coding function of information.
暂无评论