Predicting crop disease on the image obtained from the affected crop has been a potential research topic. In this research, the Localise Search Optimisation Algorithm (LSOA) enabled deep Convolutional Neural Network (...
详细信息
When the ground communication base stations in the target area are severely destroyed,the deployment of Unmanned Aerial Vehicle(UAV)ad hoc networks can provide people with temporary communication ***,it is necessary t...
详细信息
This paper deals with sine cosine algorithm to make a balance between exploration and exploitation of the search space and find best convergence rate for global optima, that are used the two trigonometric function sin...
详细信息
Video forgery detection has been necessary with recent spurt in fake videos like Deepfakes and doctored videos from multiple video capturing devices. In this paper, we provide a novel technique of detecting fake video...
详细信息
Even though every individual is entitled to freedom of speech, some limitations exist when this freedom is used to target and harm another individual or a group of people, as it translates to hate speech. In this stud...
详细信息
In the workplace, risk prevention helps detect the risks and prevent accidents. To achieve this, workers' mental and physical parameters related to their health should be focused on and analyzed. It helps improve ...
详细信息
Large models have recently played a dominant role in natural language processing and multimodal vision-language learning. However, their effectiveness in text-related visual tasks remains relatively unexplored. In thi...
详细信息
Large models have recently played a dominant role in natural language processing and multimodal vision-language learning. However, their effectiveness in text-related visual tasks remains relatively unexplored. In this paper, we conducted a comprehensive evaluation of large multimodal models, such as GPT4V and Gemini, in various text-related visual tasks including text recognition, scene text-centric visual question answering(VQA), document-oriented VQA, key information extraction(KIE), and handwritten mathematical expression recognition(HMER). To facilitate the assessment of optical character recognition(OCR) capabilities in large multimodal models, we propose OCRBench, a comprehensive evaluation benchmark. OCRBench contains 29 datasets, making it the most comprehensive OCR evaluation benchmark available. Furthermore, our study reveals both the strengths and weaknesses of these models, particularly in handling multilingual text, handwritten text, non-semantic text, and mathematical expression *** importantly, the baseline results presented in this study could provide a foundational framework for the conception and assessment of innovative strategies targeted at enhancing zero-shot multimodal *** evaluation pipeline and benchmark are available at https://***/Yuliang-Liu/Multimodal OCR.
Efficient supply chain management is necessary to meet customer demands. Demand forecasting is a predictive analysis that estimates how much of a product or service a customer will need in the future. Accurate demand ...
详细信息
The key-value separation is renowned for its significant mitigation of the write amplification inherent in traditional LSM trees. However, KV separation potentially increases performance overhead in the management of ...
详细信息
Ethereum has received increasing attention as the first blockchain platform to support smart *** mining has become an important tool for analyzing Ethereum ***,existing methods have the disadvantage of covering partia...
详细信息
Ethereum has received increasing attention as the first blockchain platform to support smart *** mining has become an important tool for analyzing Ethereum ***,existing methods have the disadvantage of covering partial transactions and being vulnerable to privacy-enhancing *** this paper,we propose a scheme for transaction correlation with the node as an entity,which can cover all transactions while being resistant to privacy-enhancing *** timestamps relayed from N fixed nodes to describe the network properties of transactions,we cluster transactions that enter the network from the same source *** results show that our method can determine with 97%precision whether two transactions enter the network from the same source node.
暂无评论