We theoretically propose and experimentally demonstrate nonlocal metasurfaces rationally designed using symmetry-breaking principles to manipulate the optical wavefront with a geometric phase selective to custom ellip...
详细信息
ISBN:
(纸本)9781957171258
We theoretically propose and experimentally demonstrate nonlocal metasurfaces rationally designed using symmetry-breaking principles to manipulate the optical wavefront with a geometric phase selective to custom elliptical polarizations.
Genomics study, as opposed to socio-anthropology, has been demonstrated as an excellent tool to picture biological relatedness and disease risk factors. To analyze the data obtained from the study, Genome-wide Associa...
详细信息
Genomics study, as opposed to socio-anthropology, has been demonstrated as an excellent tool to picture biological relatedness and disease risk factors. To analyze the data obtained from the study, Genome-wide Association Study (GWAS) has been more than decades known as the mainstay approach., is the most popular approach in analysing genomics data. The confounding variables selection, being that ancestry estimation or population stratification, is substantial to maintain the quality of GWAS. Researchers have developed various methods in extracting the population stratification information from high dimensional genomics data, especially Single Nucleotide Polymorphisms (SNPs) data. In the present study, we proposed an implementation of Principal Component Analysis (PCA)-complemented Gaussian Mixture Model (GMM) as an unsupervised model to estimate population stratification from samples. The results derived from this approach was further compared to that resulted from K-means and from the commonly used ancestry estimation software, fast STRUCTURE. We figured out that our recent improved approach outperformed the two later mentioned as shown by the average cluster and population scores. Furthermore, it was able to generate the probability distribution of each sample across all population, despite its limited quality. These intriguing results worth further investigations with much more comprehensive population coverage and more advanced algorithm.
We present CELL-E 2, a novel bidirectional transformer that can generate images depicting protein subcellular localization from the amino acid sequences (and vice versa). Protein localization is a challenging problem ...
We present CELL-E 2, a novel bidirectional transformer that can generate images depicting protein subcellular localization from the amino acid sequences (and vice versa). Protein localization is a challenging problem that requires integrating sequence and image information, which most existing methods ignore. CELL-E 2 extends the work of CELL-E, not only capturing the spatial complexity of protein localization and produce probability estimates of localization atop a nucleus image, but also being able to generate sequences from images, enabling de novo protein design. We train and finetune CELL-E 2 on two large-scale datasets of human proteins. We also demonstrate how to use CELL-E 2 to create hundreds of novel nuclear localization signals (NLS). Results and interactive demos are featured at https://***/CELL-E_2/.
Interactions that occur on Twitter social media are easier to do and can easily reach all levels of society, as well as conversations tweeted by users. They can easily spread information or issues that are developing....
Interactions that occur on Twitter social media are easier to do and can easily reach all levels of society, as well as conversations tweeted by users. They can easily spread information or issues that are developing. Based on this, the use of conversational data on the Twitter platform can get an overview of issues that are developing and even those that are just being formed from community conversations on this Twitter platform. Still related to previous research, which researched to implement the Twitter social media monitoring system to provide access to additional information for journalists. This study aligns with its predecessors but comes with a richer data collection mechanism, so it can gather more conversations. The difficulty to be answered in this study is the difficulty of journalist determining a bunch of location of an incident on a Twitter conversation, which describes the name of a place or contains the address of a location in it. Also in this study, there are additional modifications to extract location names using the NER model, which has been customized according to the structure of naming location names in Indonesia. So from the text of the Twitter conversation, whether it's an address name or a description of a location name, it can get a location name that refers to the name of a place where it could be the location where an event occurred. The results of an Indonesian language location NER model, with an accuracy score of 97.67%, with a precision value of 90.57%, recall 59.30% and f1-score 71.67%.
Artificial Neural Network (ANN) is a machine learning algorithm that can perform classification. ANN has limitations; namely, it has a black box working principle, which is unsure which feature is the most influential...
详细信息
ISBN:
(纸本)9781665499705
Artificial Neural Network (ANN) is a machine learning algorithm that can perform classification. ANN has limitations; namely, it has a black box working principle, which is unsure which feature is the most influential. This study is to identify the most influential features inside the ANN's black box using a classification model by applying Principal Component Analysis (PCA) dimension reduction combined with Pearson correlation analysis. The result of the proposed model can identify the name of the main features of the data inside the ANN's black box. This study uses two public Kaggle cardiovascular datasets. The first dataset consists of 13 features, and the second dataset consists of 12 features. The result is height and gender are the most influential features in the first dataset with the correlation value of 0.734; sex and smoking are the most influential features in the second dataset with the correlation value of 0.728. Black box model result with 2 PCA's features against a model with height and gender features in the first dataset resulting from the same accuracy on the test dataset of the classification prediction results with the value of 49.90%, while on the second dataset 58.30%.
Drying has been an eco-friendly and cost-efficient method to reduce post-harvest losses of agricultural crops. In various countries, the technique has been widely utilized in the form of Solar Dryer Dome (SDD) buildin...
详细信息
Tuna fishing in Indonesia is excessive and if left unchecked, some types of tuna will become extinct within 3-10 years. There is an urgent need to help MMAF for producing more accurate fish catches data. This paper pr...
详细信息
Topology can extract the structural information in a dataset efficiently. In this paper, we attempt to incorporate topological information into a multiple output Gaussian process model for transfer learning purposes. ...
详细信息
An e-Commerce company has been using an Enterprise Resource Planning (ERP) system for several years, but is still constrained in its implementation, this is reflected in the number of issue/change request tickets subm...
详细信息
It is necessary to study technical factors such as bait used, oceanographic conditions of fishing areas and skipjack tuna trade patterns (Katsuwonus pelamis) as well as other factors in Sulawesi Fisheries. Supporting ...
详细信息
暂无评论