This paper presents a novel Transformer-based customer interest estimation method using videos from a security camera in a real store. Expectations for the application of Artificial Intelligence (AI) technology in various industrial fields have increased. In this study, we focus on the retail industry field rather than on the manufacturing industry field where AI technology has already been widely introduced. In the retail industry, understanding customer interest in products is one of the significant issues, and Point-of-Sales (POS) data have been used for the analysis of customer data. However, the information of customers before the purchase cannot be obtained from the POS data, which was a limitation due to the characteristics of the data. To provide a solution to the problem, we propose a new customer interest estimation method using visual information obtained from a security camera. The proposed method consists of three phases: Re-identification phase, 3D posture estimation phase, and interest estimation phase. The advantage of our architecture is that the Reidentification phase enables individual identification of multiple persons, and the 3D posture estimation phase obtains highly expressive posture information. Then interest estimation phase is used to enable individual interest estimation. Finally, we achieve a high-level customer interest estimation performance using the data obtained from a real store.
KEYWORDS: Head, Artificial intelligence, Information technology, Information science, Visualization, Vector spaces, Fluctuations and noise, Data centers, Compact discs
This paper presents a novel artist recommendation method based on knowledge graph and reinforcement learning. In the field of music services, online platforms based on subscriptions are becoming the mainstream, and the recommendation technology needs to be updated accordingly. In this field, it is desirable to achieve user-centered recommendation that satisfies various user preferences, rather than the recommendation that is biased toward popular songs and artists. Our method realizes highly accurate and explainable artist recommendation by exploring the knowledge graph constructed from users’ listening histories and artist metadata. We have confirmed the effectiveness of our method by comparing it with an existing state-of-the-art method.
This paper proposes a customer interest estimation method using security camera to meet the demand of the retail industry. In the field of retail industry, it is considered that the understanding of customers’ interests in the real store can be used for various marketing activities such as the product development and the layout of the store. Then, it is important to pay attention to customers’ behavior in the real store. Their behavior is often recorded by the cameras installed in the store for security purposes. A method for estimating their interests from the videos of the security camera is presented in this paper. The novelty of our method is three-fold. Firstly, the experimental data of subjects in our group were taken by using the security camera already installed in the real store. Secondly, we used a pre-trained posture estimation model and treated the results as the features to be trained by a two-layer neural network model. Finally, a professional have annotated the subjects’ interests. The effectiveness of our method was confirmed by comparing with benchmark supervised machine learning models.
This paper presents a new interior coordination image retrieval method using object-detection-based and color features. Interior coordination requires consideration of objects’ positional information and the overall atmosphere of the room simultaneously. However, similar image retrieval methods considering the coordination characteristics have not been proposed. In the proposed method, we extract different types of features from interior coordination images and realize the similar interior coordination image retrieval based on our newly derived features.
This paper presents a method for action detection based on Temporal Cycle Consistency(TCC) Learning. The proposed method realizes the action detection of flexible length segments based on a frame-level action prediction technique. We enable calculation of similarities for spatio-temporal features based on TCC to detect target actions from input videos. Finally, our method determines temporal segments by smoothing the frame-level action detection result. Experimental results show the validity of the proposed method.
In this paper, we propose a method to classify metastatic bone tumors using treatment-planning computed tomography images. The proposed method utilizes pre-trained deep convolutional neural network (DCNN) models as feature extractors and enables the metastatic bone tumor classification by using the obtained features. Performance of several state-of-the-art DCNN-based features was compared and evaluated in our experiment.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.