Skip to content

INFORMSJoC/2022.0333

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

INFORMS Journal on Computing Logo

This archive is distributed in association with the INFORMS Journal on Computing under the MIT License.

The software and data in this repository are a snapshot of the software and data that were used in the research reported on in the paper Product Redesign and Innovation Based on Online Reviews: A Multistage Combined Search Method by J. D. Qin, P. Zheng, and X. J. Wang.

Cite

To cite the contents of this repository, please cite both the paper and this repo, using their respective DOIs.

https://doi.org/10.1287/ijoc.2022.0333

https://doi.org/10.1287/ijoc.2022.0333.cd

Below is the BibTex for citing this version of the code.

@article{Qin2022Product,
  author =        {J. D. Qin, P. Zheng, and X. J. Wang},
  publisher =     {INFORMS Journal on Computing},
  title =         {Product Redesign and Innovation Based on Online Reviews: A Multistage Combined Search Method},
  year =          {2023},
  doi =           {10.1287/ijoc.2022.0333.cd},
  note =          {https://github.com/INFORMSJoC/2022.0333},
}  

Abstract

Online reviews published on the e-commerce platform provide a new source of information for designers to develop new products. Past research on new product development (NPD) using user-generated textual data commonly focused solely on extracting and identifying product features to be improved. However, the competitive analysis of product features and more specific improvement strategies have not been explored deeply. This study {fully uses} the rich semantic attributes of online review texts and proposes a novel online review--driven modeling framework. This new approach can extract fine-grained product features; calculate their importance, performance, and competitiveness; and build a competitiveness network for each feature. As a result, decision-making is assisted, and specific product improvement strategies are developed for NPD beyond existing modeling approaches in this domain. Specifically, online reviews are first classified into redesign- and innovation-related themes using a multiple embedding model, and the redesign and innovation product features can be extracted accordingly using a mutual information multilevel feature extraction method. Moreover, the importance and performance of features are calculated, and the competitiveness and competitiveness network of features are obtained through a personalized unidirectional bipartite graph algorithm. Finally, the importance—performance—competitiveness analysis plot is constructed, and the product improvement strategy is developed via a multistage combined search algorithm. Case studies and comparative experiments show the effectiveness of the proposed method and provide novel business insights for stakeholders, such as product providers, managers, and designers.

Description

Figure below shows the framework of the methodology, which is composed of three phases as follows:

  • Phase 1. Extract product features related to product redesign and innovation.

  • Phase 2. Calculate each feature's importance, performance, and competitiveness, and build a competitiveness network.

  • Phase 3. Construct the IPCA plot and develop a product improvement strategy.

In the first phase, online reviews are classified as redesign, innovation, or noise. Specifically, using part-of-speech (POS) tagging, dependency relations analysis, mutual information (MI) searching, and affinity propagation (AP) clustering, redesign and innovation features are extracted from redesign and innovation reviews, respectively. In the second phase, we can calculate the importance and performance of each feature using the SHAP method and sentiment analysis, and we determine each feature's competitiveness and competitiveness network using the proposed PUBG algorithm. In the final phase, based on the obtained importance, performance, and competitiveness, the IPCA plot can be constructed. Then, using the importance, performance, competitiveness network, and proposed MSCS algorithm, we can develop the product improvement strategy.

image.png

Environment Requirements

To run the code, you will need to make sure that you have the following dependencies installed:

Python 3.6 numpy tensorflow scikit-learn stanfordcorenlp networkx pandas gensim keras-xlnet

Replicating

The src folder contains all the code which implements the framework of this paper. Its structure is used when the provided software reads and writes files and references modules.

  • Categorize online reviews as redesign, innovation, or noise: ./src/Review Category/
  • Extract product features: ./src/Review Feature Extraction/
  • Clustering of extracted features: ./src/Review Feature Clustering/
  • Calculating the importance, performance and competitiveness of product features: ./src/Calculate Feature Importance, Performance and Competitiveness/
  • Develop specific product improvement strategies using the MSCS algorithm: ./src/MSCS Algorithm/

Results

Figure 4 details how the IPCA plot generates insights into product improvement strategies.

Figure 6 shows the IPCA plot of ten redesign and innovation features.

Figure 10 shows the effect comparison of the different algorithms.

You can access more results from the original paper.

Appendix

The file Online Supplement.pdf is the online supplementary material for the paper "Product Redesign and Innovation Based on Online Reviews: A Multistage Combined Search Method".

Acknowledgement

The authors thank the area editor and the anonymous reviewers for their constructive comments.

This work was supported by the National Natural Science Foundation of China (NSFC) under Project 72071151, and the Natural Science Foundation of Hubei Province, China (2023CFB712).