Research on the distribution patterns and interrelationships of sales volume of various vegetable categories and individual products
Abstract
This study conducts a comprehensive analysis of vegetable sales data. Using Python, the collected data was integrated, categorized, and checked for missing values with the "missingno" library. Anomalies were detected using box plots, revealing a few outliers, which were retained as they more accurately reflect real business phenomena. The analysis explored distribution patterns and interrelationships of sales quantities across different vegetable categories and individual items. Time series decomposition revealed significant seasonal variations in sales volumes throughout the year. Model accuracy was validated through residual analysis, and missing values were imputed using the Prophet model. Dynamic Time Warping (DTW) calculated distance matrices between categories to uncover similarities. K-means clustering analyzed sales trends and seasonal patterns of individual items, with DTW providing detailed similarity analysis within clusters. This approach identified correlated sales trends, such as the high correlation between Chinese cabbage and bell peppers, indicating that consumers may prefer to purchase these vegetables together. These findings offer valuable insights for optimizing supermarket restocking strategies.
Show Figures
Share and Cite
Article Metrics
References
- Sai H ,Ao L ,Dongyue Z , et al. Early warning of core network capacity in space-terrestrial integrated networks [J]. Journal of Systems Engineering and Electronics, 2024, 35 (04): 855-864.
- Chang Yanan, Duan Xingzhuo, Cui Jianqun, etc Opportunistic Network Routing Algorithm Integrating Unsupervised Learning Model X-Means [J/OL] Small microcomputer system, 1-13 [2022-08-26].
- Liu Yang, Zhou Haoyue, Lu Jinqi, etc Prediction of the incidence trend of influenza like cases in Jiaxing City, Zhejiang Province based on the Prophet model [J] Disease Surveillance, 2024, 39 (05): 629-633.
- Bi Zihang, Li Sumin, Zhang Longyu, etc Multi track time-series InSAR mining area 3D deformation monitoring and early warning combined with Prophet CNN model [J] Surveying and Mapping Bulletin, 2024, (05): 53-59 DOI:10.13474/j.cnki.11-2246.2024.0510.
- Zhang Shuhan, Cheng Yuehua, Jiang Bin A multivariate trend prediction method for solar cell arrays based on the STL Prophet Informer model [J] Space Control Technology and Applications, 2024, 50 (01): 35-45.
- Wei Meifang, Yang Jing, Huang Di, etc High loss line electricity theft detection method based on segmented dynamic time bending distance [J/OL] Southern Power Grid Technology, 1-9 [2022-08-26].
- Zhang Haiyan, Yan Wenjun, Zhang Limin, etc Research on Pilot Landing Skill Evaluation Based on Differential Thinking and Dynamic Time Warping (DTW) [J] Journal of Weapon Equipment Engineering, 2023, 44 (03): 124-130.
- Huang Zimeng, Yu Juan, Xiang Mingxu, etc PMU frequency anomaly detection and type recognition based on improved dynamic time bending [J] Power System Automation, 2022, 46 (24): 104-112.
- Ran Qisheng, Zhang Zhe, Han Jiexiang, etc Longitudinal protection scheme for DC distribution network lines based on improved dynamic time bending distance algorithm [J] Power Automation Equipment, 2022, 42 (12): 157-164.
- Wen Hongbo, Liu Xianwei, Jiang Youxiang Reliability analysis of K-means clustering method in setting middle school entrance examination standards [J] Chinese Exam, 2024, (08): 69-78 DOI:10.19360/j.cnki.11-3303/g4.2024.08.008.
- Shi Jiangnan, Peng Changgen, Tan Weijie K-means++clustering method supporting differential privacy protection in Spark framework [J] Information Security Research, 2024, 10 (08): 712-718.
- Liu Jiahui, Zhang Ping, Cao Jinyin, etc Exploration of Disease Cost Clustering Analysis and Fine Management Based on K-means Algorithm [J] Health Economics Research, 2024, 41 (08): 37-40+44 DOI:10.14055/j.cnki.33-1056/f.2024.08.006.
- Li Maolin, Xiao Dongsheng A precise positioning method for personnel inside buildings using the fusion of the strongest base station and K-means [J/OL] Surveying and Mapping Science, 1-12 [2022-08-26].