CN108665329B - Commodity recommendation method based on user browsing behavior - Google Patents

Commodity recommendation method based on user browsing behavior Download PDF

Info

Publication number
CN108665329B
CN108665329B CN201710200470.6A CN201710200470A CN108665329B CN 108665329 B CN108665329 B CN 108665329B CN 201710200470 A CN201710200470 A CN 201710200470A CN 108665329 B CN108665329 B CN 108665329B
Authority
CN
China
Prior art keywords
user
commodity
users
vector
training
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710200470.6A
Other languages
Chinese (zh)
Other versions
CN108665329A (en
Inventor
杨俊�
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Jingdong Century Trading Co Ltd
Beijing Jingdong Shangke Information Technology Co Ltd
Original Assignee
Beijing Jingdong Century Trading Co Ltd
Beijing Jingdong Shangke Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Jingdong Century Trading Co Ltd, Beijing Jingdong Shangke Information Technology Co Ltd filed Critical Beijing Jingdong Century Trading Co Ltd
Priority to CN201710200470.6A priority Critical patent/CN108665329B/en
Publication of CN108665329A publication Critical patent/CN108665329A/en
Application granted granted Critical
Publication of CN108665329B publication Critical patent/CN108665329B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping [e-shopping]
    • G06Q30/0631Item recommendations

Landscapes

  • Business, Economics & Management (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Development Economics (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a commodity recommendation method based on user browsing behaviors, which comprises the following steps: analyzing, by the purchase behavior predictor, merchandise browsing history data of one or more users, determining a first set of users that are more likely to make purchases over a future period of time; for each user in the determined first set of users: acquiring one or more commodities browsed by the user from commodity browsing history data of the user, performing commodity similarity calculation on the one or more commodities and target commodities, determining that the user is more likely to purchase the target commodities in the future period based on the commodity similarity calculation result, and adding the user to a second user set; and recommending the target commodity to each user in the determined second set of users.

Description

Commodity recommendation method based on user browsing behavior
Technical Field
The invention relates to a commodity recommendation method, a commodity recommendation system, electronic equipment and a readable storage medium based on user browsing behaviors.
Background
With the rise of the internet, more and more people go to shopping on the internet. The competition of the e-commerce industry is also becoming more and more intense. Various marketing approaches are also layered. In order to promote sales of commodities, marketing departments will promote the sales of commodities, and various marketing information is usually sent to users through emails, short messages, mobile phone push messages and the like. Mining potential users who are most likely to purchase goods has become an integral part of the marketing department.
Before sending marketing information to users, it is necessary to analyze users who potentially purchase the targeted merchandise being promoted. At present, three common commodity recommendation methods are generally available:
Recommendation based on crowd labels: various tags of the user, such as gender tags, income tags, consumer tags, occupation tags, etc., are analyzed for marketing to the crowd having the corresponding tag. Based on the crowd labeling method, the actual effect is not good, the identification of the labels is a complex process, and many labels are a dynamic change process, so that the accuracy of the labels is not high.
Recommendation based on commodities: finding out commodities similar to the target marketing commodity, and then extracting the user purchasing the similar commodity for marketing. The commodity-based method ignores the characteristics of the user, such as the fact that two commodities are very similar but different brands, and if the brand loyalty of the user is very high, the marketing can even cause the user to feel disliked.
Recommendation based on model training: the recommendation based on model training currently used mainly uses user evaluation and scoring of commodities as basic data. However, user ratings and ratings are affected by many subjective factors and data is not fully collected in the event that the user is not interested in the ratings or ratings and does not perform the corresponding action. Therefore, model training is performed by evaluation or scoring, the most realistic purchasing habit of the user cannot be obtained, and the crowd of potential purchasing target commodities cannot be accurately analyzed.
Disclosure of Invention
Therefore, the embodiment of the invention provides a commodity recommending method based on user browsing behaviors, which can accurately analyze the crowd purchasing potential target commodities by using a method of combining model calculation with similarity calculation, and improves the actual order conversion rate of users. Analyzing the historical data of each user by using model calculation to determine the possibility of purchasing each user in a future period, so as to obtain a user set which is more likely to perform purchasing behavior; and then, carrying out similarity calculation on the target commodity and the commodity browsed by the user who is likely to carry out purchasing behavior, and finally obtaining the user who is likely to purchase the target commodity in a future period. Thereby enabling the recommendation of the target commodity to a user who is likely to purchase the target commodity.
In order to achieve the above object, according to an aspect of the present invention, there is provided a commodity recommendation method based on browsing behavior of a user.
According to one aspect of the technical scheme, the commodity recommendation method based on the browsing behavior of the user comprises the following steps: analyzing, by the purchase behavior predictor, merchandise browsing history data of one or more users, determining a first set of users that are more likely to make purchases over a future period of time; for each user in the determined first set of users: acquiring one or more commodities browsed by the user from commodity browsing history data of the user, performing commodity similarity calculation on the one or more commodities and target commodities, determining that the user is more likely to purchase the target commodities in the future period based on the commodity similarity calculation result, and adding the user to a second user set; and recommending the target commodity to each user in the determined second set of users.
Optionally, wherein analyzing, by the purchase behavior predictor, the merchandise browsing history data of the one or more users, determining the first set of users more likely to make purchases during the future period comprises: for each of the one or more users: determining, by the purchase behavior predictor, a likelihood that the user will conduct a purchase behavior during the future period based on the merchandise browsing history data of the user; in response to determining that the likelihood is greater than a threshold, the user is added to the first set of users.
Optionally, training the purchase behavior predictor, wherein the training comprises: receiving sample commodity browsing history data of a plurality of specific users; generating training data associated with the plurality of particular users based on the received sample merchandise browsing history data; training the purchase behavior predictor using the training data.
Optionally, generating training data associated with the plurality of particular users includes: for each specific user in the plurality of specific users, extracting characteristics of sample basic historical data of the specific user based on predefined user behavior characteristics and adding labels to generate training data associated with the specific user; training data associated with each particular user is aggregated, the aggregated training data is normalized, and the training data associated with the plurality of particular users is generated.
Optionally, wherein performing the commodity similarity calculation on the one or more commodities to the target commodity includes: constructing a target commodity vector of the target commodity based on a predefined commodity attribute feature model aiming at the class of the target commodity; for each user in the first set of users: based on the predefined commodity attribute feature model, constructing a comparison commodity vector of commodities in the same category as the target commodity in commodities browsed by the user, wherein the target commodity vector and the comparison commodity vector contain the same vector dimension; and carrying out similarity calculation on the target commodity vector and the comparison commodity vector.
Optionally, wherein the training data of the user is represented in a vector.
Alternatively, the method in which the model is trained is a classification algorithm.
Optionally, wherein the similarity calculation is a cosine similarity calculation with a correction coefficient.
According to another aspect of the technical scheme of the invention, a commodity recommendation device based on user browsing behaviors is provided.
According to another aspect of the present invention, an apparatus for recommending goods based on browsing behavior of a user includes: the model calculation module is used for: analyzing, by the purchase behavior predictor, merchandise browsing history data of one or more users, determining a first set of users that are more likely to make purchases over a future period of time; for each user in the determined first set of users: acquiring one or more commodities browsed by the user from commodity browsing history data of the user, performing commodity similarity calculation on the one or more commodities and target commodities, determining that the user is more likely to purchase the target commodities in the future period based on the commodity similarity calculation result, and adding the user to a second user set; and recommending the target commodity to each user in the determined second set of users.
Optionally, the model calculation module is further configured to: for each of the one or more users: determining, by the purchase behavior predictor, a likelihood that the user will conduct a purchase behavior during the future period based on the merchandise browsing history data of the user; in response to determining that the likelihood is greater than a threshold, the user is added to the first set of users.
Optionally, the model calculation module is further configured to train the purchase behavior predictor, wherein the training includes: receiving sample commodity browsing history data of a plurality of specific users; generating training data associated with the plurality of particular users based on the received sample merchandise browsing history data; training the purchase behavior predictor using the training data.
Optionally, wherein the model calculation module is further configured to: for each specific user in the plurality of specific users, extracting characteristics of sample basic historical data of the specific user based on predefined user behavior characteristics and adding labels to generate training data associated with the specific user; training data associated with each particular user is aggregated, the aggregated training data is normalized, and the training data associated with the plurality of particular users is generated.
Optionally, the similarity calculation module is further configured to: constructing a target commodity vector of the target commodity based on a predefined commodity attribute feature model aiming at the class of the target commodity; for each user in the first set of users: based on the predefined commodity attribute feature model, constructing a comparison commodity vector of commodities in the same category as the target commodity in commodities browsed by the user, wherein the target commodity vector and the comparison commodity vector contain the same vector dimension; and carrying out similarity calculation on the target commodity vector and the comparison commodity vector.
Optionally, wherein the training data of the user is represented in a vector.
Alternatively, the method in which the model is trained is a classification algorithm.
Optionally, wherein the similarity calculation is a cosine similarity calculation with a correction coefficient.
To achieve the above object, according to still another aspect of the present invention, there is provided an electronic apparatus.
An electronic apparatus of the present invention includes: one or more processors; and the storage device is used for storing one or more programs, and when the one or more programs are executed by the one or more processors, the one or more processors realize the commodity recommendation method based on the browsing behaviors of the user.
To achieve the above object, according to still another aspect of the present invention, there is provided a computer-readable storage medium.
A computer readable storage medium of the present invention has stored thereon a computer program which, when executed by a processor, implements the method for commodity recommendation based on user browsing behavior provided by the present invention.
One embodiment of the above invention has the following advantages or benefits: because the technical means of combining model calculation with similarity calculation is adopted, the problem that a target user cannot be accurately positioned in the prior art is solved, and the technical effect of accurately analyzing potential crowds buying target commodities is achieved.
Further effects of the above-described non-conventional alternatives are described below in connection with the embodiments.
Drawings
The drawings are included to provide a better understanding of the invention and are not to be construed as unduly limiting the invention. Wherein:
FIG. 1 is a general flow chart of a merchandise recommendation method based on user browsing behavior according to an embodiment of the invention;
FIG. 2 is a flow chart of feature processing to generate a sample of a user according to an embodiment of the present invention;
FIG. 3 is a flow chart of model training to generate model files according to an embodiment of the present invention;
FIG. 4 is a flow chart of similarity calculation according to an embodiment of the present invention;
FIG. 5 is a system block diagram of a merchandise recommendation system based on user browsing behavior in accordance with an embodiment of the invention;
fig. 6 is a schematic diagram of a hardware structure of an electronic device for implementing a commodity recommendation method based on user browsing behavior according to an embodiment of the present invention.
Detailed Description
Exemplary embodiments of the present invention will now be described with reference to the accompanying drawings, in which various details of the embodiments of the present invention are included to facilitate understanding, and are to be considered merely exemplary. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the invention. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.
The invention uses model training to screen the user for the first time based on the sample of the user after the characteristic processing, predicts the user who is likely to purchase; and then, the user is screened for the second time by comparing the target commodity with the historical commodity browsed by the user who is likely to purchase, so as to predict the user who is likely to purchase the target commodity.
Sample of user: and performing feature processing and labeling on the user browsing history data to generate a sample of the user. Samples of multiple users may be used as training data for model training. In an example embodiment, user Amy browses 2 items at day 3 months 1 and adds one of the items to the shopping cart, browses 4 items at day 3 months 2, and purchases 2 items at day 3 months 4. The collection of these actions constitutes the underlying historical data of the user Amy. Processing the basic history data of Amy through a feature processing step, and finally forming a sample with features and labels. In one embodiment, the tag indicates whether the user purchased the merchandise. For example, 0 indicates that no merchandise has been purchased and 1 indicates that merchandise has been purchased. In alternative embodiments, the tag may represent other information, such as 0 indicating that no more than a predetermined number of items were purchased, and 1 indicating that more than a predetermined number of items were purchased.
The characteristics are as follows: we mine the features of various user behaviors through the history of the user's browsing. The features are defined according to a predetermined feature definition rule. In one embodiment, the characteristic values are represented by numbers.
And (3) model: models are generated by model training of thousands of samples. For example, a model is calculated by training samples of thousands of users. In one embodiment, the model may be a function. In an alternative embodiment, the model may be an algorithm. A model may be applied to each user's sample to predict the user's behavior in future time periods.
Model file: the model file holds the parameters of the model. And (3) carrying out mathematical operation on the user sample by introducing parameters in the model file into the model, and finally outputting a probability value between 0 and 1.
Fig. 1 is a general flow chart of a commodity recommendation method based on user browsing behavior according to an embodiment of the present invention. As shown in fig. 1, fig. 1 includes steps S11-S15.
In step S11 of fig. 1, a browsing history of each of a plurality of users is subjected to feature processing, and samples of the plurality of users are generated as training data. See fig. 2 for detailed steps.
Turning to fig. 2, fig. 2 is a flow chart illustrating a process of feature processing and sample generation based on historical data of a user, according to an embodiment of the present invention. In step S11-1 of FIG. 2, base history data for one or more users browsing websites is collected. In one embodiment, the underlying history data is from a business data store such as a flow table, click table, order table, reservation table, etc. stored in a storage unit of the distributed file system. Techniques such as mapreduce+hive may be employed to capture merchandise characteristics. In one embodiment, data may be processed in a user dimension, and the granularity of user browsing may be collected in a Stock Keeping Unit (SKU) number. In one embodiment, a timer is used to set the data acquisition time to achieve automatic data acquisition. For example, the night system is loaded less, and the timer can be set to automatically start data acquisition at night to avoid affecting the main business.
In step S11-2 of FIG. 2, feature extraction is performed on the base data for each user, and a user sample vector is generated. Table 1 below exemplarily lists example feature definition rules for user behavior. The feature extraction process screens dimensions and features such as those listed in table 1 according to predefined feature rules. In an example embodiment, for example, where the user Amy has focused on 4 items for the last week, but has not performed an action to join the shopping cart, the feature extraction process sets the "number of items focused" feature to 4 and the "join shopping cart" feature to 0. The desire of users to purchase goods is gradually diminished over time. Through analysis of the historical purchase records of the user, the closer the browsing time is, the greatest influence is on the commodity purchased by the user. In one embodiment, for exemplary purposes only, the feature extraction time dimension may be defined as the last 7 days, with example dimensions for feature extraction and corresponding example feature definitions shown in table 1. In other embodiments different features and extraction times may be defined.
In step S11-3 of FIG. 2, the feature is cleaned of outlier data and normalized for each dimension of the feature. Since the magnitude of each feature is not uniform, all dimensions of the user's sample vector must be normalized. In an exemplary embodiment, the number of commodities browsed by the user Amy is 5, the number of days between the last purchase and the present time is 200, and normalization processing is performed on '5' and '200' for processing by the algorithm. In one embodiment, the Min-Max Normalization normalization algorithm may be used to normalize the values to the interval Table 1 example feature definition rules
[0,1 ]. Alternatively, various other normalization algorithms such as Z-score may be employed. In an example embodiment, the sample of user Amy, bob, cindy, dora has features in two dimensions, "number of items of interest" and "number of items collected", respectively:
Table 2 example user samples
User name Concern the number of commodities Number of stored goods
Amy 2 4
Bob 8 3
Cindy 10 3
Dora 10000 2
In this example embodiment, the sample "number of commodities of interest" value of the user Dora is 10000, which is much larger than the values of other samples, and the Dora sample is removed as an outlier.
In one embodiment, taking the Min-Max Normalization normalization algorithm as an example, the feature value for each dimension is divided by the maximum value for that dimension feature. The user's sample is converted into the following form:
Table 3 example user samples feature cleaned
User name Concern the number of commodities Number of stored goods
Amy 0.2 1
Bob 0.8 0.75
Cindy 1 0.75
In step S11-4 of fig. 2, samples of the plurality of users subjected to the feature processing are output for use in the following steps. After the characterization process, the samples are labeled. In an example embodiment, the labels of the samples are defined as, for example, "positive and negative samples", and the selection criteria for the positive and negative samples are defined by parameter settings. For example, a positive sample may be defined as a user purchasing a good, and a negative sample may be defined as a user having browsing activity, but not purchasing a good. The output user sample is the sample with the feature + label. The feature-processed and labeled user sample is the basis for the subsequent operation of the present invention, which is based on the feature-processed and labeled user sample in the subsequent steps.
Returning to fig. 1, in step S12 of fig. 1, model training is performed using samples of a plurality of specific users according to predefined parameter settings, a purchase behavior predictor is generated, and a model file is output. The predefined parameters may set the start-stop time of the sample, specification of the model, sample partitioning criteria, etc. See fig. 3 for detailed steps.
Fig. 3 is a flow chart according to an embodiment of the invention. In step S12-1 of FIG. 3, the training set and the test set are partitioned. The training set is a sample set for model training, and the test set is a sample set for verifying the accuracy of the model. The 8:2 principle is generally adopted in model training. For example, 10 ten thousand samples of a particular user, 8 ten thousand as training sets and 2 ten thousand as test sets.
In step S12-2 of fig. 3, model training is performed using a specific user sample in the training set, and the model trained by the model is referred to as a purchase behavior predictor. In one embodiment, a supervised learning classification algorithm may be employed to train models (such as logistic regression, decision tree, etc. algorithms). The input data for model training is a two-dimensional matrix of user samples, where each row is a vector of user samples and each column represents a feature representing the value of the sample corresponding to the feature. For each sample, there is a model-trained target value. In embodiments employing supervised learning of the classification algorithm to train the model, there are only two target values for the user sample used for model training.
After model training on the training test set, the accuracy of the model is verified using the previously partitioned test set.
In step S12-3 of FIG. 3, the model file is output. The model file is a parameter of the purchase behavior predictor. Different model files may be generated for different scenarios. Different model files are stored in the storage unit for future selection for different predicted scenarios.
Returning to fig. 1, in step S13 of fig. 1, a sample of each user is predicted with the outputted model file to determine a first set of users likely to make purchases over a given future period of time. When the target user is predicted, a sample of the user is obtained, and then the parameters in the selected model file are utilized to carry out mathematical operation of a classification algorithm through a purchase behavior predictor, so that the purchase probability is obtained. In one embodiment, if the resulting probability is 0.8, a probability of 80% would be likely to make a purchase and a probability of 20% would not be. The purchase probability of each user is compared to a threshold, and a user having a purchase probability greater than the threshold is selected as the first set of users who are likely to make purchases.
In step S14 of fig. 1, the target commodity is compared with the commodities that each user in the first set of users who are more likely to purchase has browsed, and a second set of users who are likely to purchase the target commodity is determined. See fig. 4 for detailed steps.
Fig. 4 is a flowchart of similarity calculation according to an embodiment of the present invention. In S14-1 of fig. 4, a target commodity vector of the target commodity is constructed for the category of the target commodity based on the attribute feature model of the commodity defined in advance. The attribute features of the merchandise such as price, brand, place of origin, etc.
In S14-2 of fig. 4, for the commodities in the same category as the target commodity in the commodities once browsed by each user, a comparison commodity vector of the user is constructed based on the predefined attribute feature model of the commodity, wherein the target commodity vector and the comparison commodity vector contain the same vector dimension, and the vector dimension is the attribute feature.
In S14-3 of fig. 4, a similarity calculation is performed between the target commodity vector and the user' S reference commodity vector. In one embodiment, correction factors may be used in combination with cosine similarity to calculate.
In an embodiment employing a correction factor in combination with cosine similarity, the similarity calculation formula is as follows:
Wherein sigma is a correction coefficient, V 1 is a target commodity attribute feature vector, V 2 is a commodity attribute feature vector browsed by a user, and W i is a weight vector corresponding to each attribute. Wherein V 1 and V 2 comprise the same vector dimensions. And the goods browsed by the user are in the same category as the target goods. The vector dimensions represent the attribute features of the commodity, and the same vector dimensions represent the same commodity attributes.
V1=(v11,v12,v13........)V2=(v21,v22,v23........)W=(W1,W2,W3........)
In this embodiment of the present invention, in one embodiment,
Target commodity vector is preset to V 1 = (1,) No./No.)
The attribute feature values of the target commodity are all set to 1. Alternatively, the characteristic value of the price attribute may be obtained by dividing the actual price value by a normalization coefficient (e.g., 10000).
The calculation method of the comparison commodity vector V 2 browsed by the user is as follows:
each attribute feature value in the user's browsing commodity feature vector is calculated according to the following formula. Where price attributes are special attributes, and thus are calculated in a separate "price attribute" formula, other attributes, such as brands, are calculated in an "other attribute" formula.
Taking a computer as an example, the attribute and the weight of the attribute may be, for example:
and (3) a computer: price-1.0, screen size-0.3, hot spot-0.3, brand-0.6, body color-0.3, system-0.6, core number-0.3, memory-0.5.
1. The characteristic value calculation formula of the price attribute comprises the following steps:
a i is the price of the browsed commodity, b i is the browsing times of the commodity, and n is the number of the browsed commodities.
For example, the user browses 4 commodities, the prices are 5000, 4500, 6000, 3500, and the average prices are 1, 3, 1, respectively
(5000*1+4500*3+6000*1+3500*1)/6=4667,
Then divided by the normalization coefficient (e.g., 10000) to obtain a price characteristic value 0.4667.
2. The eigenvalue calculation formula of other attributes:
for example, when the user browses 4 items, brands A, B, C, D respectively, the total browsing times are 4 times, and brand C appears 2 times, the feature value of brand C is 2/4=0.5.
And then, calculating the similarity between each feature of the target commodity and each feature of the commodity in the same category browsed by the user respectively by adopting the cosine similarity based on the weight.
When the cosine similarity is calculated, the behavior of the user needs to be increased by using a correction coefficient from the perspective of the property of the commodity. For example, the number of times the user browses the item detail page the last week, the number of times the item detail page is clicked on the record point, and the like, and the behavior of the latest time is weighted more. The cosine similarity is improved, and the result of the cosine similarity is multiplied by a correction coefficient.
Correction coefficient calculation formula= (browsing coefficient+recording point coefficient)/2
1. The method for calculating the browsing coefficient a1 comprises the following steps:
n i is the number of browsing per day
2. Recording a point coefficient a2:
n i is the number of clicks of a record, and w i is the record weight (the top 8 record are ranked by the weight).
The calculated similarity is compared with a threshold value in S14-4 of fig. 4, and the user corresponding to the similarity greater than the threshold value is determined, generating a set of potential users who are likely to purchase the target commodity. In an example embodiment, a threshold value such as 0.8 may be set, and a user whose final similarity calculation result is greater than 0.8 is extracted as a user who is likely to purchase the target commodity.
Returning to fig. 1, in step S15 of fig. 1, the target commodity is recommended to the user in the second user set that is determined to be more likely to purchase the target commodity.
FIG. 5 is a system block diagram of a merchandise recommendation system based on user browsing behavior according to an embodiment of the invention. The system is divided into a model calculation module and a similarity calculation module. The model calculation module comprises a feature processing sub-module, a model training sub-module and a model prediction sub-module. The model calculation module carries out feature processing on the historical data of each user to obtain a sample of the user, carries out model training on the samples of a plurality of users to generate a model file, predicts the purchasing behavior of the user in the future period by using the sample of the target user and the selected model file, and determines a first user set which is likely to carry out purchasing behavior; the similarity calculation module calculates, for each user in the first set of users, a similarity between the target commodity and the historical commodity that the user has browsed, thereby determining a second set of users who are likely to purchase the target commodity.
According to an embodiment of the present invention, the present invention also provides an electronic device and a readable storage medium.
The electronic device of the present invention includes: at least one processor; and a memory communicatively coupled to the at least one processor; the memory stores instructions executable by the one processor, and the instructions are executed by the at least one processor, so that the at least one processor executes the commodity recommendation method based on the browsing behavior of the user.
The non-transitory computer readable storage medium of the present invention stores computer instructions for causing the computer to execute the method for commodity recommendation based on user browsing behavior provided by the present invention.
Fig. 6 is a schematic diagram of a hardware structure of an electronic device for implementing a method for recommending goods based on browsing behavior of a user according to an embodiment of the present invention. As shown in fig. 6, the electronic device includes: one or more processors 61 and a memory 62, one processor 61 being exemplified in fig. 6. The memory 62 is a non-transitory computer readable storage medium provided by the present invention.
The electronic device of the commodity recommendation method based on the browsing behavior of the user can further comprise: an input device 63 and an output device 64.
The processor 61, the memory 62, the input means 63 and the output means 64 may be connected by a bus or otherwise, in fig. 6 by way of example.
The memory 62 is used as a non-transitory computer readable storage medium for storing non-transitory software programs, non-transitory computer executable programs, and modules, such as program instructions/modules (e.g., the model calculation module 51 and the similarity calculation module 52 shown in fig. 5) corresponding to the method of commodity recommendation based on user browsing behavior in the embodiment of the present invention. The processor 61 executes various functional applications of the server and data processing, namely, a method of realizing commodity recommendation based on user browsing behavior in the above-described method embodiment by running non-transitory software programs, instructions, and modules stored in the memory 62.
Memory 62 may include a storage program area that may store an operating system, at least one application program required for functionality, and a storage data area; the storage data area may store data created according to the use of the apparatus for commodity recommendation based on the user's browsing behavior, and the like. In addition, the memory 62 may include high-speed random access memory, and may also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid state storage device. In some embodiments, memory 62 may optionally include memory remotely located with respect to processor 61, which may be connected via a network to a means of merchandise recommendation based on user browsing activity. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The input device 63 may receive input numeric or character information and key signal inputs related to user settings and function controls of the device that generate merchandise recommendations based on user browsing behavior. The output device 64 may include a display device such as a display screen.
The one or more modules are stored in the memory 62 and when executed by the one or more processors 61 perform the method of merchandise recommendation based on user browsing behavior in any of the method embodiments described above.
The product can execute the method provided by the embodiment of the invention, and has the corresponding functional modules and beneficial effects of the execution method. Technical details not described in detail in this embodiment may be found in the methods provided in the embodiments of the present invention.
According to the technical scheme of the embodiment of the invention, potential users purchasing target commodities are accurately excavated, the target commodities are recommended to the screened potential users, and finally the purpose of improving the order conversion rate of the users is achieved.
The above embodiments do not limit the scope of the present invention. It will be apparent to those skilled in the art that various modifications, combinations, sub-combinations and alternatives can occur depending upon design requirements and other factors. Any modifications, equivalent substitutions and improvements made within the spirit and principles of the present invention should be included in the scope of the present invention.

Claims (16)

1. A commodity recommendation method based on user browsing behavior, the method comprising:
analyzing, by the purchase behavior predictor, merchandise browsing history data of one or more users, determining a first set of users that are more likely to make purchases over a future period of time;
For each user in the determined first set of users:
One or more goods browsed by the user are obtained from the goods browsing history data of the user,
And calculating the commodity similarity between the one or more commodities and the target commodity, wherein the commodity similarity calculation comprises the following steps:
Constructing a target commodity vector of the target commodity based on a predefined commodity attribute feature model aiming at the class of the target commodity;
For each user in the first set of users:
Based on the predefined commodity attribute feature model, constructing a comparison commodity vector of commodities in the same category as the target commodity in commodities browsed by the user, wherein the target commodity vector and the comparison commodity vector contain the same vector dimension;
performing similarity calculation on the target commodity vector and the comparison commodity vector;
Determining that the user is more likely to purchase a target commodity in the future period based on the result of the commodity similarity calculation, and
Adding the user to a second set of users; and
Recommending the target commodity to each user in the determined second set of users.
2. The method of claim 1, wherein analyzing, by the purchase behavior predictor, the merchandise browsing history data of the one or more users, determining the first set of users that are more likely to make purchases during the future period comprises:
for each of the one or more users:
Determining, by the purchase behavior predictor, a likelihood that the user will conduct a purchase behavior during the future period based on the merchandise browsing history data of the user;
in response to determining that the likelihood is greater than a threshold, the user is added to the first set of users.
3. The method of claim 1, further comprising training the purchase behavior predictor, wherein the training comprises:
receiving sample commodity browsing history data of a plurality of specific users;
Generating training data associated with the plurality of particular users based on the received sample merchandise browsing history data;
Training the purchase behavior predictor using the training data.
4. The method of claim 3, wherein generating training data associated with the plurality of particular users comprises:
For each specific user in the plurality of specific users, extracting characteristics of sample basic historical data of the specific user based on predefined user behavior characteristics and adding labels to generate training data associated with the specific user;
training data associated with each particular user is aggregated, the aggregated training data is normalized, and the training data associated with the plurality of particular users is generated.
5. The method of any of claims 3-4, wherein the training data associated with the particular user is represented in a vector.
6. The method of any one of claims 3-4, wherein the method of training is a classification algorithm.
7. The method of any of claims 1-4, wherein the similarity calculation is a cosine similarity calculation with a correction coefficient.
8. An apparatus for commodity recommendation based on user browsing behavior, comprising:
the model calculation module is used for:
analyzing, by the purchase behavior predictor, merchandise browsing history data of one or more users, determining a first set of users that are more likely to make purchases over a future period of time;
For each user in the determined first set of users:
One or more goods browsed by the user are obtained from the goods browsing history data of the user,
And calculating the commodity similarity between the one or more commodities and the target commodity, wherein the commodity similarity calculation comprises the following steps:
Constructing a target commodity vector of the target commodity based on a predefined commodity attribute feature model aiming at the class of the target commodity;
For each user in the first set of users:
Based on the predefined commodity attribute feature model, constructing a comparison commodity vector of commodities in the same category as the target commodity in commodities browsed by the user, wherein the target commodity vector and the comparison commodity vector contain the same vector dimension;
performing similarity calculation on the target commodity vector and the comparison commodity vector;
Determining that the user is more likely to purchase a target commodity in the future period based on the result of the commodity similarity calculation, and
Adding the user to a second set of users; and
Recommending the target commodity to each user in the determined second set of users.
9. The apparatus of claim 8, wherein the model calculation module is further to:
for each of the one or more users:
Determining, by the purchase behavior predictor, a likelihood that the user will conduct a purchase behavior during the future period based on the merchandise browsing history data of the user;
in response to determining that the likelihood is greater than a threshold, the user is added to the first set of users.
10. The apparatus of claim 8, wherein the model calculation module is further configured to train the purchase behavior predictor, wherein the training comprises:
receiving sample commodity browsing history data of a plurality of specific users;
Generating training data associated with the plurality of particular users based on the received sample merchandise browsing history data;
Training the purchase behavior predictor using the training data.
11. The apparatus of claim 10, wherein the model calculation module is further configured to:
For each specific user in the plurality of specific users, extracting characteristics of sample basic historical data of the specific user based on predefined user behavior characteristics and adding labels to generate training data associated with the specific user;
training data associated with each particular user is aggregated, the aggregated training data is normalized, and the training data associated with the plurality of particular users is generated.
12. The apparatus of any of claims 10-11, wherein the training data associated with the particular user is represented in a vector.
13. The apparatus of any one of claims 10-11, wherein the method of training is a classification algorithm.
14. The apparatus of any of claims 8-11, wherein the similarity calculation is a cosine similarity calculation with a correction coefficient.
15. An electronic device, comprising:
One or more processors;
storage means for storing one or more programs,
When executed by the one or more processors, causes the one or more processors to implement the method of any of claims 1-7.
16. A computer readable storage medium, on which a computer program is stored, characterized in that the program, when being executed by a processor, implements the method according to any of claims 1-7.
CN201710200470.6A 2017-03-29 2017-03-29 Commodity recommendation method based on user browsing behavior Active CN108665329B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710200470.6A CN108665329B (en) 2017-03-29 2017-03-29 Commodity recommendation method based on user browsing behavior

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710200470.6A CN108665329B (en) 2017-03-29 2017-03-29 Commodity recommendation method based on user browsing behavior

Publications (2)

Publication Number Publication Date
CN108665329A CN108665329A (en) 2018-10-16
CN108665329B true CN108665329B (en) 2024-09-24

Family

ID=63787066

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710200470.6A Active CN108665329B (en) 2017-03-29 2017-03-29 Commodity recommendation method based on user browsing behavior

Country Status (1)

Country Link
CN (1) CN108665329B (en)

Families Citing this family (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111125503B (en) * 2018-10-31 2023-04-07 北京字节跳动网络技术有限公司 Method and apparatus for generating information
CN109242649A (en) * 2018-10-31 2019-01-18 广州品唯软件有限公司 A kind of Method of Commodity Recommendation and relevant apparatus
CN111241382A (en) * 2018-11-28 2020-06-05 北京京东尚科信息技术有限公司 Data processing method and device, storage medium and electronic equipment
CN111275511A (en) * 2018-12-05 2020-06-12 北京京东尚科信息技术有限公司 Method, device, electronic equipment and medium for identifying competitive commodities
CN109933741B (en) * 2019-02-27 2020-06-23 京东数字科技控股有限公司 Method, device and storage medium for extracting user network behavior characteristics
CN111724221B (en) * 2019-03-19 2024-07-19 北京京东尚科信息技术有限公司 Method, system, electronic device and storage medium for determining commodity matching information
CN111797858A (en) * 2019-04-09 2020-10-20 Oppo广东移动通信有限公司 Model training method, behavior prediction method, device, storage medium and equipment
CN110827112B (en) * 2019-09-16 2023-02-03 中国平安人寿保险股份有限公司 Deep learning commodity recommendation method and device, computer equipment and storage medium
CN112837106A (en) * 2019-11-22 2021-05-25 上海哔哩哔哩科技有限公司 Commodity recommendation method and device and computer equipment
CN111159555A (en) * 2019-12-30 2020-05-15 北京每日优鲜电子商务有限公司 Commodity recommendation method, commodity recommendation device, server and storage medium
CN113139767A (en) * 2020-01-20 2021-07-20 菜鸟智能物流控股有限公司 Logistics object allocation method and device, electronic equipment and computer-readable storage medium
CN111429203A (en) * 2020-03-02 2020-07-17 北京明略软件系统有限公司 Commodity recommendation method and device based on user behavior data
CN113393281A (en) * 2020-03-11 2021-09-14 北京沃东天骏信息技术有限公司 Method and device for processing request
CN113743636B (en) * 2020-05-28 2024-05-14 北京达佳互联信息技术有限公司 Target operation prediction method and device, electronic equipment and storage medium
US20210390436A1 (en) * 2020-06-11 2021-12-16 Sap Se Determining Categories For Data Objects Based On Machine Learning
CN112036981B (en) * 2020-09-02 2021-09-17 珠海必要工业科技股份有限公司 Method, device, equipment and medium for providing target comparison commodities
CN113763085B (en) * 2020-09-23 2024-07-19 京东城市(北京)数字科技有限公司 Information pushing method, system, electronic equipment and storage medium
CN112149037B (en) * 2020-09-28 2024-03-19 微梦创科网络科技(中国)有限公司 Method and system for identifying abnormal attention in real time based on logistic regression
CN113298600A (en) * 2020-09-28 2021-08-24 阿里巴巴集团控股有限公司 Method and device for browsing commodity information
CN112149003B (en) * 2020-10-28 2023-06-20 浙江集享电子商务有限公司 Commodity community recommendation method and device and computer equipment
CN113761332A (en) * 2020-11-10 2021-12-07 北京沃东天骏信息技术有限公司 Data processing method, device, equipment and computer readable storage medium
CN113763012A (en) * 2020-12-29 2021-12-07 北京京东尚科信息技术有限公司 Model training method, similarity calculation device, similarity calculation equipment and similarity calculation medium
CN112767085B (en) * 2021-01-22 2024-05-24 武汉蔚来能源有限公司 Commodity similarity analysis and commodity recommendation method and device and computer storage medium
CN113781152A (en) * 2021-02-08 2021-12-10 北京沃东天骏信息技术有限公司 Object recommendation method and device
CN113254664B (en) * 2021-05-14 2022-05-24 震坤行工业超市(上海)有限公司 Enterprise-oriented item recommendation method and device and storage medium
CN113312555B (en) * 2021-06-15 2023-10-03 北京百度网讯科技有限公司 Conversion rate prediction method, conversion rate prediction device, electronic equipment and storage medium
CN113469786A (en) * 2021-06-29 2021-10-01 深圳市点购电子商务控股股份有限公司 Method and device for recommending articles, computer equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102156932A (en) * 2010-02-11 2011-08-17 阿里巴巴集团控股有限公司 Prediction method and device for secondary purchase intention of customers
CN103489117A (en) * 2012-06-12 2014-01-01 深圳市腾讯计算机系统有限公司 Method and system for information releasing
CN105469263A (en) * 2014-09-24 2016-04-06 阿里巴巴集团控股有限公司 Commodity recommendation method and device
CN105787770A (en) * 2016-04-27 2016-07-20 上海遥薇(集团)有限公司 Non-negative matrix factorization (NMF) algorithm-based big data commodity and service recommending method and system

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105528374A (en) * 2014-10-21 2016-04-27 苏宁云商集团股份有限公司 A commodity recommendation method in electronic commerce and a system using the same
CN104331816B (en) * 2014-10-28 2017-10-20 常州大学 Knowledge based learns big data user's purchase intention Forecasting Methodology with secret protection
CN104866540B (en) * 2015-05-04 2018-04-27 华中科技大学 A kind of personalized recommendation method based on group of subscribers behavioural analysis
CN105335875A (en) * 2015-10-30 2016-02-17 小米科技有限责任公司 Purchasing power prediction method and purchasing power prediction device
CN106127525A (en) * 2016-06-27 2016-11-16 浙江大学 A kind of TV shopping Method of Commodity Recommendation based on sorting algorithm

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102156932A (en) * 2010-02-11 2011-08-17 阿里巴巴集团控股有限公司 Prediction method and device for secondary purchase intention of customers
CN103489117A (en) * 2012-06-12 2014-01-01 深圳市腾讯计算机系统有限公司 Method and system for information releasing
CN105469263A (en) * 2014-09-24 2016-04-06 阿里巴巴集团控股有限公司 Commodity recommendation method and device
CN105787770A (en) * 2016-04-27 2016-07-20 上海遥薇(集团)有限公司 Non-negative matrix factorization (NMF) algorithm-based big data commodity and service recommending method and system

Also Published As

Publication number Publication date
CN108665329A (en) 2018-10-16

Similar Documents

Publication Publication Date Title
CN108665329B (en) Commodity recommendation method based on user browsing behavior
US11107118B2 (en) Management of the display of online ad content consistent with one or more performance objectives for a webpage and/or website
CN107729937B (en) Method and device for determining user interest tag
CN105989004B (en) Information delivery preprocessing method and device
JP6134444B2 (en) Method and system for recommending information
US20180047071A1 (en) System and methods for aggregating past and predicting future product ratings
CN111080398B (en) Commodity recommendation method, commodity recommendation device, computer equipment and storage medium
CN111095330B (en) Machine learning method and system for predicting online user interactions
CN112148973B (en) Data processing method and device for information push
CN113077317B (en) Article recommendation method, device, equipment and storage medium based on user data
CN112365283B (en) Coupon issuing method and device, terminal equipment and storage medium
US20180053199A1 (en) Auto-segmentation
CN112001754A (en) User portrait generation method, device, equipment and computer readable medium
JP6679451B2 (en) Selection device, selection method, and selection program
US10198762B1 (en) Ordering search results to maximize financial gain
CN112925973B (en) Data processing method and device
US20140156394A1 (en) Targeted Advertisement Generation For Travelers
CN110348947B (en) Object recommendation method and device
JP5914549B2 (en) Information processing apparatus and information analysis method
WO2020221022A1 (en) Service object recommendation method
US20210217053A1 (en) Methods and apparatuses for selecting advertisements using semantic matching
CN112598472A (en) Product recommendation method, device, system, medium and program product
CN110689402A (en) Method and device for recommending merchants, electronic equipment and readable storage medium
JP2017027359A (en) Device, method, and program for processing information
CN111680213B (en) Information recommendation method, data processing method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant