Preparing product purchase data for PySpark ALS implicit recommendations
I'm trying to build a product recommender. I'm using a pyspark ml recommendation ALS matrix factorization model. I have data like the example data below, where I have customer and product id and the...
user3476463
Votes: 0
Answers: 1
How to drop original columns in a Spark ML transformer
When we run a Spark ML transformer, we provide input and output columns. The transformed dataset contains both types of columns, i.e. the original columns and the transformed columns,
e.g.
from pyspark.ml.feature i...
Fahadakbar
Votes: 0
Answers: 0
Matrix factorization model returning much smaller DataFrame after predicting ratings in PySpark
I'm trying to create a product recommender with the code below. I'm using matrix factorization from spark ml. I have data that has a customer_id, product_id, and a numeric rating value that has been...
user3476463
Votes: 0
Answers: 1
Can PySpark ML models be run on only parts of a dataframe, depending on a condition?
I have trained a logistic regression model to match job titles and descriptions to a set of 4-digit numeric codes. It does this very well. It will form part of a pipeline that first attempts to ma...
Alex Howard
Votes: 0
Answers: 1