Hi,
I have five columns in the dataset named Product1, Product2, Product3, Product4, and Product5.
I want to find out how many products have been sold in 1 transaction. All columns have value 'Y' and 'N'.
To find out the combinations of 2 products bundle, I am using:
(case when
(`Product1` = 'Y' and `Product2` = 'Y') or
(`Product1` = 'Y' and `Product3` = 'Y') or
(`Product1` = 'Y' and `Product4` = 'Y') or
(`Product1` = 'Y' and `Product5` = 'Y') or
(`Product2` = 'Y' and `Product3` = 'Y') or
(`Product2` = 'Y' and `Product4` = 'Y') or
(`Product2` = 'Y' and `Product5` = 'Y') or
(`Product3` = 'Y' and `Product4` = 'Y') or
(`Product3` = 'Y' and `Product5` = 'Y') or
(`Product4` = 'Y' and `Product5` = 'Y')
then 1
end
)
Then I am taking the count of that. The resulting number is more than what it should be.
Any help will be appreciated.
Thanks