Why is it creating duplicate rows?

Hi, my first dataset has 21,683 rows, and the second dataset has 323 rows. However, when I joined them together, the resulting dataset has 31,045 rows. I noticed that there are duplicate rows. How can I fix this? Thank you!

Best Answer

  • GrantSmith (Coach)

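    It's likely that one of your datasets has duplicate values in the join key column, so each matching row from the other dataset is repeated once per duplicate. Double-check your join condition to make sure it matches on the columns you intend, and confirm that the key is unique in at least one of the two datasets.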

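
To illustrate the point about duplicate keys, here is a minimal sketch in plain Python. It is not tied to any particular tool. The table names, the `customer_id` column, and the helper functions `duplicate_keys` and `inner_join` are all made up for the example. It shows how a repeated key on one side inflates the joined row count, how to list the offending keys, and how the count returns to normal once the key is unique.

```python
from collections import Counter


def duplicate_keys(rows, key):
    """Return {key_value: count} for key values that appear more than once."""
    counts = Counter(row[key] for row in rows)
    return {value: n for value, n in counts.items() if n > 1}


def inner_join(left, right, key):
    """Naive inner join: each left row pairs with every right row sharing its key."""
    by_key = {}
    for row in right:
        by_key.setdefault(row[key], []).append(row)
    return [{**l, **r} for l in left for r in by_key.get(l[key], [])]


# Hypothetical data: 3 order rows and a lookup table with one repeated key.
orders = [
    {"customer_id": 1, "order": "A"},
    {"customer_id": 1, "order": "B"},
    {"customer_id": 2, "order": "C"},
]
customers = [
    {"customer_id": 1, "name": "Ana"},
    {"customer_id": 1, "name": "Ana"},  # repeated key value
    {"customer_id": 2, "name": "Ben"},
]

# Each order for customer 1 matches two lookup rows, so 3 rows become 5.
joined = inner_join(orders, customers, "customer_id")
print(len(joined))  # 5

# List the key values that are causing the fan-out.
print(duplicate_keys(customers, "customer_id"))  # {1: 2}

# Keep one row per key (the last one seen), then join again.
deduped = list({row["customer_id"]: row for row in customers}.values())
print(len(inner_join(orders, deduped, "customer_id")))  # 3
```

Running the same duplicate check on the key column of each of your datasets before joining should show which side needs to be deduplicated or aggregated first.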

Answers

  • @GrantSmith Thank you! I saw that I have a duplicate key column.