Datasets

Datasets

How do I filter by most recent date?

Contributor
edited March 2024 in Datasets

We have a dataset that get multiple 'loan_id' entries recorded (shown below)

However, when I go to remove duplicates on any of the columns, I generally get the wrong loan amounts.

The correct loan amounts are always the ones with the latest 'time_stamp'

I am hoping to remove any duplicate 'loan_id' based off the latest time stamp.

How to I do this? Is this possible using an ETL?

If this helps, feel free to agree, accept or awesome it!

Tagged:

Best Answer

  • Answer ✓

    You can do this in Magic ETL with the Rank & Window and Filter tiles. In the Rank & Window tile, select the Rank function and order by time_stamp descending and partition by load_id. Add a filter tile after this and filter to where rank equals 1. This will give you just one entry per loan_id and it will be the one with the latest timestamp.

    **Check out my Domo Tips & Tricks Videos

    **Make sure to <3 any users posts that helped you.
    **Please mark as accepted the ones who solved your issue.

Answers

  • Answer ✓

    You can do this in Magic ETL with the Rank & Window and Filter tiles. In the Rank & Window tile, select the Rank function and order by time_stamp descending and partition by load_id. Add a filter tile after this and filter to where rank equals 1. This will give you just one entry per loan_id and it will be the one with the latest timestamp.

    **Check out my Domo Tips & Tricks Videos

    **Make sure to <3 any users posts that helped you.
    **Please mark as accepted the ones who solved your issue.
  • Contributor

    @MarkSnodgrass ah those trainings are coming back now. much appreciated, sir!

    If this helps, feel free to agree, accept or awesome it!

Welcome!

It looks like you're new here. Members get access to exclusive content, events, rewards, and more. Sign in or register to get started.
Sign In