Keeping only the latest date for unique ID
Hi Guys,
I have a question related to ETL transformation. I have a dataset including unique ID's and dates assigned to it. In many cases, there is multiple dates associated with the same unique ID. In my case I want to keep only the latest date associated with each ID. For example I have ID 2 with a date 10.18.2023. If my dataset updates in 2 days and I get the a new entry with ID 2 and date 10.20.2023, I want to have the entry with 10.18.2023 removed and only keep 10.20.2023 for the ID 2. Is it just remove duplicates? I'm not sure how the data would refresh when a new data entry appears in the dataset.
Thoughts?
Thanks
Best Answers
-
You would use a rank & window tile in Magic ETL. You would rank your data on date in descending order partitioned by the ID field. You would then use a filter tile to only keep rows where rank = 1.
If you get stuck let me know!
If I solved your problem, please select "yes" above
3 -
Another option is to use the Group By tile and choose Max for the aggregation type on the date field.
**Check out my Domo Tips & Tricks Videos
**Make sure to any users posts that helped you.
**Please mark as accepted the ones who solved your issue.3
Answers
-
You would use a rank & window tile in Magic ETL. You would rank your data on date in descending order partitioned by the ID field. You would then use a filter tile to only keep rows where rank = 1.
If you get stuck let me know!
If I solved your problem, please select "yes" above
3 -
Another option is to use the Group By tile and choose Max for the aggregation type on the date field.
**Check out my Domo Tips & Tricks Videos
**Make sure to any users posts that helped you.
**Please mark as accepted the ones who solved your issue.3
Categories
- All Categories
- 1.5K Product Ideas
- 1.5K Ideas Exchange
- 1.4K Connect
- 1.1K Connectors
- 278 Workbench
- 4 Cloud Amplifier
- 4 Federated
- 2.7K Transform
- 89 SQL DataFlows
- 559 Datasets
- 2K Magic ETL
- 3.3K Visualize
- 2.3K Charting
- 575 Beast Mode
- 12 App Studio
- 28 Variables
- 582 Automate
- 141 Apps
- 414 APIs & Domo Developer
- 26 Workflows
- 1 DomoAI
- 28 Predict
- 12 Jupyter Workspaces
- 16 R & Python Tiles
- 356 Distribute
- 95 Domo Everywhere
- 259 Scheduled Reports
- 2 Software Integrations
- 92 Manage
- 89 Governance & Security
- 9 Product Release Questions
- Community Forums
- 42 Getting Started
- 28 Community Member Introductions
- 89 Community Announcements
- 4.8K Archive