Removing Duplicates with Condition
Hi All. My dataset has a lot of duplicate rows, so I want to use "Remove Duplicates" in ETL based on the Shipment ID.
However, some products are hazardous. When a shipment has a mix of Haz and non-Haz rows, I want the hazardous row to be the one that remains after I remove the duplicates. Is there a function, or a way to use "Rank & Window", that would give me my desired output?
SAMPLE
Shipment ID vs Hazardous Tag
1 - Non-Haz
1 - Haz
1 - Non-Haz
2 - Non-Haz
2 - Non-Haz
3 - Haz
3 - Haz
4 - Non-Haz
4 - Non-Haz
4 - Haz
DESIRED OUTPUT
1 - Haz
2 - Non-Haz
3 - Haz
4 - Haz
Logic: When at least one row for a shipment is hazardous, return one line for that shipment tagged as hazardous
Best Answer
You can take the minimum value of your Haz / Non-Haz text in your Rank & Window tile, per Shipment ID, to determine whether the shipment had any hazardous rows. Because "Haz" sorts alphabetically before "Non-Haz", the minimum is "Haz" whenever at least one row is hazardous.
Alternatively, you can use a CASE statement to return 1 if the value is hazardous and 0 if it's not, take the max of that value for each Shipment ID, and then convert it back to a text value using another Formula tile.
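To make the logic of both suggestions concrete outside of Magic ETL, here is a minimal Python sketch using the sample rows from the question. It is only an illustration of the idea, not Domo functionality; the function names `dedupe_by_min_tag` and `dedupe_by_max_flag` are made up for this example, and it assumes the tag values are exactly "Haz" and "Non-Haz".

```python
# Sample rows from the question: (shipment_id, hazardous_tag)
rows = [
    (1, "Non-Haz"), (1, "Haz"), (1, "Non-Haz"),
    (2, "Non-Haz"), (2, "Non-Haz"),
    (3, "Haz"), (3, "Haz"),
    (4, "Non-Haz"), (4, "Non-Haz"), (4, "Haz"),
]

def dedupe_by_min_tag(rows):
    """Approach 1: "Haz" sorts before "Non-Haz", so the minimum tag
    per shipment is "Haz" whenever any row is hazardous."""
    best = {}
    for shipment_id, tag in rows:
        if shipment_id not in best or tag < best[shipment_id]:
            best[shipment_id] = tag
    return sorted(best.items())

def dedupe_by_max_flag(rows):
    """Approach 2: map each tag to 1/0, take the max per shipment,
    then convert the flag back to text."""
    flag = {}
    for shipment_id, tag in rows:
        is_haz = 1 if tag == "Haz" else 0
        flag[shipment_id] = max(flag.get(shipment_id, 0), is_haz)
    return sorted(
        (shipment_id, "Haz" if f == 1 else "Non-Haz")
        for shipment_id, f in flag.items()
    )

print(dedupe_by_min_tag(rows))
print(dedupe_by_max_flag(rows))
```

Both functions return one row per shipment, matching the desired output in the question. The second approach does not depend on how the text values happen to sort, which makes it the safer choice if the tag wording ever changes.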
Answers
@GrantSmith thank you sir! It worked! I did the ranking to put HAZ on top, and when I removed duplicates, that row was the one that got selected. :)
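For anyone reading later, the rank-then-remove-duplicates flow described in this reply can be sketched roughly as below in Python. This is an illustration only and not how Magic ETL is implemented: the `product` column and the function name `rank_then_dedupe` are hypothetical, and it assumes that removing duplicates keeps the first row encountered for each Shipment ID.

```python
# Hypothetical rows; "product" is an invented extra column to show that
# the whole hazardous row survives, not just the tag.
rows = [
    {"shipment_id": 1, "tag": "Non-Haz", "product": "A"},
    {"shipment_id": 1, "tag": "Haz", "product": "B"},
    {"shipment_id": 2, "tag": "Non-Haz", "product": "C"},
    {"shipment_id": 2, "tag": "Non-Haz", "product": "D"},
    {"shipment_id": 4, "tag": "Non-Haz", "product": "E"},
    {"shipment_id": 4, "tag": "Haz", "product": "F"},
]

def rank_then_dedupe(rows):
    """Rank rows so hazardous ones come first within each shipment,
    then keep only the first row per shipment."""
    # Rank 0 for "Haz", 1 otherwise; sorted() is stable, so rows with
    # the same rank keep their original relative order.
    ranked = sorted(
        rows,
        key=lambda r: (r["shipment_id"], 0 if r["tag"] == "Haz" else 1),
    )
    seen = set()
    kept = []
    for row in ranked:
        if row["shipment_id"] not in seen:
            seen.add(row["shipment_id"])
            kept.append(row)
    return kept

for row in rank_then_dedupe(rows):
    print(row["shipment_id"], row["tag"], row["product"])
```

Unlike taking a min or max of the tag alone, this keeps the full hazardous row with its other columns, which is useful when the remaining fields matter downstream.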