How can I mark a duplicate row instead of deleting it in Magic ETL?
I use Domo to handle metadata transformations for music tracks, and for publishing purposes it's important that I don't have any duplicate track titles, since duplicates can mess with the royalty payouts. So basically I want to mark a row as a duplicate based on the values in specific columns instead of deleting it. Ideally I'd love to add a new column titled "Has duplicates?" and set its value to "YES" if the row has duplicates and "NO" if not. That way I'll know which titles are duplicated and can change them before I submit for publishing. So I want pretty much the same functionality as the tile that deletes duplicate rows, where I can choose certain columns and it checks them for duplicates, but instead of deleting the duplicates after the first row, I just want them marked.
Best Answer
You could use the Group By tile. Choose the fields that make a row unique (for example, the track title) and include them as the group-by fields, then use a count as the aggregation. Join that result back to your original data on those same fields. Any row with a count of 2 or more has duplicates. If the raw count isn't enough, you could then add a formula with a CASE statement that sets a flag to "YES" when the count is 2 or more and "NO" otherwise.
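For anyone who wants to see the logic spelled out, here is a minimal Python sketch of the same three steps (count per group, join the count back, flag with a case expression). It is not Magic ETL itself, only an illustration of what the tiles would be doing. The sample rows, the column names `track_id`, `title` and `artist`, and the helper `flag_duplicates` are all hypothetical.

```python
from collections import Counter

# Hypothetical sample rows; column names are assumptions for illustration only.
tracks = [
    {"track_id": 1, "title": "Midnight Drive", "artist": "Ava"},
    {"track_id": 2, "title": "Sunrise", "artist": "Ava"},
    {"track_id": 3, "title": "Midnight Drive", "artist": "Ava"},
    {"track_id": 4, "title": "Midnight Drive", "artist": "Ben"},
]


def flag_duplicates(rows, key_columns):
    """Return copies of the rows with an added "Has duplicates?" column."""
    # Step 1 (Group By + count): count rows per combination of key columns.
    counts = Counter(tuple(row[col] for col in key_columns) for row in rows)

    flagged = []
    for row in rows:
        # Step 2 (join back): look up the group count for this row's key.
        group_count = counts[tuple(row[col] for col in key_columns)]
        # Step 3 (CASE formula): a count of 2 or more means the row has duplicates.
        flag = "YES" if group_count >= 2 else "NO"
        flagged.append({**row, "Has duplicates?": flag})
    return flagged


# Check for duplicates on the title alone, or on title plus artist.
flagged_by_title = flag_duplicates(tracks, ["title"])
flagged_by_title_and_artist = flag_duplicates(tracks, ["title", "artist"])
```

Note that every row in a duplicated group gets "YES", including the first one, and no rows are dropped. Which columns go into the key decides what counts as a duplicate, just like choosing the group-by fields in the tile.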
If I solved your problem, please select "yes" above
Answers
Thank you so so much! This worked like a charm!