Fill empty columns based on values in other columns
I'm trying to figure out a way to do the following in an ETL.
Given data that looks like this, I want to propagate the email address from the first row with a specific ID to all other rows that match that ID. I need to do this in order to attribute all interactions of all sorts to a specific user's email address:
Input Example:
Desired Output:
Answers
-
This is best accomplished in a dataflow. In Magic ETL, use a Group By tile: group by ID and, for the Email field, select the first non-null value option. That tile outputs one email per ID, which you can then join back to your data on the ID field. Let me know if you have any questions!
-
Thanks @colemenwilson, I'm going to try that approach and will post back if I can't figure it out.
-
@colemenwilson - if I do the group by ID tile, won't I lose the additional rows for that same ID? The ID is in the same table as the individual interaction rows
-
No, because the Group By sits on its own branch of the dataflow. Your input dataset feeds two branches:
1. All rows of data
2. 1 unique row for each ID
Then left join the all-rows branch to the one-row-per-ID branch on ID, so every original row is kept and picks up the email for its ID. No rows of data will be lost.
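For anyone who wants to check the logic outside Magic ETL, here is a minimal sketch of the same two-branch idea in plain Python. The column names (`id`, `email`, `interaction`), the sample rows, and the helper names `first_email_per_id` and `fill_emails` are all assumptions made up for illustration; they are not from the original dataset or from Domo.

```python
# Hypothetical sample rows; column names and values are assumptions.
rows = [
    {"id": 1, "email": "a@example.com", "interaction": "signup"},
    {"id": 1, "email": None, "interaction": "click"},
    {"id": 2, "email": None, "interaction": "view"},
    {"id": 2, "email": "b@example.com", "interaction": "purchase"},
    {"id": 3, "email": None, "interaction": "view"},
]


def first_email_per_id(rows):
    """Branch 2: one entry per ID holding its first non-null email.

    Empty strings are treated the same as None here (an assumption).
    """
    lookup = {}
    for row in rows:
        if row["email"] and row["id"] not in lookup:
            lookup[row["id"]] = row["email"]
    return lookup


def fill_emails(rows):
    """Left join branch 1 (all rows) to the per-ID lookup on id.

    Every input row is kept; IDs with no email anywhere stay None.
    """
    lookup = first_email_per_id(rows)
    return [{**row, "email": lookup.get(row["id"])} for row in rows]


filled = fill_emails(rows)
for row in filled:
    print(row)
```

The aggregation only builds the lookup, so the output has exactly as many rows as the input. An ID that never has an email (ID 3 in this made-up data) simply stays empty rather than being dropped.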
-
Got it, thanks!