Forward Filling Using ETL / Without Python Tile
Hello, I am trying to work out the logic to forward fill values.
Below is an example of what I want to achieve:
I want to forward fill the Calculation column values, grouped by platform and code.
So, I want to populate the last non-null value instead of zero in the Calculation column. The output then becomes:
The reason for doing this is that when I take the absolute average of the Calculation column for, say, platform A and code ABC, I will get the correct answer. Currently (as in image 1) it sums Calculation and divides by the number of rows, regardless of whether a row is zero. I am aware that we can skip zero values when averaging, but I don't want to remove those records; instead I want to fill them with the last non-null value.
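To make the effect on the average concrete, here is a small sketch in plain Python. The numbers are made up for illustration (they are not taken from the images), and 0 stands for a row with no calculation value.

```python
# Hypothetical Calculation values for one platform/code group, in date order.
# A 0 marks a row where no value was calculated.
calculation = [10.0, 0.0, 0.0, 25.0, 0.0]

# Current behaviour: zeros are summed and counted, which drags the average down.
avg_with_zeros = sum(calculation) / len(calculation)

# Desired behaviour: replace each 0 with the last non-zero value seen so far.
filled = []
last_value = None
for value in calculation:
    if value != 0:
        last_value = value
    # Leading zeros (no earlier value available) are left as they are.
    filled.append(last_value if value == 0 and last_value is not None else value)

avg_forward_filled = sum(filled) / len(filled)

print(avg_with_zeros)      # 7.0
print(filled)              # [10.0, 10.0, 10.0, 25.0, 25.0]
print(avg_forward_filled)  # 16.0
```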
Thanks much in advance!
Best Answers
-
I think this would work conceptually:
In ETL, filter out all the zeroes so that only the rows with calculation values remain. Then use a window function (partitioned by platform and code, to match your grouping) to lead the next date that has a value, so each row has its original date, the calc value, and the next date with a value. Join that back to the original input on platform and code, with a condition something like date > calc_value_date and date < next_value_date. You can then use a formula tile to replace the 0s with the value brought in by the join.
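To make the steps above concrete, here is a minimal sketch of the same logic in plain Python. It is only meant to illustrate the filter / lead / join / formula sequence, not to be run in Domo. The sample rows, the column names (`platform`, `code`, `date`, `calculation`) and the helper `forward_fill` are all hypothetical. Two assumptions go beyond the description: the join also matches on platform and code, and the last value in each group is treated as open-ended so that trailing zeros get filled too.

```python
from datetime import date
from itertools import groupby

# Hypothetical sample rows; names and values are illustrative only.
rows = [
    {"platform": "A", "code": "ABC", "date": date(2023, 1, 1), "calculation": 10.0},
    {"platform": "A", "code": "ABC", "date": date(2023, 1, 2), "calculation": 0.0},
    {"platform": "A", "code": "ABC", "date": date(2023, 1, 3), "calculation": 0.0},
    {"platform": "A", "code": "ABC", "date": date(2023, 1, 4), "calculation": 25.0},
    {"platform": "A", "code": "ABC", "date": date(2023, 1, 5), "calculation": 0.0},
    {"platform": "B", "code": "XYZ", "date": date(2023, 1, 1), "calculation": 0.0},
    {"platform": "B", "code": "XYZ", "date": date(2023, 1, 2), "calculation": 7.0},
    {"platform": "B", "code": "XYZ", "date": date(2023, 1, 3), "calculation": 0.0},
]


def group_key(row):
    """Grouping used throughout: platform + code."""
    return (row["platform"], row["code"])


def forward_fill(rows):
    # Step 1 (filter): keep only the rows that carry a calculation value.
    valued = sorted(
        (r for r in rows if r["calculation"] != 0),
        key=lambda r: (group_key(r), r["date"]),
    )

    # Step 2 (window / lead): within each group, pair every value row with
    # the date of the next value row. Assumption: the last one is open-ended.
    intervals = []
    for key, grp in groupby(valued, key=group_key):
        grp = list(grp)
        for i, r in enumerate(grp):
            next_date = grp[i + 1]["date"] if i + 1 < len(grp) else date.max
            intervals.append(
                {"key": key, "start": r["date"], "end": next_date,
                 "value": r["calculation"]}
            )

    # Step 3 (join): match each original row to the interval of its group
    # where start < date < end.
    # Step 4 (formula): replace a 0 with the value brought in by the join.
    filled = []
    for r in rows:
        new_value = r["calculation"]
        if new_value == 0:
            for iv in intervals:
                if iv["key"] == group_key(r) and iv["start"] < r["date"] < iv["end"]:
                    new_value = iv["value"]
                    break
        filled.append({**r, "calculation": new_value})
    return filled


filled = forward_fill(rows)
for r in filled:
    print(r["platform"], r["code"], r["date"], r["calculation"])
```

In this sketch the first row of group B stays at 0, because there is no earlier value to carry forward; how to treat such leading zeros is a choice the thread does not address.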
0 -
I've outlined an alternative method here:
You'd need to use 0 instead of NULL when calculating your column field.
Answers
-
You'll need an ETL, since you need to generate additional rows.
1 -
Hello @trafalger, thank you for replying. I don't understand how we can fill the zeros with the last non-null values based on the platform and code grouping. I am not sure how it would keep looping.