Magic ETL - Using Group by - run time impact question
I'm wondering what the best practice is for reducing the overall run time of an ETL dataflow.
I've got 5 input datasets totalling about 14M rows of data that are ultimately appended together in the final result.
Is it better to use a Group By in Magic ETL for each dataset (5 steps), or one consolidated Group By step after the appends?
Also, are there certain items in Magic ETL that are run time bandits?
Comments
Is anyone able to help out with this request?
Hello @swagner,
It is hard to determine what might be causing long run times in an ETL without looking at the specific ETL.
Generally, the most efficient first step is to use a Select Columns tile and keep only the columns you need.
Next would be to filter your data down to only the rows you need.
After reducing your data to only the columns and rows that you need, grouping your data will help reduce the size further.
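To make that ordering concrete, here is a rough sketch in plain Python rather than Magic ETL tiles. The rows and the column names (`region`, `year`, `sales`, `notes`) are made up for illustration and do not come from the original dataflow. It only shows the sequence: select columns, then filter rows, then group.

```python
from collections import defaultdict

# Hypothetical sample rows; the column names are illustrative only.
rows = [
    {"region": "West", "year": 2023, "sales": 100, "notes": "a"},
    {"region": "West", "year": 2023, "sales": 50, "notes": "b"},
    {"region": "East", "year": 2022, "sales": 70, "notes": "c"},
    {"region": "East", "year": 2023, "sales": 30, "notes": "d"},
]

# 1. Select columns: drop everything the output does not need.
selected = [
    {"region": r["region"], "year": r["year"], "sales": r["sales"]}
    for r in rows
]

# 2. Filter rows: keep only the rows of interest.
filtered = [r for r in selected if r["year"] == 2023]

# 3. Group by: aggregate the already-reduced data.
totals = defaultdict(int)
for r in filtered:
    totals[r["region"]] += r["sales"]

result = dict(totals)
print(result)
```

Each step hands less data to the next one, which is the point of doing the grouping last.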
Regarding your Group By question: normally it will not make a difference whether you do the group bys before the append or after it. If you can provide a screenshot of your ETL, I can look to see if there are any other steps that we might be able to optimize.
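As a small illustration of the two layouts being compared, the sketch below uses plain Python with invented data (`dataset_a`, `dataset_b`, and the helper `group_sum` are hypothetical names, not Magic ETL features). It checks only that both layouts give the same totals; it says nothing about how long either one takes in Domo.

```python
from collections import Counter

def group_sum(rows):
    """Sum the value of each (key, value) pair per key."""
    totals = Counter()
    for key, value in rows:
        totals[key] += value
    return dict(totals)

# Hypothetical inputs standing in for two of the five datasets.
dataset_a = [("West", 100), ("East", 70), ("West", 50)]
dataset_b = [("West", 25), ("East", 5)]

# Layout 1: append first, then one consolidated group by.
after_append = group_sum(dataset_a + dataset_b)

# Layout 2: group each dataset, append the partial results,
# then group once more because the same key can appear in both inputs.
partials = list(group_sum(dataset_a).items()) + list(group_sum(dataset_b).items())
before_append = group_sum(partials)

print(after_append == before_append)
```

Note that grouping before the append still needs a final group by afterwards whenever the inputs share keys, and that the equivalence holds for aggregates such as sums, counts, minimums and maximums. Averages and distinct counts cannot be recombined this way from per-dataset results.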