Trying to filter out rows, where a certain column has NULL values
Hi,
I have a dataset of almost 2 billion rows. I need to filter out rows, where a certain column has NULL values.
What is the best and fastest way to do it?
I tried creating a DataFlow using the mySQL option, but the flow has been running for more than 24 hours now.
Could any of you please help in understanding how this process can be made to run faster?
Comments
-
With large datasets, I normally have more luck (where speed is concerned) using the ETL tool.
You could accomplish what you're looking for using 3 widgets, Input Dataset into Filter Rows into Output Dataset.
On the filter rows, just select the row you want to filter and there is an option for IS NOT NULL.
That should hopefully be all you need. If not, just let me know.
Sincerely,
Valiant
**Please mark "Accept as Solution" if this post solves your problem
**Say "Thanks" by clicking the "heart" in the post that helped you.2 -
Are you needing to filter those rows out completely, or do you want to improve your filtering ability in your visualizations? Is this a text field or numeric?
If you want to keep the rows, but need the null values filled in, you can use the mapper. This works for both text and numeric fields.
If you are simply needing to remove them efficiently, @Valiant 's solution is perfect.
DataMaven
Breaking Down Silos - Building Bridges
**Say "Thanks" by clicking a reaction in the post that helped you.
**Please mark the post that solves your problem by clicking on "Accept as Solution"1
Categories
- All Categories
- 1.7K Product Ideas
- 1.7K Ideas Exchange
- 1.5K Connect
- 1.2K Connectors
- 292 Workbench
- 4 Cloud Amplifier
- 8 Federated
- 2.8K Transform
- 95 SQL DataFlows
- 600 Datasets
- 2.1K Magic ETL
- 3.7K Visualize
- 2.4K Charting
- 688 Beast Mode
- 43 App Studio
- 38 Variables
- 658 Automate
- 170 Apps
- 441 APIs & Domo Developer
- 42 Workflows
- 5 DomoAI
- 32 Predict
- 12 Jupyter Workspaces
- 20 R & Python Tiles
- 385 Distribute
- 110 Domo Everywhere
- 269 Scheduled Reports
- 6 Software Integrations
- 112 Manage
- 109 Governance & Security
- 8 Domo University
- 25 Product Releases
- Community Forums
- 39 Getting Started
- 29 Community Member Introductions
- 98 Community Announcements
- Domo Community Gallery
- 4.8K Archive