Data Science - Outliers
Hi,
New to using the data science tiles in ETL.
I have a dataset with sales by category by day.
I can run this through the outliers tile for one category and it will correctly mark the outliers.
What is the best way to do this per category?
Thanks
Best Answer
-
If you're not wanting to have a tile per column you want to detect outlier on, an option is to use the Python tile.
You can create a function (or multiple) to run through columns you want to evaluate and generate everything in one tile. You can also customize how you want to define the outlier detection methodology. You can still implement the same standard deviation or mean absolute deviation as you can in the Outlier Detection tile, but also can also adapt it with more precision based on your particular data/context.
David Cunningham
** Was this post helpful? Click Agree 😀, Like 👍️, or Awesome ❤️ below **
** Did this solve your problem? Accept it as a solution! ✔️**0
Answers
-
If you're not wanting to have a tile per column you want to detect outlier on, an option is to use the Python tile.
You can create a function (or multiple) to run through columns you want to evaluate and generate everything in one tile. You can also customize how you want to define the outlier detection methodology. You can still implement the same standard deviation or mean absolute deviation as you can in the Outlier Detection tile, but also can also adapt it with more precision based on your particular data/context.
David Cunningham
** Was this post helpful? Click Agree 😀, Like 👍️, or Awesome ❤️ below **
** Did this solve your problem? Accept it as a solution! ✔️**0
Categories
- All Categories
- 1.5K Product Ideas
- 1.5K Ideas Exchange
- 1.4K Connect
- 1.1K Connectors
- 283 Workbench
- 4 Cloud Amplifier
- 4 Federated
- 2.7K Transform
- 90 SQL DataFlows
- 565 Datasets
- 2K Magic ETL
- 3.4K Visualize
- 2.3K Charting
- 593 Beast Mode
- 13 App Studio
- 28 Variables
- 588 Automate
- 143 Apps
- 417 APIs & Domo Developer
- 27 Workflows
- 1 DomoAI
- 28 Predict
- 12 Jupyter Workspaces
- 16 R & Python Tiles
- 361 Distribute
- 99 Domo Everywhere
- 260 Scheduled Reports
- 2 Software Integrations
- 96 Manage
- 93 Governance & Security
- 15 Product Releases
- Community Forums
- 37 Getting Started
- 28 Community Member Introductions
- 90 Community Announcements
- 4.8K Archive