Calculating only the distinct values for many duplicate sessions
Hello!
I'm working on a conversation dataset that has a lot of duplicate values, because our API creates a duplicate session ID row for each part of the conversation someone reaches when they engage with our experience. I'm having difficulty calculating the average conversation duration: every duplicate row for a session ID also repeats the conversation duration value, so I can't simply take the average of that column. Is there a way to create a Beast Mode calculation that counts the session duration 'durationsec' only once per unique 'session' when averaging?
I've included an extract of my dataset that includes the duplicate session IDs and their corresponding duration metrics.
Any help would be greatly appreciated :)
Thanks!
Best Answer
-
You can use a fixed function for this:
AVG(MAX(`duration`) FIXED (BY `session`))
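This takes the maximum duration for each session ID and then averages those per-session values. Because the duration is the same on every row of a given session, taking the MAX effectively dedupes your durations. Your question names the duration column 'durationsec', so use that field name in place of `duration` if that is what it is called in your dataset.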
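To see why this differs from a plain average, here is a minimal sketch of the same logic in Python. It is not Domo code; the function name and the sample rows are made up for illustration, and it assumes each row is a (session, duration) pair.

```python
def average_duration_per_session(rows):
    """Average one duration per session, ignoring duplicate rows.

    `rows` is an iterable of (session, duration) pairs (hypothetical layout).
    """
    # Keep the maximum duration seen for each session ID,
    # mirroring MAX(`duration`) FIXED (BY `session`).
    max_by_session = {}
    for session, duration in rows:
        current = max_by_session.get(session, duration)
        max_by_session[session] = max(current, duration)

    if not max_by_session:
        raise ValueError("no rows to average")

    # Average the per-session values, mirroring the outer AVG(...).
    return sum(max_by_session.values()) / len(max_by_session)


# Made-up sample: session "a" appears 3 times, "b" once, "c" twice.
rows = [("a", 30), ("a", 30), ("a", 30), ("b", 90), ("c", 60), ("c", 60)]

naive = sum(duration for _, duration in rows) / len(rows)  # 300 / 6 = 50.0
deduped = average_duration_per_session(rows)               # 180 / 3 = 60.0
```

The naive average is pulled toward sessions with more duplicate rows, while the per-session version counts each session once.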
Answers
-
Thanks so much for the help, Grant! Your beastmode worked like a charm.