Remove duplicates from large dataset
Comments
-
A few options you might try:
1. Depending on the end view you're after on your cards, you could leverage a distinct operation in a calculated field:
count(distinct `fieldName`)
2. Leverage either the R or Python plugins to pull down the data, run a de-duplication function, and then push the data back into Domo:
R: unique(yourDataFrame)
Python: drop_duplicates(yourDataFrame)Stack Exchange Reference:
0 -
That's great, thanks. I'll give those a try.
0
Categories
- All Categories
- 1.9K Product Ideas
- 1.9K Ideas Exchange
- 1.6K Connect
- 1.3K Connectors
- 306 Workbench
- 6 Cloud Amplifier
- 9 Federated
- 3K Transform
- 112 SQL DataFlows
- 649 Datasets
- 2.2K Magic ETL
- 4K Visualize
- 2.5K Charting
- 787 Beast Mode
- 78 App Studio
- 43 Variables
- 742 Automate
- 187 Apps
- 474 APIs & Domo Developer
- 67 Workflows
- 14 DomoAI
- 40 Predict
- 17 Jupyter Workspaces
- 23 R & Python Tiles
- 406 Distribute
- 117 Domo Everywhere
- 279 Scheduled Reports
- 10 Software Integrations
- 139 Manage
- 136 Governance & Security
- 8 Domo Community Gallery
- 44 Product Releases
- 12 Domo University
- 5.4K Community Forums
- 40 Getting Started
- 30 Community Member Introductions
- 113 Community Announcements
- 4.8K Archive