Remove duplicates from large dataset
Comments
A few options you might try:
1. Depending on what you want your cards to show, you could use a distinct count in a calculated field. This counts each value once on the card without removing any rows from the dataset:
count(distinct `fieldName`)
2. Leverage either the R or Python plugins to pull down the data, run a de-duplication function, and then push the data back into Domo:
R: unique(yourDataFrame)
Python (pandas): yourDataFrame.drop_duplicates()
Stack Exchange Reference:
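To make option 2 a bit more concrete, here is a minimal pandas sketch of the de-duplication step on its own. The sample DataFrame and its column names (`order_id`, `customer`, `amount`) are made up for illustration, and the steps that pull the data from Domo and push it back are left out because they depend on how your plugin is set up.

```python
import pandas as pd

# Hypothetical sample data standing in for a dataset pulled down from Domo.
df = pd.DataFrame({
    "order_id": [101, 102, 102, 103, 103],
    "customer": ["Ann", "Bob", "Bob", "Cy", "Cy"],
    "amount": [25.0, 40.0, 40.0, 15.0, 18.0],
})

# Drop rows that are identical across every column.
exact = df.drop_duplicates()

# Drop rows that repeat a key column, keeping the first occurrence of each key.
by_key = df.drop_duplicates(subset=["order_id"], keep="first")

# Rough equivalent of option 1: count distinct values without removing rows.
distinct_orders = df["order_id"].nunique()

print(len(df), len(exact), len(by_key), distinct_orders)
```

Which form you want depends on what counts as a duplicate in your data: `drop_duplicates()` with no arguments only removes fully identical rows, while `subset=` treats rows as duplicates whenever the listed columns match, even if other columns differ.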
That's great, thanks. I'll give those a try.