Remove duplicates from large dataset
Comments
-
A few options you might try:
1. Depending on the end view you're after on your cards, you could leverage a distinct operation in a calculated field:
- count(distinct `fieldName`)
2. Leverage either the R or Python plugins to pull down the data, run a de-duplication function, and then push the data back into Domo:
- R: unique(yourDataFrame)
Python: drop_duplicates(yourDataFrame)
Stack Exchange Reference:
0 -
That's great, thanks. I'll give those a try.
0
Categories
- All Categories
- 1.8K Product Ideas
- 1.8K Ideas Exchange
- 1.6K Connect
- 1.2K Connectors
- 302 Workbench
- 6 Cloud Amplifier
- 9 Federated
- 2.9K Transform
- 103 SQL DataFlows
- 633 Datasets
- 2.2K Magic ETL
- 3.9K Visualize
- 2.5K Charting
- 760 Beast Mode
- 61 App Studio
- 41 Variables
- 699 Automate
- 181 Apps
- 457 APIs & Domo Developer
- 51 Workflows
- 10 DomoAI
- 38 Predict
- 16 Jupyter Workspaces
- 22 R & Python Tiles
- 400 Distribute
- 116 Domo Everywhere
- 277 Scheduled Reports
- 7 Software Integrations
- 130 Manage
- 127 Governance & Security
- 8 Domo Community Gallery
- 38 Product Releases
- 12 Domo University
- 5.4K Community Forums
- 40 Getting Started
- 30 Community Member Introductions
- 111 Community Announcements
- 4.8K Archive