Remove duplicates from large dataset
Comments
A few options you might try:
1. Depending on the end view you're after on your cards, you could leverage a distinct operation in a calculated field:
count(distinct `fieldName`)
2. Leverage either the R or Python plugins to pull down the data, run a de-duplication function, and then push the data back into Domo:
R: unique(yourDataFrame)
Python (pandas): yourDataFrame.drop_duplicates()
Stack Exchange Reference:
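
To illustrate the Python route in option 2, here is a minimal pandas sketch. It is an illustration only, not code from the original post: the sample data and the column names (`order_id`, `customer`, `amount`) are made up, and the steps that pull the dataset from Domo and push it back are left out because they depend on how the plugin is set up.

```python
import pandas as pd

# Hypothetical sample data standing in for a dataset pulled from Domo.
df = pd.DataFrame({
    "order_id": [101, 102, 102, 103, 103],
    "customer": ["Ann", "Bob", "Bob", "Cy", "Cy"],
    "amount": [25.0, 40.0, 40.0, 15.0, 18.0],
})

# Drop rows that are identical across every column.
deduped = df.drop_duplicates()

# Drop rows that repeat a key column, keeping the first occurrence.
deduped_by_key = df.drop_duplicates(subset=["order_id"], keep="first")

# Rough pandas counterpart of a distinct count on one field (option 1).
distinct_orders = df["order_id"].nunique()
```

Whether to de-duplicate on all columns or only on a key depends on what counts as a duplicate in your data: the two `order_id` 103 rows differ in `amount`, so only the key-based call collapses them.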
That's great, thanks. I'll give those a try.