Remove Duplicates for Append Before it Gets to Domo


I have some data i want to bring in but unfortunately the source data is a "Keep most recent X records" not a "Keep the last X days records".
One option I know would be to just append it daily and then have a dataflow that removes the duplicates and it would give me what I'm looking for, however this would mean my original Domo dataset would become bloated (~365000 rows/yr). How might I remove duplicates before it gets to Domo?
**Make sure to like any users posts that helped you and accept the ones who solved your issue.**
Best Answer
-
Sounds to me like you'd have to do this programatically using Domo's developer APIs. You could pull the data from Domo, add and deduplicate source system data, and reupload the cleaned data back to Domo. Repeated on whatever cadence you need.
Aside from that, using a dataflow is actually a good idea, and 365k annual rows isn't a huge deal in my book. We have datasets in the tens of millions and there are customers with hundreds of millions or billions in their Domo instances. That seems like a secondary concern to me.
Aaron
MajorDomo @ Merit Medical
**Say "Thanks" by clicking the heart in the post that helped you.
**Please mark the post that solves your problem by clicking on "Accept as Solution"1
Answers
-
Sounds to me like you'd have to do this programatically using Domo's developer APIs. You could pull the data from Domo, add and deduplicate source system data, and reupload the cleaned data back to Domo. Repeated on whatever cadence you need.
Aside from that, using a dataflow is actually a good idea, and 365k annual rows isn't a huge deal in my book. We have datasets in the tens of millions and there are customers with hundreds of millions or billions in their Domo instances. That seems like a secondary concern to me.
Aaron
MajorDomo @ Merit Medical
**Say "Thanks" by clicking the heart in the post that helped you.
**Please mark the post that solves your problem by clicking on "Accept as Solution"1
Categories
- All Categories
- 2K Product Ideas
- 2K Ideas Exchange
- 1.6K Connect
- 1.3K Connectors
- 311 Workbench
- 6 Cloud Amplifier
- 9 Federated
- 3.8K Transform
- 655 Datasets
- 115 SQL DataFlows
- 2.2K Magic ETL
- 811 Beast Mode
- 3.3K Visualize
- 2.5K Charting
- 80 App Studio
- 45 Variables
- 771 Automate
- 190 Apps
- 481 APIs & Domo Developer
- 77 Workflows
- 23 Code Engine
- 36 AI and Machine Learning
- 19 AI Chat
- AI Playground
- AI Projects and Models
- 17 Jupyter Workspaces
- 410 Distribute
- 120 Domo Everywhere
- 280 Scheduled Reports
- 10 Software Integrations
- 142 Manage
- 138 Governance & Security
- 8 Domo Community Gallery
- 48 Product Releases
- 12 Domo University
- 5.4K Community Forums
- 41 Getting Started
- 31 Community Member Introductions
- 113 Community Announcements
- 4.8K Archive