Reloading Same File

I have a dataset with two columns defined as upsertKey. I can upload a dataset successfully the first time using the -a flag to append, and I also use the -h flag because my file contains a header. I am using the upload-dataset command to upload a single file with the -f flag. Now I am trying to upload that same file again to test that NO duplicate records will be added to my dataset, which is why I set upsertKey to true on those columns. Below is the error I get when I try to do this. Is there a way around it? I don't want duplicates getting into my datasets, so I want records to be rejected IF they already exist in the dataset. Thanks!

The Domo command-line interface (CLI) is used to manage Domo users, cards, pages, datasets, and dataflows. The utility can upload RFC 4180-compliant CSV file(s), or a directory of RFC 4180-compliant CSV file(s), compressed or uncompressed. If you are uploading the contents of a directory, each file is uploaded as a "part". Each part (file) should be ~100MB. Large files will be split into parts automatically before uploading to Domo and will require sufficient disk space in the installed directory to stage the data. To improve upload times, this utility DOES NOT do data validation. Use 'help' to see available commands. Use 'help [command name]' to see how to use a specific command.
You are currently running the version: 3.30.18
Newer Version Available: 3.31.24
Download Here: https://app.domo.com/labs/java-sdk/latest/domoUtil.jar
Research Technology (userid: 1239360699)
Executing POST: https://nbcuni.domo.com/api/data/v3/datasources/5d3ea633-85fa-404f-b9df-695fa3b54075/uploads
2022-10-19 20:05:46.523 - Starting upload for dataset 5d3ea633-85fa-404f-b9df-695fa3b54075
Started file part 1 upload for dataset 5d3ea633-85fa-404f-b9df-695fa3b54075
Starting upload for part 1. Attempt 1.
Data upload warning for dataset 5d3ea633-85fa-404f-b9df-695fa3b54075: /api/data/v3/datasources/5d3ea633-85fa-404f-b9df-695fa3b54075/uploads/11/parts/1
Finished file part 1 upload for dataset 5d3ea633-85fa-404f-b9df-695fa3b54075
2022-10-19 20:05:46.742 - Finished upload for dataset 5d3ea633-85fa-404f-b9df-695fa3b54075
Total bytes sent 186
Executing PUT: https://nbcuni.domo.com/api/data/v3/datasources/5d3ea633-85fa-404f-b9df-695fa3b54075/uploads/11/commit
Executing POST: https://nbcuni.domo.com/api/data/v3/datasources/5d3ea633-85fa-404f-b9df-695fa3b54075/indexes

{ "requested": true, "requestedOn": 1666209947052, "requestKey": "20221019200547.059" }

2022-10-19 20:05:47.099 - Started indexing for dataset 5d3ea633-85fa-404f-b9df-695fa3b54075
Current indexing progress for dataset 5d3ea633-85fa-404f-b9df-695fa3b54075 - Status: STARTING, 0
Current indexing progress for dataset 5d3ea633-85fa-404f-b9df-695fa3b54075 - Status: ERROR, 1
Index progress error for dataset 5d3ea633-85fa-404f-b9df-695fa3b54075. Indexing terminated with status: ERROR
Index progress error for dataset 5d3ea633-85fa-404f-b9df-695fa3b54075. RequestFailedException{failureType=INDEXING_ERROR, failureBody='Indexing failed: com.domo.ice.engine.impl.tundra.TundraException: [Tundra](3020) Empty primary key at row "9" column "Attributed Second"'}
Failed to upload: Indexing failed: com.domo.ice.engine.impl.tundra.TundraException: [Tundra](3020) Empty primary key at row "9" column "Attributed Second"
An error occurred while uploading the data from file
Disconnected

Answers

  • Did you verify that row "9" of your data has data for the primary key?


  • I did. I do notice these messages in the Domo UI. What do these mean? It almost sounds as if it was merely a Domo processing error and nothing on my end.
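
    For anyone who wants to double-check the file itself before re-uploading: the error above complains about an empty primary key in column "Attributed Second", so one option is to scan the CSV for blank values in the upsert key columns. Below is a minimal sketch in Python, assuming the file has a header row. The function name find_empty_keys, the second key column "Other Key", and the sample data are made up for illustration; only "Attributed Second" comes from the error message. The log does not say whether Domo's row "9" counts the header or starts at zero, so the script reports 1-based data rows and it may be worth looking at the neighbouring rows too.

```python
import csv
import io


def find_empty_keys(csv_file, key_columns):
    """Return (data_row, column) pairs where a key column is blank.

    data_row is 1-based and does not count the header line.
    """
    reader = csv.DictReader(csv_file)
    missing = [c for c in key_columns if c not in (reader.fieldnames or [])]
    if missing:
        raise ValueError(f"key columns not found in header: {missing}")

    problems = []
    for row_number, row in enumerate(reader, start=1):
        for column in key_columns:
            value = row.get(column)
            # None means the row was too short; whitespace-only counts as empty.
            if value is None or value.strip() == "":
                problems.append((row_number, column))
    return problems


# Hypothetical sample data; for a real file use open(path, newline="").
sample = io.StringIO(
    "Attributed Second,Other Key,Value\n"
    "10,a,1\n"
    ",b,2\n"
    "12,  ,3\n"
)

for row_number, column in find_empty_keys(sample, ["Attributed Second", "Other Key"]):
    print(f"data row {row_number}: empty value in key column '{column}'")
```

    If this reports nothing for the file, the blank key is more likely something on the Domo side (for example, how the column is being parsed) than in the CSV itself, though that is only a guess from the log.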