
Appending several datasets in Workbench

iresanjogar

Hello Dojo Community!

I have 15 different, large datasets that I need to append into a single output dataset, which I pull into Domo using Workbench (version 4). Can I create a single Workbench job that always appends the same 15 datasets, so that a single dataset is uploaded to my Domo instance from a single job? I know I could create 15 different jobs in Workbench and append them all in Domo using Magic ETL, but ideally I would like to have only 1 job and 1 dataset in my Domo instance.

Thanks!


All comments

[Deleted User]

@iresanjogar

Thanks for your question, we are taking a closer look at it.

In the meantime have a look at the FAQ document for Workbench 4.

Regards,

AustinH

You are able to combine different datasets in a Workbench job to load as one dataset into Domo. The basic way to do this is to use UNION or UNION ALL between the select statements for each dataset. With unions, you will need to make sure the columns are aligned in your datasets or in the select statements.

Example:

SELECT
  `columnA`
  ,`columnB`
FROM dataset1
UNION ALL
SELECT
  `columnA`
  ,`columnB`
FROM dataset2
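With 15 sources, writing the statement by hand gets repetitive. Here is a minimal Python sketch that generates the same UNION ALL pattern as text, which could then be pasted wherever the SQL is entered. The function name `build_union_all` and the table names are hypothetical. It assumes every table exposes the listed columns, that the database accepts backtick-quoted column names as in the example above, and that the names come from a trusted source (nothing is escaped).

```python
def build_union_all(tables, columns):
    """Build one SELECT per table and join them with UNION ALL.

    Columns are listed explicitly and in the same order for every table,
    so the result sets line up by position.
    """
    if not tables or not columns:
        raise ValueError("need at least one table and one column")
    column_list = ", ".join(f"`{name}`" for name in columns)
    selects = [f"SELECT {column_list} FROM {table}" for table in tables]
    return "\nUNION ALL\n".join(selects)
```

For example, `build_union_all(["dataset1", "dataset2"], ["columnA", "columnB"])` produces the two-table statement shown above, with each SELECT on a single line.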

iresanjogar

Thanks for your reply!

However, could you specify where I should write that code within Workbench? Is this a customized transformation that I can add to my job? If so, could you explain the process in a bit more detail? The types of transformations allowed seem pretty limited, and I am not sure which option I should select to do this.

However, if I understood properly, I will still need to create 15 jobs in Workbench (associated to each of the 15 datasets I want to append), and then another one that would append all those datasets and load as one dataset into Domo, am I wrong?

Thanks once again!

AustinH

Can you provide more context on what type of data sources you are pulling data from? Database, Excel, CSV?

In the job settings, you will see the "Query" tab as an option in the "Data Source Property" section. Here you can input custom SQL to pull the data you want. If you are connected to a database, you can simply create one job that references the different datasets available in the database.

If you want to load 15 different files, Excel files for example, then you will need to create a separate job for each, since you are only able to browse to one local file at a time. In this case you would then combine them once they are in Domo, which I know you are wanting to avoid.

Let me know if you need additional details, and include more information about your data sources.

iresanjogar

This was really useful! Thank you very much!

The files I need to append are CSV, which means that I need to create 15 different jobs and perform the transformation in Domo. However, now I know that if I am connected to a database I will be able to do this type of operation! Thanks! It would have been great, though, if the "transport method" subsection of the "Source" information allowed you to add several files.

n8isjack-ret

@iresanjogar In Workbench you may be able to load all 15 files in a single Workbench job. The base requirements are:

1) All the files must have the same schema (column 1 in all 15 files must be the same data type, column 2 also, and so forth for all columns). Workbench will assume column positions match, not column names.
2) The files must be in the same folder (and should be alone in that folder, without any other files).

When you add the file in the settings, remove the filename so that the path points to the folder, like this:

[Screenshot: Screen Shot 2016-11-02 at 3.55.05 PM.png]
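Before pointing a job at a folder, it can help to confirm that the files really do line up. Here is a minimal Python sketch, run outside Workbench, that compares the header row of every .csv file in a folder against the first one. The function names are hypothetical, and it assumes each file has a header row and is UTF-8 encoded. It only compares column names and order, so it is a rough proxy and does not verify data types.

```python
import csv
from pathlib import Path


def read_header(path):
    """Return the first row of a CSV file as a list of column names."""
    with open(path, newline="", encoding="utf-8-sig") as handle:
        return next(csv.reader(handle), [])


def find_header_mismatches(folder):
    """Compare each .csv header in `folder` with the first file (by name).

    Returns (reference_header, mismatches), where mismatches maps a file
    name to the header that differs from the reference.
    """
    files = sorted(Path(folder).glob("*.csv"))
    if not files:
        return [], {}
    reference = read_header(files[0])
    mismatches = {}
    for path in files[1:]:
        header = read_header(path)
        if header != reference:
            mismatches[path.name] = header
    return reference, mismatches
```

An empty `mismatches` dictionary means every header matched the first file. Any file listed there is worth opening before the job runs.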

unknown

@iresanjogar, did any of the above replies help you out?

iresanjogar

Thank you @n8isjack-ret for your reply! This is exactly what I needed (cc: @LizWR)! Since my files fulfill the requirements, I can use this method without a problem. Thanks again!

[Deleted User]

@iresanjogar, can you mark the appropriate reply "accept as solution" so others can benefit from this conversation?

Thanks!

Dani

Kurbz

Hi Folks,

I am trying to use this method; however, when I remove the filename and save, the job saves with a red dot. When I run the job it fails with:

Could not find a part of the path 'E:\MyDataFolder\'.

I have run the job successfully with one selected file, but I cannot execute it successfully with no filename and just the folder path.

Am I missing something?

Thanks Dojo's!

kshah008

@Kurbz, please feel free to open a new thread for better exposure to your question.

n8isjack-ret

Can you confirm that you are using a .csv file import? This trick is only possible with .csv files.

Kurbz

Hi,

When I created a new CSV job it worked!

(Note: it did not work when I tried to change an Excel job to CSV.)

Many thanks!

Sorted.

cwalliser

I tried this approach to load 3 CSV files with different names in the same folder.

[Attachment: Domo Upload Source Folder.PNG]

The preview test failed with the error message below. I notice only the Q1 & Q2 files show in the log.

Is there a file size limit? Some other factor I'm missing? I don't see any structure differences in any of the files.

[08.28.17 03:45:48 PM] Requesting CSV source file from the Local File Provider
[08.28.17 03:45:48 PM] Loading local file provider properties
[08.28.17 03:45:48 PM] Loading CSV file: C:\Users\cwalliser\OneDrive - Gigamon Inc\DOMO\SvcMetricsFiles\
[08.28.17 03:45:48 PM] Parsing file: 'C:\Users\cwalliser\OneDrive - Gigamon Inc\DOMO\SvcMetricsFiles\ServiceMetric_2017_Q1.csv'
[08.28.17 03:45:48 PM] Parsing file: 'C:\Users\cwalliser\OneDrive - Gigamon Inc\DOMO\SvcMetricsFiles\ServiceMetric_2017_Q2.csv'

[08.28.17 03:45:49 PM] The number of fields in the record is greater than the available space from index to the end of the destination array.
Parameter name: array

cwalliser

My bad. I found a structure difference in one of the files. When I removed that file from the folder, the update worked.

A lovely thing...
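One way to hunt for this kind of structure difference is to count the fields in every row, not just the headers. This rough Python sketch is run outside Workbench, and the function name is hypothetical. It takes the width of the first non-blank row it sees as the expected width, then reports every row in any file whose field count differs. It assumes UTF-8 files and standard comma-separated quoting.

```python
import csv
from pathlib import Path


def find_ragged_rows(folder):
    """Report rows whose field count differs from the first row seen.

    Returns (expected_width, problems), where problems is a list of
    (file_name, line_number, field_count) tuples.
    """
    expected = None
    problems = []
    for path in sorted(Path(folder).glob("*.csv")):
        with open(path, newline="", encoding="utf-8-sig") as handle:
            reader = csv.reader(handle)
            for row in reader:
                if not row:
                    continue  # ignore blank lines
                if expected is None:
                    expected = len(row)  # width of the very first row
                elif len(row) != expected:
                    problems.append((path.name, reader.line_num, len(row)))
    return expected, problems
```

The line number comes from the CSV reader, so it is the physical line on which the offending record ends, which is usually close enough to find the record in a text editor.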

user09630

Hi,

I am trying to import multiple CSVs in Workbench in date-modified order (oldest first), but it seems that Workbench imports the multiple CSVs ordered by filename.

Our developer has used a GUID when generating the CSV names, so any help on how to load by date modified would be appreciated.

Thanks.

n8isjack-ret

Ok, help me understand the impact of the load order. My first thought is that the order in which they are loaded will not affect the end result in Domo.

Is it because you don't want to load the old files ever again once reloaded?

Are you loading all files every time?

Does the same record appear in multiple files?

user09630

Hi,

Let me explain further.

We are generating delta loads as CSV files, and currently the files are loaded in the following order by default:

1) ACCOUNT_380f10d4-3cd9-46a2-8196-76dcd85ac4dc.csv   --date modified as of 03/10/2016 6:08 AM (DD/MM/YYYY)
2) ACCOUNT_3f771f53-9ca4-4251-ae94-96f9284ba306.csv   --date modified as of 03/10/2016 6:02 AM
3) ACCOUNT_5caaaf2d-ca50-43ff-a56c-b1358b80ebb6.csv   --date modified as of 03/10/2016 6:00 AM
4) ACCOUNT_5caaaf2d-ca50-43ff-a56c-b1358b80ebb6_1.csv --date modified as of 03/10/2016 6:00 AM
5) ACCOUNT_5e131c62-0c7d-4d02-9d9f-8e2d5f40ae53.csv   --date modified as of 03/10/2016 6:15 AM

Since it's a delta load, we would like the files to be loaded in date-modified order, so I am expecting them to be loaded in the following order:

3) ACCOUNT_5caaaf2d-ca50-43ff-a56c-b1358b80ebb6.csv     --date modified as of 03/10/2016 6:00 AM
4) ACCOUNT_5caaaf2d-ca50-43ff-a56c-b1358b80ebb6_1.csv --date modified as of 03/10/2016 6:00 AM
2) ACCOUNT_3f771f53-9ca4-4251-ae94-96f9284ba306.csv     --date modified as of 03/10/2016 6:02 AM
1) ACCOUNT_380f10d4-3cd9-46a2-8196-76dcd85ac4dc.csv    --date modified as of 03/10/2016 6:08 AM
5) ACCOUNT_5e131c62-0c7d-4d02-9d9f-8e2d5f40ae53.csv   --date modified as of 03/10/2016 6:15 AM

To answer your questions: yes, the order in which they are loaded does affect the data, so the oldest CSV needs to be loaded first. We are loading different files in every run. Yes, the same records appear in multiple files, which are named as above using the GUID in the file name. For various reasons we are not able to name the files Account_1.csv, Account_2.csv ...

Thanks.
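If the load order cannot be controlled inside Workbench, one possible workaround is to merge the delta files into a single CSV in the required order before the job runs. This is a minimal Python sketch under stated assumptions: the function names are hypothetical, every file has the same header row, and the merged file is written outside the source folder so it is not picked up on the next run. Files with the same modification time are ordered by name.

```python
import csv
from pathlib import Path


def files_oldest_first(folder, pattern="*.csv"):
    """Return matching files sorted by modification time, then by name."""
    return sorted(
        Path(folder).glob(pattern),
        key=lambda path: (path.stat().st_mtime, path.name),
    )


def merge_oldest_first(folder, output_path):
    """Write the rows of every file into one CSV, oldest file first.

    The header is taken from the first non-empty file; the header row of
    every later file is skipped. Returns the file names in load order.
    """
    order = files_oldest_first(folder)
    header_written = False
    with open(output_path, "w", newline="", encoding="utf-8") as out:
        writer = csv.writer(out)
        for path in order:
            with open(path, newline="", encoding="utf-8-sig") as handle:
                reader = csv.reader(handle)
                header = next(reader, None)
                if header is None:
                    continue  # skip completely empty files
                if not header_written:
                    writer.writerow(header)
                    header_written = True
                writer.writerows(reader)
    return [path.name for path in order]
```

The Workbench job would then point at the single merged file. The returned list can be logged to confirm the order that was actually used.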

motard2dijon

Hi Dojo Community,

I have a follow-up question on this topic. When you upload different CSV files with the method described in this post, is there an option to add an additional column that records the name of the CSV file each row came from?

I would like to be able to distinguish between the different CSV files, because it is important for my analysis to know which CSV file each row comes from.

The other option is to upload them separately and add that column with the Workbench transformation tool, but given that all the CSV files have the same structure, I would like to use this trick, which simplifies the process a lot.

Thanks in advance!
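One option that does not depend on any Workbench feature is to add the column before upload, by merging the files with a small script. This is a minimal Python sketch: the function name and the `source_file` column name are hypothetical, and it assumes all files share the same header row and are UTF-8 encoded. The output should be written outside the source folder.

```python
import csv
from pathlib import Path


def merge_with_source_column(folder, output_path, column_name="source_file"):
    """Merge every .csv in `folder` and tag each row with its file name.

    Returns the number of data rows written (headers are not counted).
    """
    header_written = False
    rows_written = 0
    with open(output_path, "w", newline="", encoding="utf-8") as out:
        writer = csv.writer(out)
        for path in sorted(Path(folder).glob("*.csv")):
            with open(path, newline="", encoding="utf-8-sig") as handle:
                reader = csv.reader(handle)
                header = next(reader, None)
                if header is None:
                    continue  # skip completely empty files
                if not header_written:
                    writer.writerow(header + [column_name])
                    header_written = True
                for row in reader:
                    if row:  # ignore blank lines
                        writer.writerow(row + [path.name])
                        rows_written += 1
    return rows_written
```

The merged file can then be loaded by a single job, and the extra column is available in Domo for filtering or grouping by source file.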
