How can I efficiently identify specific columns in a large dataset using Domo’s AI/ML tools?
I’m working with a dataset that has 1308 columns, and I need to identify 5 specific columns that contain data like street addresses, credit card numbers, and property IDs. At the very least, I'd like Domo to suggest 2-3 candidate columns for each of the 5. The challenge is that I'm unsure how these columns might be labeled within the data.
I’ve heard about Domo’s new AI and ML functionalities and was wondering if these tools can help me quickly sift through these columns to find the data I need. Has anyone used these features for a similar purpose? What’s the best approach to take in this situation?
Best Answer
Whew. That's a bunch of columns. Whether you do it through Domo's AI or externally, you can feed the AI a list of column names and ask it the question. It may have a limit on how many you can give it at one time, so you may need to feed it a hundred or so at a time and narrow the answers down until you get what you need.
Let's say I go to ChatGPT or a Domo question box and enter exactly the following:
"Working with Domo, I have a dataset with the following columns/fields:
record type,financialcategoryname,batchtimestamp,projectid,plannedcost
Can you please identify the column that best represents date, and the column that best represents amount?"
It will respond with something like this:
"Certainly! Based on the column names provided:
- Column representing date: batchtimestamp. This column likely contains date and time information. It is the best candidate for representing date and time.
- Column representing amount: plannedcost. This column likely represents financial values or costs, making it the best candidate for representing an amount.
So, batchtimestamp is your date column, and plannedcost is your amount column."
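If pasting 1308 names by hand sounds painful, the "hundred or so at a time" idea can be scripted. Below is a rough Python sketch, not anything built into Domo: the function names (batch_columns, build_prompt, build_prompts), the batch size of 100, and the placeholder column names are all assumptions for illustration. It only builds the prompt text in the same style as the example question above; you would still paste each prompt into ChatGPT or a Domo question box yourself.

```python
from typing import List, Sequence


def batch_columns(columns: Sequence[str], batch_size: int = 100) -> List[List[str]]:
    """Split the column names into consecutive batches of at most batch_size."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    return [list(columns[i:i + batch_size]) for i in range(0, len(columns), batch_size)]


def build_prompt(columns: Sequence[str], targets: Sequence[str]) -> str:
    """Build one prompt for a single batch of column names."""
    return (
        "Working with Domo, I have a dataset with the following columns/fields:\n"
        + ",".join(columns)
        + "\nFor each of these kinds of data, please list the 2-3 columns that "
        + "best represent it, or say 'none': "
        + ", ".join(targets)
    )


def build_prompts(columns: Sequence[str], targets: Sequence[str],
                  batch_size: int = 100) -> List[str]:
    """Return one prompt per batch of columns."""
    return [build_prompt(batch, targets) for batch in batch_columns(columns, batch_size)]


if __name__ == "__main__":
    # Placeholder names; in practice, use the real column list exported from the dataset.
    all_columns = [f"col_{i}" for i in range(1308)]
    wanted = ["street address", "credit card number", "property ID"]
    prompts = build_prompts(all_columns, wanted, batch_size=100)
    print(len(prompts))  # 14 prompts: 13 full batches of 100 plus one batch of 8
    print(prompts[-1])
```

Once each batch comes back with its candidates, you can run the same prompt one more time on just the shortlisted names to narrow things down to the final 2-3 per data type.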