How can I efficiently identify specific columns in a large dataset using Domo’s AI/ML tools?

I’m working with a dataset that has 1308 columns, and I need to identify 5 specific columns that contain data like street addresses, credit card numbers, and property IDs. Or, at the very least, I'd like Domo to suggest 2-3 candidate columns for each of the 5. The challenge is that I'm unsure how these columns might be labeled within the data.
I’ve heard about Domo’s new AI and ML functionalities and was wondering if these tools can help me quickly sift through these columns to find the data I need. Has anyone used these features for a similar purpose? What’s the best approach to take in this situation?
Best Answer
Whew. That's a bunch of columns. Whether you do it through Domo's AI or externally, you can feed the AI a list of column names and ask it the question. It may have a limit on how many names you can give it at one time, so you may need to feed it a hundred or so at a time and narrow the answers down until you get what you need.
Let's say I go to ChatGPT or a Domo question box and enter exactly the following:
"Working with Domo, I have a dataset with the following columns/fields:
record type,financialcategoryname,batchtimestamp,projectid,plannedcost
Can you please identify the column that best represents date, and the column that best represents amount?"
It will respond with something like this:
"Certainly! Based on the column names provided:
- Column representing date: batchtimestamp. This column likely contains date and time information. It is the best candidate for representing date and time.
- Column representing amount: plannedcost. This column likely represents financial values or costs, making it the best candidate for representing an amount.
So, batchtimestamp is your date column, and plannedcost is your amount column."
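If you end up feeding the names in batches of a hundred or so, a small script can split the list and build the prompts for you. The following is only a rough sketch in plain Python: the function names `chunk_columns` and `build_prompt`, the placeholder column names, and the prompt wording are all made up for illustration, and it does not call any Domo or ChatGPT API. You would paste each generated prompt into whichever tool you are using.

```python
from typing import Iterator, List


def chunk_columns(columns: List[str], size: int = 100) -> Iterator[List[str]]:
    """Yield successive batches of at most `size` column names."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(columns), size):
        yield columns[start:start + size]


def build_prompt(columns: List[str], targets: List[str]) -> str:
    """Build one prompt for a single batch of column names (wording is illustrative)."""
    wanted = "\n".join(f"- {t}" for t in targets)
    return (
        "Working with Domo, I have a dataset with the following columns/fields:\n"
        + ",".join(columns)
        + "\nFor each kind of data listed below, please identify up to 3 columns "
        "that most likely contain it, or say 'none':\n"
        + wanted
    )


# Hypothetical stand-in for the real 1308 column names.
all_columns = [f"col_{i}" for i in range(1308)]
# Kinds of data to look for; extend this list to cover all 5 target columns.
targets = ["street address", "credit card number", "property ID"]

prompts = [build_prompt(batch, targets) for batch in chunk_columns(all_columns)]
print(len(prompts))  # 14 batches: 13 of 100 columns and one of 8
```

Once each batch has returned its candidates, you can collect them into a single short list and ask the question one more time to narrow it down to 2-3 columns per target.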