How to count strings in a column containing a comma delimited list of strings in each row
Hello, I have a field in which each row contains a comma delimited list of strings. I'm trying to find a way to display how often each string appears in the dataset. For example, the column might be "City":
Row 1 | New York,Boston,San Francisco
Row 2 | Boston
Row 3 | New York,San Diego
Row 4 | New York,Topeka
I like to create some data display that shows New York:3, Boston:2, San Francisco:1, San Diego:1, Topeka:1.
A regular bar graph doesn't recognize the comma and treats each unique string including the commas as a separate entry.
I can split the field into multiple, City1, City2, City3, etc, but I don't know of a data display that looks into multiple columns.
Any advice is appreciated, thank you.
Best Answer
-
The way I did it in SQL, it doesn't matter how many commas there are. In ETL, I'd have to create something recusive to make sure you can get any number of elements typed into a string. If you know you won't exceed the number of delimited items you show, something like this will work.
Using that flow, I then went out and had some website give me random city names which I typed into my webform.
** Was this post helpful? Click Agree or Like below. **
** Did this solve your problem? Accept it as a solution! **2
Answers
-
You should be able to do a calculated field such as
CASE whenCity
like '%New York%' then 'New York'
whenCity
like '%San Diego%' then 'San Diego'
whenCity
like '%San Francisco%' then 'San Francisco'
whenCity
like '%Boston%' then 'Boston'
whenCity
like '%Topeka%' then 'Topeka'
end
or
CASE when INSTR(City
,'New York')>0 then 'New York'
when INSTR(City
,'San Diego')>0 then 'San Diego'
when INSTR(City
,'Boston')>0 then 'Boston'
when INSTR(City
,'San Francisco')>0 then 'San Francisco'
when INSTR(City
,'Topeka')>0 then 'Topeka'
end
Then use the field in a bar chart** Was this post helpful? Click Agree or Like below. **
** Did this solve your problem? Accept it as a solution! **1 -
Thanks for that solution @ArborRose. It works well for this example.
What if there are more than five possible strings, for example, what if any city in the U.S. could appear and the data set has tens of thousands of rows? It wouldn't be reasonable to make a INSTR for every potential. Any ideas on how to make this scalable?
0 -
I'd need to ponder a bit. First instinct - try to break up the string by the delimiter, in this case a comma. Feed the pieces to a table (aka a list) and then aggregate using count.
** Was this post helpful? Click Agree or Like below. **
** Did this solve your problem? Accept it as a solution! **0 -
@tom_rotchadl@tom_rotchadl How many cities could be listed in a single row? I'd suggest using a combination of the Split Column and Dynamic Unpivot ETL tiles:
0 -
As @MichelleH mentions, in the ETL I'd be looking to use split to get the pieces from the string. I don't have a chance right now to try it in Domo, but I did a quick mock up in SQL with code shown below.
As I mentioned, I would split the strings to create a table as shown below. Then aggregate the table using count to get the source for the bar chart.** Was this post helpful? Click Agree or Like below. **
** Did this solve your problem? Accept it as a solution! **0 -
The way I did it in SQL, it doesn't matter how many commas there are. In ETL, I'd have to create something recusive to make sure you can get any number of elements typed into a string. If you know you won't exceed the number of delimited items you show, something like this will work.
Using that flow, I then went out and had some website give me random city names which I typed into my webform.
** Was this post helpful? Click Agree or Like below. **
** Did this solve your problem? Accept it as a solution! **2
Categories
- All Categories
- 1.9K Product Ideas
- 1.9K Ideas Exchange
- 1.6K Connect
- 1.3K Connectors
- 302 Workbench
- 6 Cloud Amplifier
- 9 Federated
- 2.9K Transform
- 104 SQL DataFlows
- 637 Datasets
- 2.2K Magic ETL
- 3.9K Visualize
- 2.5K Charting
- 761 Beast Mode
- 65 App Studio
- 42 Variables
- 702 Automate
- 182 Apps
- 458 APIs & Domo Developer
- 52 Workflows
- 10 DomoAI
- 39 Predict
- 16 Jupyter Workspaces
- 23 R & Python Tiles
- 401 Distribute
- 116 Domo Everywhere
- 277 Scheduled Reports
- 8 Software Integrations
- 132 Manage
- 129 Governance & Security
- 8 Domo Community Gallery
- 38 Product Releases
- 12 Domo University
- 5.4K Community Forums
- 40 Getting Started
- 30 Community Member Introductions
- 111 Community Announcements
- 4.8K Archive