How to count strings in a column containing a comma delimited list of strings in each row
Hello, I have a field in which each row contains a comma delimited list of strings. I'm trying to find a way to display how often each string appears in the dataset. For example, the column might be "City":
Row 1 | New York,Boston,San Francisco
Row 2 | Boston
Row 3 | New York,San Diego
Row 4 | New York,Topeka
I like to create some data display that shows New York:3, Boston:2, San Francisco:1, San Diego:1, Topeka:1.
A regular bar graph doesn't recognize the comma and treats each unique string including the commas as a separate entry.
I can split the field into multiple, City1, City2, City3, etc, but I don't know of a data display that looks into multiple columns.
Any advice is appreciated, thank you.
Best Answer
-
The way I did it in SQL, it doesn't matter how many commas there are. In ETL, I'd have to create something recusive to make sure you can get any number of elements typed into a string. If you know you won't exceed the number of delimited items you show, something like this will work.
Using that flow, I then went out and had some website give me random city names which I typed into my webform.
** Was this post helpful? Click Agree or Like below. **
** Did this solve your problem? Accept it as a solution! **2
Answers
-
You should be able to do a calculated field such as
CASE whenCity
like '%New York%' then 'New York'
whenCity
like '%San Diego%' then 'San Diego'
whenCity
like '%San Francisco%' then 'San Francisco'
whenCity
like '%Boston%' then 'Boston'
whenCity
like '%Topeka%' then 'Topeka'
end
or
CASE when INSTR(City
,'New York')>0 then 'New York'
when INSTR(City
,'San Diego')>0 then 'San Diego'
when INSTR(City
,'Boston')>0 then 'Boston'
when INSTR(City
,'San Francisco')>0 then 'San Francisco'
when INSTR(City
,'Topeka')>0 then 'Topeka'
end
Then use the field in a bar chart** Was this post helpful? Click Agree or Like below. **
** Did this solve your problem? Accept it as a solution! **1 -
Thanks for that solution @ArborRose. It works well for this example.
What if there are more than five possible strings, for example, what if any city in the U.S. could appear and the data set has tens of thousands of rows? It wouldn't be reasonable to make a INSTR for every potential. Any ideas on how to make this scalable?
0 -
I'd need to ponder a bit. First instinct - try to break up the string by the delimiter, in this case a comma. Feed the pieces to a table (aka a list) and then aggregate using count.
** Was this post helpful? Click Agree or Like below. **
** Did this solve your problem? Accept it as a solution! **0 -
@tom_rotchadl@tom_rotchadl How many cities could be listed in a single row? I'd suggest using a combination of the Split Column and Dynamic Unpivot ETL tiles:
0 -
As @MichelleH mentions, in the ETL I'd be looking to use split to get the pieces from the string. I don't have a chance right now to try it in Domo, but I did a quick mock up in SQL with code shown below.
As I mentioned, I would split the strings to create a table as shown below. Then aggregate the table using count to get the source for the bar chart.** Was this post helpful? Click Agree or Like below. **
** Did this solve your problem? Accept it as a solution! **0 -
The way I did it in SQL, it doesn't matter how many commas there are. In ETL, I'd have to create something recusive to make sure you can get any number of elements typed into a string. If you know you won't exceed the number of delimited items you show, something like this will work.
Using that flow, I then went out and had some website give me random city names which I typed into my webform.
** Was this post helpful? Click Agree or Like below. **
** Did this solve your problem? Accept it as a solution! **2
Categories
- All Categories
- 1.7K Product Ideas
- 1.7K Ideas Exchange
- 1.5K Connect
- 1.2K Connectors
- 292 Workbench
- 4 Cloud Amplifier
- 8 Federated
- 2.8K Transform
- 95 SQL DataFlows
- 600 Datasets
- 2.1K Magic ETL
- 3.7K Visualize
- 2.4K Charting
- 685 Beast Mode
- 43 App Studio
- 38 Variables
- 655 Automate
- 170 Apps
- 438 APIs & Domo Developer
- 42 Workflows
- 5 DomoAI
- 32 Predict
- 12 Jupyter Workspaces
- 20 R & Python Tiles
- 383 Distribute
- 110 Domo Everywhere
- 267 Scheduled Reports
- 6 Software Integrations
- 111 Manage
- 108 Governance & Security
- 8 Domo University
- 25 Product Releases
- Community Forums
- 39 Getting Started
- 29 Community Member Introductions
- 98 Community Announcements
- Domo Community Gallery
- 4.8K Archive