Text to Floating Decimal Issue
Hey there! So, I am trying to alter a text into a floating decimal but I keep getting the "failed to parse" warning. For context, I'm uploading a CSV file that already has the field I'm needing as a currency. But, when I upload into Domo it turns it into a text. I will be needing to aggregate (sum) this field so I need it into a number instead of text.
I've tried replacing the '$' as an empty string and then altering the text to floating decimal, but that didn't work. I've also tried using the text formatting tile, but that doesn't appear to be giving me what I need either since removing the decimal only results in incorrect numbers.
Does anyone know a workaround for this without having to change the CSV file every time it's uploaded?
*I provided a screenshot of how the field looks like the CSV file prior to uploading into Domo.
Best Answer
-
I would recommend trying to understand the problem a little better before fixing it in that particular way. The error message you're getting back should include an example of a failing value. Something like "Failed to parse text '$1.23' as type…", or perhaps "Failed to parse text '#VALUE!'…". If we figure out exactly what's failing to parse, we can fix the problem in a more focused way.
The expression
CAST(REGEXP_REPLACE(x, '[^0-9.]', '') AS DECIMAL)
will replace any character inx
that isn't a digit or a period with the empty string (effectively deleting it) and then convert the resulting string toDECIMAL
, which is called "Fixed Decimal" in other places. There are a few issues with this.
For one, there are other characters that may be present in valid numbers coming from Excel which will be quietly deleted, resulting in incorrect values that may appear correct. Here are some valid floating-point number representations that will be successfully but wrongly processed by this expression:
1.-5.5
will be parsed as5.5
2.
1.6E+3
will be parsed as1.63
If you know your data well, then these concerns may not apply. Perhaps there are not and never will be negative values, and you know the values are never represented in scientific notation (which, although surprising, is actually the standard way of representing floating-point numbers beyond a certain number of digits).
That said, in most cases I would strongly recommend at least modifying the regular expression from
[^0-9.]
to[^-+0-9.eE]
to cover all the valid floating point values (besides theNaN
values, but it's less likely you have those in your data).But if it were me, I'd figure out specifically why the error is occurring and fix it in a focused way. For example, if the problem is that Excel error values like
#VALUE!
are in your data, you could get rid of all of them with an expression likeCASE WHEN STR_STARTS_WITH(Ad_fees, '#') THEN NULL ELSE CAST(Ad_fees AS DOUBLE) END
(useDOUBLE
if you want a floating-point number andDECIMAL
if you want a fixed-point decimal). Or, if the problem is whitespace characters, you can useCAST(SQUASH_WHITESPACE(Ad_fees) AS DOUBLE)
. If you have dollar signs, as I'm sure you already know, you can useREPLACE(Ad_fees, '$', '')
(be careful with$
in a regex; it has a special meaning so you need to escape it.REPLACE
is always better thanREGEXP_REPLACE
if you just want to replace a single static string or character). If you can't figure out what's in your data that's causing the problem, you could doSTR_TO_CHARNAMES(Ad_fees)
and export the result and look through it for anything surprising. That will surface any weird, hard-to-spot things in your string like special whitespace characters or Unicode characters that look the same as common characters but aren't.
The advantage of taking the extra time to do it this way is that if a value that you didn't expect does show up in your data, the system will tell you about it instead of quietly dealing with it. That's a tradeoff, of course. Quietly dealing with it might be exactly what you want. But do keep in mind that if you use a simple character-whitelist regex like[^-+0-9.eE]
and new data includes values like1/5
or5.000,00
it will not warn you before simply interpreting it as15
and5.00
Randall Oveson <randall.oveson@domo.com>
0
Answers
-
@Mwatson It's possible that if there are extra characters, like commas, in your field that it is still not able to convert your text to a floating decimal. Try using this formula to remove all non-number characters except decimal points and cast to a decimal:
CAST(REGEX_REPLACE(`Ad fees`,'[^0-9.]','') as DECIMAL())
1 -
That's what it was. Thank you for thinking of that! Only difference from your formula is I had to use REGEXP_REPLACE function instead of REGEX_REPLACE. Here was my final output:
CAST(REGEXP_REPLACE(
Ad fees
,'[^0-9.]','') as DECIMAL())Thanks again!
0 -
@MichelleH There is one more similar situation I'm having issues with that you might be able to help me on. I'm trying to change this "Date" field into a Date instead of Text. It's also having issues parsing.
What's the best formula to fix that? I have attached a picture of how the Date string is.
0 -
I would recommend trying to understand the problem a little better before fixing it in that particular way. The error message you're getting back should include an example of a failing value. Something like "Failed to parse text '$1.23' as type…", or perhaps "Failed to parse text '#VALUE!'…". If we figure out exactly what's failing to parse, we can fix the problem in a more focused way.
The expression
CAST(REGEXP_REPLACE(x, '[^0-9.]', '') AS DECIMAL)
will replace any character inx
that isn't a digit or a period with the empty string (effectively deleting it) and then convert the resulting string toDECIMAL
, which is called "Fixed Decimal" in other places. There are a few issues with this.
For one, there are other characters that may be present in valid numbers coming from Excel which will be quietly deleted, resulting in incorrect values that may appear correct. Here are some valid floating-point number representations that will be successfully but wrongly processed by this expression:
1.-5.5
will be parsed as5.5
2.
1.6E+3
will be parsed as1.63
If you know your data well, then these concerns may not apply. Perhaps there are not and never will be negative values, and you know the values are never represented in scientific notation (which, although surprising, is actually the standard way of representing floating-point numbers beyond a certain number of digits).
That said, in most cases I would strongly recommend at least modifying the regular expression from
[^0-9.]
to[^-+0-9.eE]
to cover all the valid floating point values (besides theNaN
values, but it's less likely you have those in your data).But if it were me, I'd figure out specifically why the error is occurring and fix it in a focused way. For example, if the problem is that Excel error values like
#VALUE!
are in your data, you could get rid of all of them with an expression likeCASE WHEN STR_STARTS_WITH(Ad_fees, '#') THEN NULL ELSE CAST(Ad_fees AS DOUBLE) END
(useDOUBLE
if you want a floating-point number andDECIMAL
if you want a fixed-point decimal). Or, if the problem is whitespace characters, you can useCAST(SQUASH_WHITESPACE(Ad_fees) AS DOUBLE)
. If you have dollar signs, as I'm sure you already know, you can useREPLACE(Ad_fees, '$', '')
(be careful with$
in a regex; it has a special meaning so you need to escape it.REPLACE
is always better thanREGEXP_REPLACE
if you just want to replace a single static string or character). If you can't figure out what's in your data that's causing the problem, you could doSTR_TO_CHARNAMES(Ad_fees)
and export the result and look through it for anything surprising. That will surface any weird, hard-to-spot things in your string like special whitespace characters or Unicode characters that look the same as common characters but aren't.
The advantage of taking the extra time to do it this way is that if a value that you didn't expect does show up in your data, the system will tell you about it instead of quietly dealing with it. That's a tradeoff, of course. Quietly dealing with it might be exactly what you want. But do keep in mind that if you use a simple character-whitelist regex like[^-+0-9.eE]
and new data includes values like1/5
or5.000,00
it will not warn you before simply interpreting it as15
and5.00
Randall Oveson <randall.oveson@domo.com>
0
Categories
- All Categories
- 1.8K Product Ideas
- 1.8K Ideas Exchange
- 1.5K Connect
- 1.2K Connectors
- 300 Workbench
- 6 Cloud Amplifier
- 9 Federated
- 2.9K Transform
- 100 SQL DataFlows
- 622 Datasets
- 2.2K Magic ETL
- 3.9K Visualize
- 2.5K Charting
- 744 Beast Mode
- 58 App Studio
- 41 Variables
- 686 Automate
- 176 Apps
- 453 APIs & Domo Developer
- 47 Workflows
- 10 DomoAI
- 36 Predict
- 15 Jupyter Workspaces
- 21 R & Python Tiles
- 395 Distribute
- 113 Domo Everywhere
- 276 Scheduled Reports
- 6 Software Integrations
- 125 Manage
- 122 Governance & Security
- 8 Domo Community Gallery
- 38 Product Releases
- 10 Domo University
- 5.4K Community Forums
- 40 Getting Started
- 30 Community Member Introductions
- 108 Community Announcements
- 4.8K Archive