How can I get the name of a product from the "Notes" field?

sangelov
sangelov Member
edited May 29 in AI Projects and Models

We have a bunch of missing product names, and I want to analyze the notes column and try to get the name from there.

I can create a dataset with all unique product names and start searching in the notes, and if there is a match, populate it in the product column.

Answers

  • The obvious way is to use AI. Review some of the recent forum AI Academy recording for examples.

    You could try something like stripping punctuation, [clean notes]:

    LOWER(REGEXP_REPLACE(`notes`, '[^a-zA-Z0-9 ]', ''))

    and then use a beast mode to scan for keywords

    CASE
    WHEN `clean_notes` LIKE '%widget pro%' THEN 'Widget Pro'
    WHEN `clean_notes` LIKE '%turbo blender%' THEN 'Turbo Blender'
    WHEN `clean_notes` LIKE '%micro charger%' THEN 'Micro Charger'
    ELSE NULL
    END

    I have a process where I reclassify or standardize names. One example in particular is insurance company names. We get so many records with different abbreviations or spellings for an insurance company that I need to reclassify them to a single name. For that I do something like this:

    WHEN `insurance_company` in ('name1','Name1','name_1') THEN 'Name - 1' 
    

    ** Was this post helpful? Click Agree or Like below. **
    ** Did this solve your problem? Accept it as a solution! **

  • I like ArborRose's regex for cleaning the notes, but then the second part seems too manual, especially if you have a large number of company names.

    If you have a list of clean names, you could join with an expression to see if those clean names are like the clean notes.

    WHEN clean_notes LIKE concat('%', clean_names, '%') then clean_names

    Please 💡/💖/👍/😊 this post if you read it and found it helpful.

    Please accept the answer if it solved your problem.