Name matching methods and solutions?
Hello everyone,
I would like to get any methods the community has used when ingesting large datasets and matching them against data already stored in a database. We possess a large database of business records and ingest data given to us by partners.
The criteria we match on are business name and business address to determine whether or not the incoming record has a business in our system or not. The incoming data is often not clean or standardized so we are struggling to produce a good match rate.
Ficticious Example:
Incoming Record:
- Name:"Policy 123- ford motor co"
- Address: "1 Car Dr Suite 101, Detroit, MI 999991111"
Desired Business record we need to programatically match in our database:
- Name: "Ford Motor Company"
- Address: "1 Car Drive ste 101, Detroit, MI 99999-1111"
We would like any feedback from the community who face similar situations and advice on how to deal with the above types of matching situations. Kinds of feedback we are looking for:
- Overall step by step method on how you deal with matching unclean data
- How to leverage DOMO to identify common patterns to later create programatic rules to deal with them
- Any DOMO features or other third party solutions that can help with our struggle
Thank you very much for any feedback
Comments
-
Can anyone help with this request?
Thanks,
0
Categories
- All Categories
- 1.8K Product Ideas
- 1.8K Ideas Exchange
- 1.6K Connect
- 1.2K Connectors
- 300 Workbench
- 6 Cloud Amplifier
- 9 Federated
- 2.9K Transform
- 102 SQL DataFlows
- 626 Datasets
- 2.2K Magic ETL
- 3.9K Visualize
- 2.5K Charting
- 755 Beast Mode
- 61 App Studio
- 41 Variables
- 693 Automate
- 178 Apps
- 456 APIs & Domo Developer
- 49 Workflows
- 10 DomoAI
- 38 Predict
- 16 Jupyter Workspaces
- 22 R & Python Tiles
- 398 Distribute
- 115 Domo Everywhere
- 276 Scheduled Reports
- 7 Software Integrations
- 130 Manage
- 127 Governance & Security
- 8 Domo Community Gallery
- 38 Product Releases
- 11 Domo University
- 5.4K Community Forums
- 40 Getting Started
- 30 Community Member Introductions
- 110 Community Announcements
- 4.8K Archive