Name matching methods and solutions?
Hello everyone,
I would like to get any methods the community has used when ingesting large datasets and matching them against data already stored in a database. We possess a large database of business records and ingest data given to us by partners.
The criteria we match on are business name and business address to determine whether or not the incoming record has a business in our system or not. The incoming data is often not clean or standardized so we are struggling to produce a good match rate.
Ficticious Example:
Incoming Record:
- Name:"Policy 123- ford motor co"
- Address: "1 Car Dr Suite 101, Detroit, MI 999991111"
Desired Business record we need to programatically match in our database:
- Name: "Ford Motor Company"
- Address: "1 Car Drive ste 101, Detroit, MI 99999-1111"
We would like any feedback from the community who face similar situations and advice on how to deal with the above types of matching situations. Kinds of feedback we are looking for:
- Overall step by step method on how you deal with matching unclean data
- How to leverage DOMO to identify common patterns to later create programatic rules to deal with them
- Any DOMO features or other third party solutions that can help with our struggle
Thank you very much for any feedback
Comments
-
Can anyone help with this request?
Thanks,
0
Categories
- All Categories
- 1.8K Product Ideas
- 1.8K Ideas Exchange
- 1.5K Connect
- 1.2K Connectors
- 300 Workbench
- 6 Cloud Amplifier
- 8 Federated
- 2.9K Transform
- 100 SQL DataFlows
- 616 Datasets
- 2.2K Magic ETL
- 3.9K Visualize
- 2.5K Charting
- 738 Beast Mode
- 57 App Studio
- 40 Variables
- 685 Automate
- 176 Apps
- 452 APIs & Domo Developer
- 47 Workflows
- 10 DomoAI
- 36 Predict
- 15 Jupyter Workspaces
- 21 R & Python Tiles
- 394 Distribute
- 113 Domo Everywhere
- 275 Scheduled Reports
- 6 Software Integrations
- 124 Manage
- 121 Governance & Security
- 8 Domo Community Gallery
- 38 Product Releases
- 10 Domo University
- 5.4K Community Forums
- 40 Getting Started
- 30 Community Member Introductions
- 108 Community Announcements
- 4.8K Archive