How Middesk matches data

You can use Middesk’s results most effectively if you understand how Middesk matches data. Middesk’s matching is designed to return accurate results, even when submitted data contains common variations, while saving you time.

Data matching process

Middesk performs these three steps to process data:

1. Normalize

Middesk first normalizes the data you submit into a “cleaned version.” This includes removing case sensitivity, extra white space, and punctuation. Then Middesk converts that text to a canonical version (for example, Street may become ST).

3. Match

Middesk then conducts an attribute-by-attribute comparison between the submitted “cleaned version” data and the retrieved records to validate matches.

Match reporting

Depending on the attribute, Middesk returns a value indicating the match type: Success, Warning, Failure, or Alternate.

Success

The normalized submitted attribute exactly matches at least one normalized, retrieved attribute.

Attribute | Submitted | Returned
Business name | Middesk | Middesk
Business address | 85 2nd St, San Francisco, CA 94105-3459 | 85 2nd St, San Francisco, CA 94105-3459
TIN | 123456789 | 123456789
Associated people | John Roger Smith | John R Smith

Warning

The normalized submitted attribute is similar to at least one normalized, retrieved attribute.

Attribute | Submitted | Returned
Business name | Middesk Inc | Middesk LLC
Business address | 4322 N Hall St, Dallas, TX 75219-2731 | 4104 N Hall St Apt 107, Dallas, TX 75219-5627

Failure

The submitted normalized attribute does not match any normalized, retrieved attributes.

Attribute | Submitted | Returned
Business name | Middesk | Google
Business address | 4322 N Hall St, Dallas, TX 75219-2731 | 4201 Cypress Creek Pkwy Ste 540 #1197, Houston, TX 77068-3458
TIN | 123456789 | 987654321
Associated people | John Roger Smith | Keith Morgan

Alternate (TIN-only)

The submitted attribute is associated with an alternative attribute according to the IRS. Middesk returns the alternative attribute associated with the submitted TIN.

Attribute-specific matching

Each attribute type has specific normalization and matching rules.

Name matching

Normalization

Middesk normalizes business names by:

  • Removing case sensitivity
  • Converting entity suffixes to canonical versions
  • Removing extra whitespace at the beginning and end
  • Removing punctuation and extraneous characters during comparison

Submitted | Normalized
ABC Limited Liability Company | ABC LLC
ABC company llc | ABC LLC
ABC Incorporated | ABC CORP
ABC Corporation | ABC CORP
Middesk, Inc. | MIDDESK INC
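
The normalization steps above can be approximated in a few lines. This is a minimal sketch, not Middesk's implementation: the `SUFFIXES` table and the function name `normalize_business_name` are hypothetical, and the suffix mappings are taken only from the example rows above.

```python
import re

# Hypothetical suffix table, built only from the documented examples.
# Middesk's real canonicalization rules are not described in this guide.
SUFFIXES = {
    "LIMITED LIABILITY COMPANY": "LLC",
    "COMPANY LLC": "LLC",
    "INCORPORATED": "CORP",
    "CORPORATION": "CORP",
}


def normalize_business_name(name: str) -> str:
    """Uppercase, drop punctuation, collapse whitespace, canonicalize suffix."""
    cleaned = re.sub(r"[^\w\s]", "", name)       # remove punctuation
    cleaned = " ".join(cleaned.upper().split())  # ignore case, trim whitespace
    for long_form, canonical in SUFFIXES.items():
        if cleaned.endswith(" " + long_form):
            # Keep everything before the suffix, then append the short form.
            return cleaned[: -len(long_form)] + canonical
    return cleaned


print(normalize_business_name("ABC Limited Liability Company"))
print(normalize_business_name("Middesk, Inc."))
```
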

Matching rules

Success: All characters in the normalized submitted name match all characters in the normalized retrieved name.

Warning: The normalized submitted name is similar to the retrieved name. This includes:

  • Entity suffix differences (for example, “Middesk Inc” matches “Middesk LLC”)
  • Minor character differences based on name length:
Name length | Allowed character difference
Less than 10 characters | 0 (any difference results in Failure)
10 to 30 characters | 1 character
More than 30 characters | 2 characters

Failure: The normalized submitted name does not match any normalized retrieved name, or the submitted TIN is associated with a different business name.
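
To make the length thresholds concrete, the sketch below classifies a pair of already-normalized names. It rests on several assumptions that this guide does not confirm: "character difference" is measured as Levenshtein edit distance, the length is taken from the submitted name, and `ENTITY_SUFFIXES` is a hypothetical list. The TIN-based Failure condition is not modeled.

```python
ENTITY_SUFFIXES = {"LLC", "INC", "CORP"}  # hypothetical, for illustration only


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance (assumed measure of 'character difference')."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def allowed_difference(length: int) -> int:
    """Thresholds from the table: <10 -> 0, 10 to 30 -> 1, >30 -> 2."""
    if length < 10:
        return 0
    if length <= 30:
        return 1
    return 2


def strip_suffix(name: str) -> str:
    parts = name.split()
    if len(parts) > 1 and parts[-1] in ENTITY_SUFFIXES:
        parts = parts[:-1]
    return " ".join(parts)


def classify_name(submitted: str, retrieved: str) -> str:
    """Classify two normalized business names as Success, Warning, or Failure."""
    if submitted == retrieved:
        return "Success"
    if strip_suffix(submitted) == strip_suffix(retrieved):
        return "Warning"  # only the entity suffix differs
    if edit_distance(submitted, retrieved) <= allowed_difference(len(submitted)):
        return "Warning"  # minor character difference within the threshold
    return "Failure"
```

Under these assumptions, `classify_name("MIDDESK INC", "MIDDESK LLC")` yields a Warning, while a one-character difference in a seven-character name such as "MIDDESK" yields a Failure.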

Address matching

Normalization

Before matching, Middesk uses a third-party provider to normalize and geocode all addresses, converting them to a standard format.

Submitted | Normalized
85 2ND STREET SAN FRANCISCO CA | 85 2nd St, San Francisco, CA 94105-3459
Middesk Inc 85 2ND ST ste 710 san francisco ca | 85 2nd St Ste 710, San Francisco, CA 94105-3465

Matching rules

Success: The normalized submitted address exactly matches at least one normalized retrieved address.

Warning: The normalized submitted address is similar to, or approximately matches, a retrieved address. There are two scenarios:

Similar address: Addresses are more than 0.2 miles apart but share the same state and city, or share the same postal code and street.

Type | Submitted | Retrieved
Same city, different street number | 4322 N Hall St, Dallas, TX 75219-2731 | 4104 N Hall St Apt 107, Dallas, TX 75219-5627
Same postal code, different city | 3755 Redwine Rd Apt 9323, Atlanta, GA 30344-5970 | 3755 Redwine Rd Apt 9215, East Point, GA 30344-5983

Approximate address: Addresses are within 0.2 miles of each other with the same street, state, and postal code. This typically occurs due to typos or missing suite numbers.

Type | Submitted | Retrieved
Typo in street number | 21098 E Duncan St, Queen Creek, AZ 85142-4867 | 21089 E Duncan St, Queen Creek, AZ 85142-4868
Missing suite number | 4201 Cypress Creek Pkwy Ste 540 #1197, Houston, TX 77068-3458 | 4201 Cypress Creek Pkwy Ste 540, Houston, TX 77068-3458

Failure: The normalized submitted address does not match any normalized retrieved address.
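
The address rules can be read as a small decision procedure. The sketch below is one literal reading of them, not Middesk's implementation. The `Address` type, its field names, and the haversine distance are assumptions. "Same postal code" is compared on the five-digit ZIP, because the documented examples differ in their ZIP+4 extension.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Address:
    """Hypothetical shape of a normalized, geocoded address."""
    full: str    # normalized single-line form
    street: str  # street name without the house or suite number
    city: str
    state: str
    zip5: str    # five-digit ZIP, ignoring the +4 extension
    lat: float
    lon: float


def miles_between(a: Address, b: Address) -> float:
    """Great-circle distance in miles (haversine formula)."""
    lat1, lon1, lat2, lon2 = map(math.radians, (a.lat, a.lon, b.lat, b.lon))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * 3958.8 * math.asin(min(1.0, math.sqrt(h)))


def classify_address(submitted: Address, retrieved: Address) -> str:
    if submitted.full == retrieved.full:
        return "Success"
    distance = miles_between(submitted, retrieved)
    same_street = submitted.street == retrieved.street
    same_state = submitted.state == retrieved.state
    same_city = submitted.city == retrieved.city
    same_zip = submitted.zip5 == retrieved.zip5
    # Approximate: within 0.2 miles, same street, state, and postal code.
    if distance <= 0.2 and same_street and same_state and same_zip:
        return "Warning (approximate)"
    # Similar: farther apart, but same state and city, or same ZIP and street.
    if distance > 0.2 and ((same_state and same_city) or (same_zip and same_street)):
        return "Warning (similar)"
    return "Failure"
```
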

TIN matching

Normalization

Middesk normalizes TINs by:

  • Requiring exactly 9 digits
  • Removing dashes and other extraneous characters

Submitted | Normalized
12-3456789 | 123456789
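
A minimal sketch of this normalization follows. The function name `normalize_tin` is hypothetical, and raising an error for input that does not contain exactly 9 digits is an assumption about how the requirement might be enforced.

```python
import re


def normalize_tin(raw: str) -> str:
    """Strip dashes and other non-digit characters, then require 9 digits."""
    digits = re.sub(r"\D", "", raw)
    if len(digits) != 9:
        # Assumed behavior: reject anything that is not exactly 9 digits.
        raise ValueError("a TIN must contain exactly 9 digits")
    return digits


print(normalize_tin("12-3456789"))
```
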

Matching rules

Middesk uses the IRS TIN Matching program to verify whether the submitted TIN and business name match.

The IRS TIN Matching program only matches on the first four characters of the submitted business name, regardless of the actual name length.

Success: The submitted business name is associated with the submitted TIN according to the IRS.

Alternate: The submitted TIN is associated with a different business name according to the IRS. Middesk returns the alternative business name associated with the submitted TIN.

Failure: The submitted TIN is not associated with the submitted business name according to the IRS, and is not associated with any other business name.
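
The three outcomes, and the effect of the four-character rule, can be illustrated as follows. This sketch does not call the IRS TIN Matching program. `irs_name` is a hypothetical stand-in for the business name the IRS associates with the TIN (`None` if there is none), and ignoring spaces and punctuation when taking the first four characters is an assumption.

```python
from typing import Optional


def first_four(name: str) -> str:
    """First four letters or digits of a name, uppercased (assumed rule)."""
    kept = "".join(ch for ch in name.upper() if ch.isalnum())
    return kept[:4]


def classify_tin(submitted_name: str, irs_name: Optional[str]) -> str:
    """Classify a TIN check as Success, Alternate, or Failure."""
    if irs_name is None:
        return "Failure"    # TIN is not associated with any business name
    if first_four(submitted_name) == first_four(irs_name):
        return "Success"    # names agree on the first four characters
    return "Alternate"      # TIN belongs to a different business name


print(classify_tin("Middesk Inc", "Middesk Incorporated"))
print(classify_tin("Middesk", "Google"))
```

Because only four characters are compared, two different names that share a prefix (for example, two names that both begin with "Midd") are indistinguishable in this check.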

Person matching

Normalization

Middesk normalizes person names by:

  • Removing case sensitivity
  • Removing punctuation
  • Removing extra whitespace at the beginning and end
  • Removing extraneous characters during comparison

Submitted | Normalized
Kyle Mack | KYLE MACK
John+Smith | JOHN SMITH
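
A minimal sketch of person-name normalization follows. The function name is hypothetical. Replacing punctuation with a space, rather than deleting it, is an assumption inferred from the "John+Smith" example.

```python
import re


def normalize_person_name(name: str) -> str:
    """Uppercase, turn punctuation into spaces, and collapse whitespace."""
    spaced = re.sub(r"[^\w\s]|_", " ", name)  # assumed: punctuation becomes a space
    return " ".join(spaced.upper().split())


print(normalize_person_name("John+Smith"))
```
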

Matching rules

Success: The normalized submitted person name matches at least one normalized retrieved person name. Middesk accounts for common variations:

Variation type | Submitted | Retrieved
Simple typo | Keith Morgan | Kieth Morgan
Name suffix | John Smith Jr | John Smith Junior
Transposition | John Smith | Smith John
Middle name missing | John Smith | John Roger Smith
Middle initial | John Roger Smith | John R Smith
Maiden name | Jane Johnson Smith | Jane Smith

Failure: The submitted normalized person name does not match any normalized retrieved person name.
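
Several of the variations above can be handled with token comparison. The sketch below is illustrative only and covers name suffixes, transposition, missing middle or maiden names, and middle initials. It does not handle typos. The `SUFFIX_ALIASES` table and all function names are hypothetical, and inputs are assumed to be normalized already.

```python
SUFFIX_ALIASES = {"JUNIOR": "JR", "SENIOR": "SR"}  # hypothetical alias table


def tokens(name: str) -> list:
    """Split a normalized name and map suffix spellings to one form."""
    return [SUFFIX_ALIASES.get(t, t) for t in name.split()]


def tokens_agree(a: str, b: str) -> bool:
    """Equal tokens, or one token is the initial of the other."""
    return (a == b
            or (len(a) == 1 and b.startswith(a))
            or (len(b) == 1 and a.startswith(b)))


def person_names_match(submitted: str, retrieved: str) -> bool:
    s, r = tokens(submitted), tokens(retrieved)
    if sorted(s) == sorted(r):
        return True                    # exact match, suffix alias, or transposition
    shorter, longer = sorted((s, r), key=len)
    if len(shorter) < 2:
        return False
    if shorter[0] != longer[0] or shorter[-1] != longer[-1]:
        return False                   # first and last names must agree
    if len(shorter) == len(longer):
        return all(tokens_agree(x, y) for x, y in zip(shorter, longer))
    return len(shorter) == 2           # middle or maiden name missing on one side
```
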
