Okay, if this is the order of my definitions:
FieldDefinition("linkedin_url", "string", MatchType.NULL_OR_BLANK, MatchType.EXACT),
FieldDefinition("email", "string", MatchType.NULL_OR_BLANK, MatchType.EXACT),
FieldDefinition("domain", "string", MatchType.NULL_OR_BLANK, MatchType.EXACT),
FieldDefinition("full_name", "string", MatchType.NULL_OR_BLANK, MatchType.FUZZY),
FieldDefinition("company_name", "string", MatchType.NULL_OR_BLANK, MatchType.FUZZY)
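These are the null counts per column:
+------------+---------+------+---------+
|linkedin_url|    email|domain|full_name|
+------------+---------+------+---------+
|      516770|251455239|     0|  3395582|
+------------+---------+------+---------+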
This is a single domain value that repeats about 1.5 million times:
+---------------------+-------+
|domain               |count  |
+---------------------+-------+
|linktr.ee            |1566375|
+---------------------+-------+
The same happens for full_name:
+--------------+-------+
|full_name     |count  |
+--------------+-------+
|NULL          |3395582|
|              |18892  |
|Amit Kumar    |18096  |
+--------------+-------+
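For a sense of scale, here is a rough sketch of how many pairwise comparisons each repeated value could produce. It assumes the worst case, where every row sharing a value is compared with every other row sharing it, which gives n(n-1)/2 pairs. Whether the matcher actually does this depends on how it blocks records and how NULL_OR_BLANK treats empty values, which I have not confirmed. `pair_count` and `repeated_values` are names made up for this illustration; the counts are copied from the tables above.

```python
def pair_count(n: int) -> int:
    """Number of unordered pairs among n rows that share one value."""
    return n * (n - 1) // 2


# Row counts copied from the tables above.
repeated_values = {
    "domain = linktr.ee": 1566375,
    "full_name = NULL": 3395582,
    "full_name = '' (blank)": 18892,
    "full_name = Amit Kumar": 18096,
}

# Worst-case assumption: all rows with the same value are compared pairwise.
for label, n in repeated_values.items():
    print(f"{label}: {n:,} rows -> {pair_count(n):,} possible pairs")
```

Under that assumption, the linktr.ee rows alone would account for more than a trillion candidate pairs.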
Are these creating issues?