Tim D.

Commented on Error in trainMatch Step: rawPredictionCol Vectors... · Posted in Help Zingg

The end goal is to match only some 300-400 unmatched 'new' entities (primarily on just 1-2 columns of data, like Name + State) to a 'mastered' list of 12.5k entities. However, there are nearly 400k variations that already match to those 12.5k entities. Can you guide me on how to structure this data and process so that Zingg can tackle this use case?
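To make the data layout in this question concrete, here is a minimal sketch in plain Python. It does not use Zingg's API; it swaps in a simple blocking-by-state plus `difflib` string similarity to illustrate one way the three pieces could relate: the known variations act as a reference set carrying a mastered ID, and each new entity is compared only against variations in the same state. The column names (`master_id`, `name`, `state`), the sample rows, and the 0.85 threshold are all hypothetical.

```python
from collections import defaultdict
from difflib import SequenceMatcher

# Hypothetical reference rows: known name variations, each already
# linked to a mastered entity through master_id.
reference = [
    {"master_id": "M1", "name": "Acme Corporation", "state": "TX"},
    {"master_id": "M1", "name": "ACME Corp", "state": "TX"},
    {"master_id": "M2", "name": "Globex LLC", "state": "CA"},
]

# Hypothetical unmatched 'new' entities with only Name + State.
new_entities = [
    {"name": "Acme Corp.", "state": "TX"},
    {"name": "Initech", "state": "CA"},
]


def build_blocks(rows):
    """Group reference rows by state so each new entity is compared
    only against candidates in the same state."""
    blocks = defaultdict(list)
    for row in rows:
        blocks[row["state"]].append(row)
    return blocks


def best_match(entity, blocks, threshold=0.85):
    """Return (master_id, score) for the most similar variation in the
    entity's state, or (None, score) if nothing reaches the threshold."""
    best_id, best_score = None, 0.0
    for candidate in blocks.get(entity["state"], []):
        score = SequenceMatcher(
            None, entity["name"].lower(), candidate["name"].lower()
        ).ratio()
        if score > best_score:
            best_id, best_score = candidate["master_id"], score
    if best_score >= threshold:
        return best_id, best_score
    return None, best_score


blocks = build_blocks(reference)
results = [best_match(entity, blocks) for entity in new_entities]
```

The point of the sketch is the shape of the data, not the matcher: the 400k variations keep their link to the 12.5k mastered entities, so a match against any variation resolves to a mastered ID.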

Posted in Help Zingg

Tim D.

Running Databricks Sample with 400k Records from Custom Dataset Instead of Test Data

This happens when I run through the Databricks sample using some of my own data instead of the test data. The dataset is not heavy on columns, but it has 400k records.

2 Comments

Posted in Help Zingg

Tim D.

Error in trainMatch Step: rawPredictionCol Vectors Must Have Length 2 but Got 1

During the trainMatch step I get this error: zingg.common.client.ZinggClientException: Exception thrown in awaitResult: requirement failed: rawPredictionCol vectors must have length=2, but got 1

3 Comments
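The "rawPredictionCol vectors must have length=2" requirement comes from Spark ML's binary classification evaluation, which expects a two-element raw prediction vector (one score per class). One plausible cause, which is an assumption and not confirmed in this thread, is that the labelled training pairs contain only one class (all matches or all non-matches), so the trained model knows a single class. The sketch below is a quick sanity check on label balance in plain Python. It assumes the labels are available as a list of integer flags where 0 = non-match, 1 = match, and 2 = unsure; that encoding and the helper names are assumptions for illustration.

```python
from collections import Counter


def label_summary(flags):
    """Count how many labelled pairs fall into each class.
    Assumed encoding: 0 = non-match, 1 = match, 2 = unsure."""
    counts = Counter(flags)
    return {
        "non_match": counts.get(0, 0),
        "match": counts.get(1, 0),
        "unsure": counts.get(2, 0),
    }


def has_both_classes(flags):
    """True only if there is at least one match and one non-match,
    which a binary classifier needs to learn two classes."""
    summary = label_summary(flags)
    return summary["match"] > 0 and summary["non_match"] > 0


# Hypothetical labels: only matches and unsure pairs, no non-matches.
sample_flags = [1, 1, 2, 1]
summary = label_summary(sample_flags)
balanced = has_both_classes(sample_flags)
```

If the check returns False for the real labels, labelling more pairs until both matches and non-matches are present would be a reasonable first thing to try before rerunning trainMatch.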