Train my own selection instead of zingg to pick: I want to label and train my model some specfic record handcrafted by me. Is it possible to do that?
Currently, zingg oicks the edge cases and ask to label interactively. But I want to label some specific records !
I tied zingg labeling but somehow it does not group these sample example together :
N Johnathon, 999-999-9999, abcd@gmail.com
Nathan Johathon, 777-777-7777, abcd@gmail.com
Nathen Johnathon, 999-999-9999, xyz@gmail.com
All three records belong to same group or person due to common determinstic attributes, like phone number and email id along with similar names. But zingg match them as two differt persons !
Also I tried with my own training data along with zingg's automated training and labelling twice.
I noticed two things: somehow spark ran very slowly. And the accuracy went really bad. Somehow it grouped some totally unrelated persons wrongly