Results 76 comments of gaurav

Dear @jennybc, here's how I was thinking about this: 1. Specify a distance_measure_of_choice for each column, or some aggregate distance measure across columns (say net edit distance no greater than...

The more general way is to learn the weights on appropriate distance measures for each column based on training data using some supervised technique and then match the rest.
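A minimal sketch of that idea, under stated assumptions: the per-column distance here is `1 - SequenceMatcher.ratio()` from Python's `difflib` (a stand-in for whatever distance measure suits each column), the supervised technique is a small hand-rolled logistic regression, and the records, labels, and function names are all hypothetical.

```python
import math
from difflib import SequenceMatcher


def column_distances(rec_a, rec_b):
    # One distance per column: 1 - similarity ratio, so 0 means identical strings.
    return [1.0 - SequenceMatcher(None, a, b).ratio() for a, b in zip(rec_a, rec_b)]


def match_probability(dists, weights, bias):
    # Logistic model: weighted sum of column distances mapped to (0, 1).
    z = bias + sum(w * d for w, d in zip(weights, dists))
    return 1.0 / (1.0 + math.exp(-z))


def fit_weights(pairs, labels, epochs=2000, lr=0.5):
    # Learn one weight per column (plus a bias) by stochastic gradient descent.
    feats = [column_distances(a, b) for a, b in pairs]
    weights, bias = [0.0] * len(feats[0]), 0.0
    for _ in range(epochs):
        for x, y in zip(feats, labels):
            err = match_probability(x, weights, bias) - y
            weights = [w - lr * err * xi for w, xi in zip(weights, x)]
            bias -= lr * err
    return weights, bias


# Hypothetical labelled pairs of (name, city) records; 1 = same entity, 0 = different.
train_pairs = [
    (("Jon Smith", "Boston"), ("John Smith", "Boston")),
    (("Jon Smith", "Boston"), ("Mary Jones", "Denver")),
    (("Ann Lee", "Austin"), ("Anne Lee", "Austin")),
    (("Ann Lee", "Austin"), ("Bob Ray", "Miami")),
]
train_labels = [1, 0, 1, 0]

weights, bias = fit_weights(train_pairs, train_labels)

# Score unlabelled pairs with the learned weights.
p_near = match_probability(
    column_distances(("Jon Smith", "Boston"), ("Jon Smyth", "Boston")), weights, bias
)
p_far = match_probability(
    column_distances(("Ann Lee", "Austin"), ("Mary Jones", "Denver")), weights, bias
)
```

Pairs scoring above a chosen threshold (0.5, say) would be treated as matches; in practice the threshold and the distance functions would need tuning on real data.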

Dear @sfirke, as you already anticipate and indirectly state, there is no one distance measure that will work everywhere. It depends on the kinds of errors you expect, etc. Idea is...

Apologies for the delay in responding. Two things: 1. Can you share a reproducible example so that I can try debugging on my end? 2. A brief tour of Stack Overflow suggests...

Hey @jcmundy: can you send a reproducible example? Would def. look into it.

ok --- I think I have a diagnosis of the problem. We are getting the replies. There is a way to get the replies by adding in the part 'id,replies,snippet'....
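For illustration, here is a rough sketch of what a request with that `part` value could look like against the YouTube Data API v3 `commentThreads` endpoint. It only builds the URL and makes no network call; the helper name, the video ID, and the API key are placeholders, and this is not the package's actual code.

```python
from urllib.parse import urlencode, urlparse, parse_qs

# Public endpoint of the YouTube Data API v3 for comment threads.
BASE_URL = "https://www.googleapis.com/youtube/v3/commentThreads"


def build_comment_threads_url(video_id, api_key, max_results=100):
    # Hypothetical helper: assemble the query string for a commentThreads request.
    params = {
        "part": "id,replies,snippet",  # include the replies part alongside id and snippet
        "videoId": video_id,
        "maxResults": max_results,
        "key": api_key,
    }
    return BASE_URL + "?" + urlencode(params)


# Placeholder values; substitute a real video ID and API key before sending.
url = build_comment_threads_url("VIDEO_ID", "YOUR_API_KEY")
query = parse_qs(urlparse(url).query)
```

As far as I understand the API, the `replies` part may carry only a subset of a thread's replies, so fetching every reply for a thread would likely need a separate `comments` request keyed on the parent comment's ID.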

Hey @rodik, I did some due diligence around whether we are getting the latest and the oldest comment. We are. ``` a[747,] authorDisplayName authorProfileImageUrl 1208 wìld wingõ https://yt3.ggpht.com/-aaHd2yRLPtw/AAAAAAAAAAI/AAAAAAAAAAA/AA-MPoywE0Y/s28-c-k-no-mo-rj-c0xffffff/photo.jpg authorChannelUrl...

@jcmundy I can see that there are replies. ``` a

@jcmundy --- apologies for being cryptic! my point was that the function does pull in replies. your point = it isn't pulling in all the replies. investigating the vmware thing....

thanks @rodik! the function is super kludgy. will rewrite this. will finalize and release soon.