SQLNet Aggregation Prediction uses select column from the ground truth

The below code snippet looks like using the column position for select in evaluation as well as training.

https://github.com/xiaojunxu/SQLNet/blob/5dfb96edc6f1131c3f640f31bcef3520ea0f922c/sqlnet/utils.py#L204

https://github.com/xiaojunxu/SQLNet/blob/5dfb96edc6f1131c3f640f31bcef3520ea0f922c/sqlnet/model/sqlnet.py#L131

https://github.com/xiaojunxu/SQLNet/blob/5dfb96edc6f1131c3f640f31bcef3520ea0f922c/sqlnet/model/modules/aggregator_predict.py#L41

Don't you think the model should predict the column for select instead of the column given as ground truth before aggregation prediction? In fact, the select column will not be given in prediction time when applying this to real worlds.

You made selection prediction, so the output could be fed with the aggregation prediction. In result, the evaluation number might be wrong while comparing with the original paper, seq2sql.

Jan 17 '18 23:01 ghost

https://github.com/xiaojunxu/SQLNet/issues/3

Jan 22 '18 06:01 guotong1988

@guotong1988

Here is the code snippet to replace the ground truth to the prediction. So it will work well in evaluation as well as training.

        if gt_sel is None:
            if self.trainable_emb:
                x_emb_var, x_len = self.sel_embed_layer.gen_x_batch(q, col)
                col_inp_var, col_name_len, col_len = \
                        self.sel_embed_layer.gen_col_batch(col)
                max_x_len = max(x_len)
                sel_score = self.sel_pred(x_emb_var, x_len, col_inp_var,
                        col_name_len, col_len, col_num)
            else:
                sel_score = self.sel_pred(x_emb_var, x_len, col_inp_var,
                        col_name_len, col_len, col_num)

            pred_sel = []
            for b in range(B):
                pred_sel.append(np.asscalar(np.argmax(sel_score[b].data.cpu().numpy())))

            # use the preds for aggregator
            gt_sel = pred_sel

Jan 26 '18 10:01 ghost

@deepcoord

Thanks for the code snippet. I have a question though:

If I'm not mistaken, with your fix, sel_score is computed twice:

a first time by your piece of code before predicting the aggregator
a second time by the original code after predicting the aggregator

Wouldn't it be simpler to just reverse the order in which sel_score and agg_score are computed in the forward method in sqlnet.py ? By doing so, we could compute sel_score only one time and add a shorter snippet of code just before predicting the aggregator :

if gt_sel is None:
      sel_score_np = sel_score.data.cpu().numpy()
      gt_sel = [np.asscalar(np.argmax(sel_score_np[b])) for b in range(B)]

Am I missing something?

Feb 05 '18 10:02 ThomasLecat

Thanks for recognizing the issue! Yes, this will lead to a wrong break-down result on aggregator but will not affect the overall acc_qm or acc_ex. I will fix the bug soon.

Feb 12 '18 02:02 xiaojunxu