annw0922

Results 2 issues of annw0922

https://github.com/EricSteinberger/Deep-CFR/blob/master/DeepCFR/workers/la/sampling_algorithms/MultiOutcomeSampler.py Since 'aprx_imm_reg' here is computed for every action and put into the buffer without being summed up, I have no idea why 'aprx_imm_reg *= legal_action_mask / n_actions_to_smpl' is applied. I think...
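For reference, the element-wise effect of the quoted line can be sketched in plain Python. This is only an illustration of the arithmetic, assuming `legal_action_mask` is a 0/1 vector over actions and `n_actions_to_smpl` is a positive integer. The function name `scale_regrets` and the sample numbers are hypothetical and are not taken from the Deep-CFR repository.

```python
def scale_regrets(aprx_imm_reg, legal_action_mask, n_actions_to_smpl):
    """Hypothetical helper mirroring `aprx_imm_reg *= legal_action_mask / n_actions_to_smpl`.

    Assumes legal_action_mask holds 1 for legal actions and 0 for illegal ones.
    """
    if n_actions_to_smpl <= 0:
        raise ValueError("n_actions_to_smpl must be positive")
    # Illegal actions are multiplied by 0; legal ones are divided by the sample count.
    return [
        reg * mask / n_actions_to_smpl
        for reg, mask in zip(aprx_imm_reg, legal_action_mask)
    ]


# Made-up per-action regrets for three actions; the second action is illegal.
regrets = [4.0, 2.0, -6.0]
mask = [1, 0, 1]
scaled = scale_regrets(regrets, mask, n_actions_to_smpl=2)
print(scaled)
```

Under these assumptions the operation only zeroes the entries of illegal actions and rescales the remaining ones per action; it does not sum anything across actions.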

`def data_collator(features: list) -> dict: len_ids = [len(feature["input_ids"]) for feature in features] longest = max(len_ids) input_ids = [] labels_list = [] for ids_l, feature in sorted(zip(len_ids, features), key=lambda x: -x[0]):...