LightGBM icon indicating copy to clipboard operation
LightGBM copied to clipboard

Potential problem with thresholds for bins around 0

Open jaguerrerod opened this issue 1 year ago • 2 comments

I have a dataset with features with values -4, -3, -2, -1, 0, 1, 2, 3, 4 When I check the trees fitted I see cut points for bins -3.5, -2.5, -1.5, 1.5, 2.5, 3.5 all ok But around 0 I see 1.0000000180025095e-35 and -1.0000000180025095e-35 Why not 0.5 and -0.5? These extremely small values could be considered both 0 by the machine, and feature <= -1.0000000180025095e-35 could include 0 so both -1.0000000180025095e-35 and 1.0000000180025095e-35 would be the same cut point. Why this election of cut points for bins? image

jaguerrerod avatar Aug 05 '24 08:08 jaguerrerod

@shiyu1994 can you please answer this?

jameslamb avatar Aug 05 '24 14:08 jameslamb

same question, have you already solved it?

nmslandwsnd avatar Mar 18 '25 10:03 nmslandwsnd