OfflineRL icon indicating copy to clipboard operation
OfflineRL copied to clipboard

Question about cql_loss calculation in COMBO

Open return-sleep opened this issue 2 years ago • 0 comments

When COMBO is derived from CQL, why do they calculate CQL_loss differently?

return-sleep avatar Jan 04 '24 13:01 return-sleep