
mqmix hypernet b2

Open zcyyyyyyyyyyy opened this issue 2 years ago • 0 comments

In the mqmix mixer, hyper_b2 is defined as:

self.hyper_b2 = nn.Sequential(
    init_(nn.Linear(self.cent_obs_dim, self.hypernet_hidden_dim)),
    nn.ReLU(),
    init_(nn.Linear(self.hypernet_hidden_dim, 1))
).to(self.device)

Should the hidden layer use self.mixer_hidden_dim instead, like this?

self.hyper_b2 = nn.Sequential(
    init_(nn.Linear(self.cent_obs_dim, self.mixer_hidden_dim)),
    nn.ReLU(),
    init_(nn.Linear(self.mixer_hidden_dim, 1))
).to(self.device)
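For anyone comparing the two variants: both map the centralized observation to a single scalar bias, so either one is shape-compatible with the rest of the mixer. The only difference is the hidden width and therefore the parameter count. Below is a rough sketch in plain Python that makes this concrete. The dimension values (48, 64, 32) and the helper names are made up for illustration and are not taken from the repository or its configs.

```python
# Sketch: compare the two hyper_b2 variants by layer shapes and parameter count.
# All dimension values below are hypothetical, chosen only for illustration.

def mlp_layer_shapes(in_dim, hidden_dim, out_dim=1):
    """(in_features, out_features) for Linear -> ReLU -> Linear."""
    return [(in_dim, hidden_dim), (hidden_dim, out_dim)]


def mlp_param_count(in_dim, hidden_dim, out_dim=1):
    """Weights plus biases of both Linear layers (ReLU has no parameters)."""
    return sum(i * o + o for i, o in mlp_layer_shapes(in_dim, hidden_dim, out_dim))


cent_obs_dim = 48          # hypothetical
hypernet_hidden_dim = 64   # hypothetical
mixer_hidden_dim = 32      # hypothetical

current = mlp_layer_shapes(cent_obs_dim, hypernet_hidden_dim)
proposed = mlp_layer_shapes(cent_obs_dim, mixer_hidden_dim)

# Both variants end in a single output unit, i.e. one scalar bias per sample.
print("current :", current, mlp_param_count(cent_obs_dim, hypernet_hidden_dim))
print("proposed:", proposed, mlp_param_count(cent_obs_dim, mixer_hidden_dim))
```

If the two config values happen to be equal, the variants are identical; otherwise the choice changes only the capacity of the bias network, not the shape of what it outputs.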

zcyyyyyyyyyyy · Dec 14 '23 09:12