Hal Daumé III comments

Results 10 comments of


                                            Hal Daumé III

issue: typo in chapter 6

There are two equivalent deﬁnitions of a CONCAVE function. The ﬁrst is that it’s second derivative is always non-negative. The second, more geometric, deﬁtion is that any chord of the...

I just pushed a change that implements this. https://github.com/hal3/macarico/blob/reorg/macarico/base.py#L80 for the interface and https://github.com/hal3/macarico/blob/reorg/macarico/policies/linear.py for some examples. any thoughts?

requirements?

it should only require pytorch unless i forgot something. what errors are you getting?

requirements?

I’m currently running python 3.5.4 with torch 0.4.1. If you go to the tests directory and run `python test_randomly.py` what happens? - hal (👨‍🔬 MSR-NYC ↔ 👨‍🏫 UMD ○ 🌐hal3.name...

seq2seq

for edit distance? i have one too ;). i wonder which is better

Completed issue 31: Support Reward Per time-step

shouldn't this be a tail sum?

Completed issue 31: Support Reward Per time-step

i'm not sure i agree. why should the env have to know how the RL algo works? and also not all RL algs will want tail sums

Completed issue 31: Support Reward Per time-step

ugh gamma. i think this should be an argument of the RL algorithm. there's good reason (eg Nan Jiang's work) to think you migth want to learn with a different...

Implement BanditLOLS

basic implementation is done in https://github.com/hal3/macarico/blob/master/macarico/lts/lols.py

Implement BanditLOLS

there's some super-ugliness in BanditLOLS/LinearPolicy that I'd like to get your take on (see lols.py:55,72 and __init__.py:82-85). the issue is that in order to do CS bLOLS, you need to...