Bill Zou Garner Options
The theoretical analysis demonstrates that EDIS exhibits lessened suboptimality in comparison with only making use of on the net data or immediately reusing offline details. EDIS can be a plug-in approach and can be combined with present strategies in offline-to-on the web RL environment. By employing EDIS to off-the-shelf techniques Cal-QL and IQL