|
Description |
Value |
|
action set |
|
|
UCB parameter |
20000 |
|
opt. horizon |
5 |
|
num. of episodes |
3000 |
|
num. of particles |
300 |
|
discount fact. |
1 |
|
dyn. covariance |
|
|
obs. covariance |
|
|
IDM st. dev. |
1
|
|
time headway |
2
|
|
acc. exponent |
4 |
|
IDM min. dist. |
1
|
|
acc. reward coef. |
1 |
|
vel. reward coef. |
100 |
|
crash reward |
10000 |
|
max. lat. acc. |
0.5
|