[0] target: s0:0 s1:1 s2:2 s3:3 SL:4
[1] action: 0,1,2,3,4,5,6
[2] t: Disable truncation 0, Enable truncation 1
[3] q: 固定Q算法 0, 动态Q算法 1
[3] q: fixed Q algorithm 0, dynamic Q algorithm 1
[4] startQ: 0,1,2,3.....15 注意:在固定Q算法下,Q固定为StartQ,忽略MinQ 和 MaxQ
[4] startQ: 0,1,2,3.....15 note: in fixed algorithm, Q is fixed as StartQ, neglect MinQ and MaxQ
[5] minQ: 0,1,2,3.....15
[6] maxQ:0,1,2,3......15
[7] dr:0,1
[8] coding:0,1,2,3
[9] p:0,1
[10] sel:0,1,2,3
[11] Session:0,1,2,3
[12] g:0,1
[13] linkFrequency:0,1,2,3,4,5,6,7