Skip to content

Commit 50708b4

Browse files
Update README.md
1 parent 6824b3a commit 50708b4

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

README.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -21,6 +21,7 @@ Goal is to learn how to take actions in order to maximize the reward. The object
2121

2222
where,
2323
<br/><b>Q_[s_, a_]</b> - value of the objective function on the next step,
24+
<br/><b>Q[s, a]</b> - value of the objective function on the current position,
2425
<br/><b>max(Q_[s_, a_]) – Q[s, a])</b> - choosing maximum value from the possible next steps,
2526
<br/><b>s</b> – current position of the agent,
2627
<br/><b>a</b> – current action,

0 commit comments

Comments
 (0)