State of the Q Network #2

Open
abhincho opened this issue Feb 15, 2020 · 0 comments

Hi,
I have a question. In your code, the state/observation is a vector of length n (where n is the number of nodes in the graph), initialized to zeros.
For example, for a 5-node graph: [0 0 0 0 0]
When an action (node) is taken in the MVC environment, we get a new state. In your code, the new state is given by self.observation[node]=1. So if action (node) 2 is selected (counting nodes from 1), the vector becomes
[0 1 0 0 0]
Why is only the selected action (node) set to 1, and not all the vertices that are reachable from it? That is, shouldn't the new state mark every vertex that has been visited after selecting the action?
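
To make the two options concrete, here is a minimal sketch of what I mean. It is not the repository's code: the class `ToyMVCEnv`, its two step methods, and the adjacency-dict graph format are hypothetical names I made up for illustration, and I am reading "reachable" as "directly adjacent". Indices are 0-based here, so index 1 is the second node.

```python
class ToyMVCEnv:
    """Hypothetical stand-in for the MVC environment (not the repository's class)."""

    def __init__(self, adjacency):
        # adjacency: dict mapping each node index to a list of neighbor indices
        self.adjacency = adjacency
        # State vector of length n, initialized to zeros
        self.observation = [0] * len(adjacency)

    def step_selected_only(self, node):
        # Update as described in this issue: mark only the chosen node.
        self.observation[node] = 1
        return list(self.observation)

    def step_with_neighbors(self, node):
        # Alternative I am asking about: also mark every adjacent node.
        self.observation[node] = 1
        for neighbor in self.adjacency[node]:
            self.observation[neighbor] = 1
        return list(self.observation)


# Path graph 0 - 1 - 2 - 3 - 4 (assumed example graph)
path = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3]}

state_a = ToyMVCEnv(path).step_selected_only(1)
state_b = ToyMVCEnv(path).step_with_neighbors(1)
print(state_a)  # [0, 1, 0, 0, 0]
print(state_b)  # [1, 1, 1, 0, 0]
```

The first result matches the vector above; the second is the state I would have expected if neighbors were marked as well.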

Is this correct or am I missing something?
