is the continuous_a3c code valid? #20
Hi,
No, the algorithm provided in this repo does not work out of the box. You can try combining MACAD-Gym with MARLlib (or RLlib); we have successfully trained cooperative agents with MAPPO.
Okay. Thanks.
Hi, @Morphlng
I've tried continuous_A3C.py, and there are some problems.
1. Incorrect dictionary update
macad-agents/src/macad_agents/a3c/continuous_A3C.py
Lines 30 to 33 in b2726b3
Using dict.update to update a dictionary replaces pre-existing keys wholesale. Here the "fixed_delta_seconds" key is lost, and as a result macad-gym can't initialize.
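The failure mode can be sketched as follows. The "fixed_delta_seconds" key comes from the issue; the surrounding config structure and the "render" key are made up for illustration, not the repo's exact defaults:

```python
# Illustrative configs: "env" sub-dict mimics a MACAD-Gym env config.
default_config = {"env": {"fixed_delta_seconds": 0.05, "render": False}}
user_config = {"env": {"render": True}}

# dict.update replaces the whole "env" value, dropping its other keys:
broken = dict(default_config)
broken.update(user_config)
assert "fixed_delta_seconds" not in broken["env"]

def deep_update(base, overrides):
    """Recursively merge overrides into base, keeping untouched keys."""
    for key, value in overrides.items():
        if isinstance(value, dict) and isinstance(base.get(key), dict):
            deep_update(base[key], value)
        else:
            base[key] = value
    return base

# A recursive merge keeps the pre-existing keys intact:
fixed = deep_update({"env": dict(default_config["env"])}, user_config)
assert fixed["env"] == {"fixed_delta_seconds": 0.05, "render": True}
```

A recursive merge (or merging each nested section explicitly) preserves defaults that the override dict doesn't mention.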
2. Serialization Problem
macad-agents/src/macad_agents/a3c/continuous_A3C.py
Lines 145 to 154 in b2726b3
Putting the environment inside Net is not a good idea: when mp.Process serializes this object, it tries to serialize the environment as well, resulting in a "can't pickle pygame.Font object" error.
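A minimal sketch of the problem and one common fix. The Font and Env classes below are stand-ins for pygame.font.Font and the MACAD-Gym environment; the class and method names are hypothetical, chosen only to mirror the pattern in continuous_A3C.py:

```python
import pickle

class Font:
    """Stand-in for pygame.font.Font, which cannot be pickled."""
    def __reduce__(self):
        raise TypeError("cannot pickle 'Font' object")

class Env:
    """Stand-in for an environment that holds an unpicklable handle."""
    def __init__(self):
        self.font = Font()

class NetHoldingEnv:
    """Pattern from the issue: the env lives on the network object,
    so mp.Process must pickle it when spawning a worker -> fails."""
    def __init__(self):
        self.env = Env()

class NetHoldingFactory:
    """Fix: store a zero-arg factory instead; functions and classes
    pickle by reference, and each worker builds its own env."""
    def __init__(self, env_fn):
        self.env_fn = env_fn
        self.env = None
    def setup(self):
        # Call this inside the worker's run(), after the process starts.
        self.env = self.env_fn()

# Pickling the env-holding net reproduces the error:
try:
    pickle.dumps(NetHoldingEnv())
    raise AssertionError("expected pickling to fail")
except TypeError:
    pass

# The factory-holding net serializes without issue:
pickle.dumps(NetHoldingFactory(Env))
```

Keeping the environment out of anything that crosses a process boundary (and constructing it lazily inside each worker) sidesteps the pickling entirely.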
Training
Even after fixing those problems, it still doesn't seem to work. The mean reward curve doesn't trend upward (nor does the distance curve trend downward), and I haven't seen a single successful episode yet (3M steps; maybe that's not enough to draw a conclusion?).
I know that PPO and IMPALA are the recommended algorithms, but since A3C is available in the repo, I want to know if it actually works.