Harsh Bhatt (@harshbhatt7585) 's Twitter Profile
Harsh Bhatt

@harshbhatt7585

19 | training neural networks since 16.
prev @ remyx.ai secta.ai aragon.ai voice.ai
alumni@ tks.world launchx.com

ID: 1578325960272662528

linkhttps://youtube.com/channel/UCiD7kslR7lKSaPGSQ-heOWg calendar_today07-10-2022 10:07:19

943 Tweet

474 Takipçi

565 Takip Edilen

Harsh Bhatt (@harshbhatt7585) 's Twitter Profile Photo

The main problem with using dropout in reinforcement learning is that the policy network is used twice. Once when collecting rollouts and again during the gradient update. Each time a new dropout mask is randomly generated, meaning the network behaves slightly differently even

The main problem with using dropout in reinforcement learning is that the policy network is used twice. Once when collecting rollouts and again during the gradient update. Each time a new dropout mask is randomly generated, meaning the network behaves slightly differently even
Harsh Bhatt (@harshbhatt7585) 's Twitter Profile Photo

Our brain is off-policy learner with on-policy corrections. 1. When we go to sleep we replays experiences during sleep or rest (like the “replay buffer” in RL). That’s an off-policy mechanism. 2. Learning from observation: We can learn by watching others (imitation learning).

Our brain is off-policy learner with on-policy corrections.

1. When we go to sleep we replays experiences during sleep or rest (like the “replay buffer” in RL). That’s an off-policy mechanism.

2. Learning from observation: 
We can learn by watching others (imitation learning).
Harsh Bhatt (@harshbhatt7585) 's Twitter Profile Photo

built a UI for live training of an agent in gridworld env with Q-learning, you can see how agent is training and how Q score of a state is being updated. During the training you can tweak parameters like learning rate, exploration (epsilon), discount factor and see how training

Harsh Bhatt (@harshbhatt7585) 's Twitter Profile Photo

Wrote a simple article on Q-learning Algorithm. I will also create a video on it :) plume-robin-b8f.notion.site/Q-learning-29b…

Wrote a simple article on Q-learning Algorithm. 

I will also create a video on it :)

plume-robin-b8f.notion.site/Q-learning-29b…
Harsh Bhatt (@harshbhatt7585) 's Twitter Profile Photo

Congratulations to Women’s Team India for winning the Cricket World Cup. I really enjoyed watching the match. This cup will inspire the whole next generation not just girls but also boys to follow the passion irrespective of any circumstances. It is a great cricket memory to

Congratulations to Women’s Team India for winning the Cricket World Cup. I really enjoyed watching the match. This cup will inspire the whole next generation not just girls but also boys to follow the passion irrespective of any circumstances.

It is a great cricket memory to