AI with Reinforcement Learning: Fall 2020 Assignment 1 Tasks
Your task is to design a reinforcement learning algorithm to teach the mouse how to find the food. The fundamental task is as follows:
- There is a 100x100 matrix representing a grid where any space can be occupied by either the mouse or a piece of food. There is only one mouse and an arbitrary number of food pieces.
- The mouse is able to sense the food with a 3x3 matrix representing its sense of smell. The scent's range covers the full grid, and scent values stack (two pieces of food next to each other generate twice as much smell). This matrix will be the input for your algorithm. Note that the center of the matrix will always be 0, as that space represents the mouse itself.
- The mouse has a limited amount of energy, which is fully replenished when it finds food. If it runs out of energy, it dies, and the game is over.
- The mouse is able to move in any cardinal direction (North, South, East, and West). The goal is for it to eat all of the food in the grid as quickly as possible.
- This simulation is visually represented using PyGame. The task can also be solved by a trivial algorithm using nothing but simple arithmetic, as the 'scent' of food is a function of distance from the mouse. You can try to find a non-RL solution and compare its results to those of your best RL model.
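As a point of comparison, the greedy heuristic alluded to above can be sketched in a few lines. The layout of the 3x3 scent matrix (row 0 = north) and the direction labels below are assumptions about the framework's conventions, not part of the assignment:

```python
import numpy as np

def greedy_direction(scent_3x3):
    """Non-RL baseline: step toward the cardinal neighbor with the most scent.
    scent_3x3 is the mouse's 3x3 smell matrix; its center is always 0."""
    s = np.asarray(scent_3x3)
    candidates = {"N": s[0, 1], "S": s[2, 1], "E": s[1, 2], "W": s[1, 0]}
    return max(candidates, key=candidates.get)
```

This serves as the non-RL baseline the RL model can later be compared against.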
For each frame, a forward pass is run through your model. The output for each frame is an array (or tuple) of four numbers between 0 and 1: the probabilities generated by your model. These determine how the mouse moves (see the sketch after the questions below).
Questions:
- Does the order matter for the reinforcement learning model? (Ex: Inputting (N,S,E,W) vs (N,E,S,W))
- Does the order matter for a closed form solution?
- What would be better for reinforcement learning: taking the highest value from the array as the movement choice, or choosing a random direction weighted by the given probabilities? Why?
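For reference, both interpretations of the four-probability output are easy to express. The snippet below uses plain NumPy; the (N, S, E, W) ordering is assumed for illustration:

```python
import numpy as np

DIRECTIONS = ["N", "S", "E", "W"]   # assumed output ordering

def greedy_action(probs):
    """Exploitation: always take the direction with the highest probability."""
    return int(np.argmax(probs))

def sampled_action(probs):
    """Exploration-friendly: pick a direction at random, weighted by the
    model's probabilities (probs must sum to 1)."""
    return int(np.random.choice(len(probs), p=probs))
```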
Tasks for the students:
- Write a reward function. You have access to the current game state, as well as a number of previous frames of input matrices, food levels, and food-tile counts; the number of stored frames equals the number of frames it takes to starve from a full energy level. The reward function can be very simple, such as just using the number of food tiles found, or it can use reward shaping over multiple frames of previous input matrices to create more frequent positive/negative rewards (Ex: when the mouse moves closer to or farther away from food). A minimal sketch follows the questions below.
Questions:
- What would be a sparse reward function for this model?
- How can the reward function be improved?
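To make the contrast between the two reward styles concrete, here is a minimal sketch of a sparse reward next to a shaped one. The state fields (food_found, scent) are hypothetical names for whatever the assignment's game state actually exposes:

```python
import numpy as np

def sparse_reward(prev_state, state):
    """Sparse: +1 only on the frame where a new food tile is eaten."""
    return float(state["food_found"] - prev_state["food_found"])

def shaped_reward(prev_state, state, approach_bonus=0.05):
    """Shaped: keep the sparse food signal, plus a small bonus (or penalty)
    when the total sensed scent increases (or decreases), i.e. when the
    mouse moves closer to (or farther from) food."""
    reward = sparse_reward(prev_state, state)
    scent_change = np.sum(state["scent"]) - np.sum(prev_state["scent"])
    return reward + approach_bonus * float(np.sign(scent_change))
```

The sparse version answers the first question directly; the shaped version is one way of improving it, at the risk of the agent learning to chase scent rather than food.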
Write a model: As described above, write a model that takes in 8 inputs (the mouse's sensory matrix minus its center) and outputs one probability for each of the four cardinal directions. In addition, you will handle when to back-propagate a reward and when to keep running. A sketch of such a model follows.
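A minimal way to realize this is a small policy network with a softmax output layer. The sketch below assumes PyTorch; the class name and hidden-layer size are arbitrary choices:

```python
import torch
import torch.nn as nn

class MousePolicy(nn.Module):
    """Tiny policy network: 8 scent inputs -> 4 direction probabilities."""

    def __init__(self, hidden_size=32):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(8, hidden_size),    # 3x3 scent matrix minus the center
            nn.ReLU(),
            nn.Linear(hidden_size, 4),    # one logit per direction (N, S, E, W)
        )

    def forward(self, scent):
        # scent: tensor of shape (8,) or (batch, 8)
        return torch.softmax(self.net(scent), dim=-1)
```

Because the softmax guarantees the four outputs sum to 1, they can be fed directly into either greedy_action or sampled_action above.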
Additional task (optional, worth extra credit): There are many variables that you can experiment with to complicate the problem and make it more suitable to reinforcement learning. One of them is decreasing the range of the mouse's scent with the variable SCENT_RANGE. Another is the variable VARIABLE_TERRAIN, which gives the mouse a second sense (sight) and assigns each terrain section a value indicating how much energy is spent by stepping onto that tile.
Q: How does the reward function change if you add the new variables?
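If VARIABLE_TERRAIN is enabled, one plausible (though certainly not the only) adjustment is to charge the reward for the energy cost of the tile just entered, so the agent learns to weigh scent gradients against terrain cost. The sketch below builds on the shaped_reward sketch above; the terrain_cost field is again a hypothetical name:

```python
def terrain_aware_reward(prev_state, state, energy_penalty=0.01):
    """Extend the shaped reward with a penalty for expensive terrain."""
    reward = shaped_reward(prev_state, state)
    return reward - energy_penalty * state["terrain_cost"]
```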
Paper For Above Instructions
The design of a reinforcement learning (RL) algorithm to teach a mouse to find food in a grid-like environment is a compelling computational problem that illustrates the capacities of AI in making autonomous decisions based on sensory input. This approach is deeply rooted in principles of RL, where agents learn to make choices that maximize cumulative rewards through interactions with their environments.
The fundamental aspect of the assigned task is to create an algorithm capable of navigating a 100x100 grid environment populated by food items and the mouse itself. In this scenario, the mouse must use its sensing capabilities, represented as a 3x3 sensory matrix, to detect food and maneuver accordingly. The center of this matrix always represents the mouse's current location, which is why it is fixed at zero and only the eight surrounding cells carry information.
The mouse's movement decision is based on probabilities generated by the RL model. Given an input consisting of the sensory matrix excluding its center, the model outputs probabilities corresponding to movement in four cardinal directions: north, south, east, and west. As a crucial part of the algorithm, the mouse possesses a limited energy resource, which is restored upon finding food, creating tension in energy management throughout the navigation process.
In crafting the reward function, students must navigate the nuances of reward shaping to ensure efficient learning. They can use various strategies, such as a direct reward for every food item consumed or a reward whenever the mouse moves closer to food, thus providing a learning signal not only when food is eaten but also for intermediate progress toward it. A sparse reward function would grant a reward almost exclusively when food is acquired, whereas richer formulations could factor in distances to food sources and the number of moves taken.
The design choices made when implementing the reinforcement learning model raise pertinent questions, such as whether the order of the inputs influences agent behavior, and whether the mouse should move in the direction with the highest probability or pick a direction at random weighted by the output probabilities. The latter choice is a classic trade-off between exploration and exploitation, and it can significantly impact learning speed and efficiency.
Regarding model construction, taking advantage of the mouse's sensory matrix to derive the output probabilities for the four directions is a key component. For each frame of interaction, the model should process the incoming data dynamically, continuously learning from experiences while adjusting its decision-making policy accordingly. This iterative process is essential to achieving an agent adept at efficiently collecting food.
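To make this per-frame processing concrete, the sketch below shows one common way to wire the pieces together: a REINFORCE-style episode loop that samples an action every frame, stores its log-probability, and back-propagates once the episode ends (all food eaten or the mouse starved). The environment interface (env.reset(), env.step(), the done flag) is a hypothetical stand-in for whatever the PyGame simulation exposes.

```python
import torch

def run_episode(env, policy, optimizer, gamma=0.99):
    """One REINFORCE-style episode: act every frame, update at the end."""
    log_probs, rewards = [], []
    scent = env.reset()                       # 8 scent values, center removed
    done = False
    while not done:
        probs = policy(torch.as_tensor(scent, dtype=torch.float32))
        dist = torch.distributions.Categorical(probs)
        action = dist.sample()                # index 0..3 -> N, S, E, W
        log_probs.append(dist.log_prob(action))
        scent, reward, done = env.step(action.item())
        rewards.append(reward)

    # Discounted returns, accumulated backwards through the episode.
    returns, g = [], 0.0
    for r in reversed(rewards):
        g = r + gamma * g
        returns.insert(0, g)
    returns = torch.tensor(returns)

    # Policy-gradient loss: push up the probability of actions that led
    # to high returns, push down those that led to low returns.
    loss = -(torch.stack(log_probs) * returns).sum()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return sum(rewards)
```

This addresses the "when to back-propagate" part of the task in the simplest way: gradients flow only once per life, using the discounted rewards from that life.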
Further complexity can be introduced into the model through the optional variables SCENT_RANGE and VARIABLE_TERRAIN. By limiting the radius of the mouse's scent detection, one can observe how a more local sense of smell changes what the reward function needs to do. Likewise, variable terrain imposes energy costs on the mouse's movements, compelling the model to weigh trade-offs and plan its path based on both smell and terrain cost.
To ensure the effectiveness of the designed algorithm, testing it against a baseline model that employs simple arithmetic to determine food proximity can furnish students with contrasting performance metrics. Observational comparisons between RL-driven behaviors and those guided by simpler heuristics will reveal insights about the relative merits of RL methods in dynamic environments.
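A straightforward way to make that comparison is to run both decision rules through the same environment for a fixed number of episodes and compare the average amount of food collected. The sketch below reuses the hypothetical environment interface from the training loop above:

```python
def evaluate(env, act_fn, episodes=100):
    """Average food collected per episode for any action-selection rule.
    act_fn maps an 8-value scent reading to a direction index 0..3."""
    totals = []
    for _ in range(episodes):
        scent, done = env.reset(), False
        while not done:
            scent, _, done = env.step(act_fn(scent))
        totals.append(env.food_found)         # hypothetical episode statistic
    return sum(totals) / episodes
```

Both the greedy baseline and the trained policy can be wrapped to fit the act_fn signature, so the two approaches are measured under identical conditions.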
The challenges and intricacies of this design task serve to enrich students' understanding of reinforcement learning principles, enabling them to construct robust AI models that can adapt to complex scenarios. Through iterative learning and reward function calibration, students can pursue the objective of formulating a proficient algorithm capable of navigating the grid while optimizing food collection efficiency.