Chapter 3: Task Environments

3.1 What is a Task Environment?

A task environment is the problem to which an agent is the solution. It consists of:

Property	Description	Example
Fully Observable	Agent can see complete state	Chess (vs Poker)
Deterministic	Next state depends only on current state + action	Taxi navigation (vs Poker)
Episodic	Actions divided into atomic episodes	Image classification
Static	Environment doesn't change while agent deliberates	Crossword puzzle
Discrete	Finite number of states/actions	Chess (vs autonomous driving)
Single Agent	No other agents affecting environment	Sudoku solver

Fully Observable: Agent sensors give access to complete state of environment (e.g., chess)

Partially Observable: Agent has limited/partial information (e.g., poker, real-world navigation)

Deterministic: Next state completely determined by current state + action (e.g., solving a puzzle)

Stochastic: Some randomness in outcomes (e.g., robot navigation with sensor noise)

Episodic: Each action independent of previous ones (e.g., image classification)

Sequential: Current decision affects future ones (e.g., chess, navigation)

Environment	Observability	Determinism	Episodic/Sequential	Static/Dynamic
Chess	Fully	Deterministic	Sequential	Semi-static
Poker	Partial	Stochastic	Sequential	Dynamic
Robot Navigation	Partial	Stochastic	Sequential	Dynamic
Image Classification	Fully	Deterministic	Episodic	Static
Medical Diagnosis	Partial	Stochastic	Sequential	Dynamic

Compare and contrast fully observable and partially observable environments with examples.
Why is chess considered a deterministic environment while poker is stochastic?
Explain how the properties of an environment affect agent design choices.
Classify the following environments: self-driving car, spam filter, chess AI, weather prediction system.
What makes an environment dynamic versus static? Give two examples of each.