The XOR problem is the classic example of a non-linearly separable problem. A single-layer perceptron cannot solve it — you need at least one hidden layer. Watch the decision boundary evolve as the network learns!