Create code using a backpropagation network to store the following patterns. The input
patterns are the “letters” given in the 5×3 arrays, and the associated target patterns are
given below each pattern. Experiment with the number of hidden units (no more than 15),
the learning rate, and the initial weight values.
(a) For your network(s), try three different architectures (different number of hidden
neurons, 1 or 2 hidden layers), three sets of random initial weights holding everything else
constant, and three different learning rates (fast, medium and slow). Summarize the
performance of your networks and your training experiences with these alterations.
(b) Using one of your trained networks from part (a), do the following:
(i) Take your input file of patterns and corrupt them slightly (that is, change a few bits).
How is the classification accuracy of your trained net?
(ii) Take your input file of patterns and corrupt them severely (that is, change more bits).
How is the classification accuracy of your trained net?
[Figure: eight 5×3 letter bitmaps with a 3-bit target below each pattern. Top row targets: (-1,-1,-1), (-1,-1,1), (-1,1,-1), (-1,1,1). Bottom row targets: (1,-1,-1), (1,-1,1), (1,1,-1), (1,1,1).]
This is an AI-generated response.
Asked by: ASWINI
Created at: 2025-10-01 23:50:10
Thought Process:
**Mapping target patterns**
I'm analyzing a figure that likely shows eight patterns corresponding to letters like M, B, C, and H. The target outputs are 3-bit combinations that give us eight unique combinations for the letters.
I should provide a mapping for these letters, using 5×3 pixel patterns that define 15 features each. I'm thinking about how to reconstruct these patterns based on the figure, but it seems there's a bottom row showing E, F, G, and H with specific vertical and horizontal bars detailed for each.

**Analyzing letter patterns**

For the letter G, it looks like a C shape with an opening and a bottom, plus maybe a dot near the center, labeled as (1,1,-1). Then for H, both vertical columns have a middle dot, labeled (1,1,1).

About the top row: it could represent M with two vertical lines, but the 5×3 grid might not distinguish M from H accurately. We don't necessarily need exact letters; we can create consistent patterns. I also need to keep the instruction in mind that the number of hidden units should not exceed 15.

**Configuring network parameters**

I'm planning to set up a neural network with 3 output units and either one or two hidden layers. For the initial weights, I'll use 3 sets of random values seeded from [0, 1, 2].

I will also try out 3 different learning rates—fast, medium, and slow—with values like 0.5, 0.1, and 0.01. The activation function will be tanh, as the outputs need to match targets of -1 and +1.

I'll apply MSE for the loss and use standard backpropagation with optional momentum for training. I'll evaluate training accuracy, and later test on slight and severe bit flips, measuring classification accuracy by taking the sign of each output.

**Planning the code implementation**

While I can't execute the code to generate actual numbers, I can suggest that when you run the model, you might see results like 100% training accuracy, and for slight corruption, around 90–100%. Severe corruption could drop accuracy to about 50%.

For our approach, I'll propose a step-by-step plan with code and tips. I plan to implement a backpropagation network using NumPy and Python by creating an MLP class with one or two hidden layers. I'll also consider using PyTorch, but since the assignment likely expects a from-scratch implementation, let's stick to that for clarity.

I'll set up the dataset of 8 patterns and define a function to convert letters into vector arrays, specifying the 8 letter bitmaps. Considering letters like A, M, B, C, H, E, F, G, or I will help us structure the patterns.

**Designing 5x3 patterns**

I see that I've identified 'H' in the top right, but I'm unsure if labeling is necessary. Instead of calling them by letter, I can just name them pattern0 through pattern7. I'm also thinking we could attempt to replicate the given patterns accurately.

I'll arrange 5×3 patterns by defining each one based on the figure. For example, the first pattern could represent 'M', where the left and right columns are filled. The dataset of 8 patterns will include letters like A, B, C, H, E, F, G, and I, which I'll design as simple block letters.

For each letter, I'll represent it using a 5×3 grid, ensure the patterns are unique, and provide a clear ±1 encoding for the network. After constructing these patterns, I'll set up the dataset and define the MLP class components.

**Structuring the training process**

I'm setting up the activation function as tanh, which means I'll compute and store activations for backpropagation. For the backward pass, I'll calculate gradients using the MSE derivative and use batch gradient descent over all patterns.

The training function will run for a bounded number of epochs or until the loss plateaus, and I'll record accuracy along the way. I'll create a corruption function to randomly flip bits in samples, and I'll average over many random corruptions to compute accuracy.

For the architectures, I plan to define three variations with different hidden units: one hidden layer with 6 units, one with 12 units, and two hidden layers with [8,4]. I'll test learning rates of 0.2, 0.05, and 0.01 too. Finally, I'll run the experiments, print summary results, and provide a structured approach for experimentation, including how to visualize patterns.
Final Answer:
Great — let’s treat this like a guided mini‑project. I’ll give you:
- a tiny 5×3 “letters” dataset (8 patterns, 15 inputs each, targets are 3-bit vectors with values ±1)
- a from‑scratch backprop MLP (tanh activations, MSE loss) that supports 1 or 2 hidden layers
- an experiment harness to vary architecture, learning rate, and initial weights
- a corruption test to measure robustness under slight vs severe noise
- tips on what you should expect and how to summarize results
If you prefer PyTorch instead of NumPy/backprop-by-hand, I can provide that too, but this version keeps the mechanics transparent.
1) Dataset and encoding
- Inputs: 5×3 bitmaps flattened to length 15 and encoded as +1 for filled and −1 for blank.
- Targets: 3 output neurons with the 8 combinations of (±1, ±1, ±1), matching the figure’s order.
The eight patterns below are simple 5×3 “block letters.” The exact glyphs aren’t sacred; consistency is what matters.
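As a quick sanity check on the target encoding: the eight 3-bit targets are simply the Cartesian product (±1)³, so they can be generated instead of typed by hand. This short snippet reproduces, in lexicographic order, the hard-coded `targets` array used in the full program below:

```python
import itertools
import numpy as np

# All 8 combinations of (-1, +1) over 3 output neurons,
# in the same order as the figure's targets.
targets = np.array(list(itertools.product([-1.0, 1.0], repeat=3)))
```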
Code (copy-paste into a single file and run with Python 3.9+):
import numpy as np
from dataclasses import dataclass
from typing import List, Optional

# ---------- Utilities ----------
def to_pm1(arr01):
    # Convert 0/1 array to -1/+1
    return np.where(arr01 > 0, 1.0, -1.0)

def sign01(x):
    # map outputs to -1/+1 by sign, with 0 -> +1
    return np.where(x >= 0, 1.0, -1.0)

def accuracy_pm1(y_true, y_pred):
    yb = sign01(y_pred)
    return (yb == y_true).all(axis=1).mean()

def flatten(pm):
    return np.array(pm, dtype=float).reshape(-1)

def draw_pattern(vec15):
    # pretty-print a 5x3 pattern encoded as -1/+1
    g = vec15.reshape(5, 3)
    lines = []
    for r in range(5):
        lines.append("".join("#" if g[r, c] > 0 else "." for c in range(3)))
    return "\n".join(lines)
# ---------- 5x3 letters (8 patterns) ----------
# Define 8 patterns as 0/1 grids, then convert to +/-1
A = [
    [0,1,0],
    [1,0,1],
    [1,1,1],
    [1,0,1],
    [1,0,1],
]
B = [
    [1,1,0],
    [1,0,1],
    [1,1,0],
    [1,0,1],
    [1,1,0],
]
C = [
    [0,1,1],
    [1,0,0],
    [1,0,0],
    [1,0,0],
    [0,1,1],
]
H = [
    [1,0,1],
    [1,0,1],
    [1,1,1],
    [1,0,1],
    [1,0,1],
]
E = [
    [1,1,1],
    [1,0,0],
    [1,1,1],
    [1,0,0],
    [1,1,1],
]
F = [
    [1,1,1],
    [1,0,0],
    [1,1,1],
    [1,0,0],
    [1,0,0],
]
G = [
    [0,1,1],
    [1,0,0],
    [1,0,1],
    [1,0,1],
    [0,1,1],
]
I = [
    [1,1,1],
    [0,1,0],
    [0,1,0],
    [0,1,0],
    [1,1,1],
]

letters01 = [A, B, C, H, E, F, G, I]
X = np.stack([to_pm1(flatten(p)) for p in letters01], axis=0)  # shape (8, 15)

# Targets: 8 combinations of (-1,+1)^3 in the same order as the patterns
targets = np.array([
    [-1,-1,-1],
    [-1,-1, 1],
    [-1, 1,-1],
    [-1, 1, 1],
    [ 1,-1,-1],
    [ 1,-1, 1],
    [ 1, 1,-1],
    [ 1, 1, 1],
], dtype=float)
Y = targets
# ---------- MLP (from scratch) ----------
@dataclass
class TrainConfig:
    lr: float = 0.05
    max_epochs: int = 20000
    tol: float = 1e-4
    print_every: int = 0  # set >0 to see training logs
    seed: Optional[int] = None

class MLP:
    def __init__(self, input_dim: int, hidden: List[int], output_dim: int, seed: Optional[int] = None):
        self.layers = [input_dim] + hidden + [output_dim]
        self.n_layers = len(self.layers) - 1
        self.rng = np.random.default_rng(seed)
        self.W = []
        self.b = []
        for i in range(self.n_layers):
            fan_in, fan_out = self.layers[i], self.layers[i+1]
            # Xavier init for tanh
            limit = np.sqrt(6 / (fan_in + fan_out))
            self.W.append(self.rng.uniform(-limit, limit, size=(fan_in, fan_out)))
            self.b.append(np.zeros((1, fan_out)))
        # caches
        self.z = []
        self.a = []

    @staticmethod
    def act(x):
        return np.tanh(x)

    @staticmethod
    def dact(a):
        # derivative of tanh wrt pre-activation, expressed via a = tanh(z)
        return 1 - a**2

    def forward(self, X):
        a = X
        self.z = []
        self.a = [a]
        for i in range(self.n_layers):
            z = a @ self.W[i] + self.b[i]
            a = self.act(z)  # tanh on every layer, including the output, to match ±1 targets
            self.z.append(z)
            self.a.append(a)
        return a

    def mse(self, y, yhat):
        return 0.5 * np.mean((y - yhat)**2)

    def backward(self, X, Y, yhat, lr):
        m = X.shape[0]
        # Output layer delta
        delta = (yhat - Y) * self.dact(self.a[-1])  # shape (m, out)
        dW = []
        db = []
        for i in reversed(range(self.n_layers)):
            a_prev = self.a[i]  # shape (m, fan_in)
            dWi = (a_prev.T @ delta) / m
            dbi = np.mean(delta, axis=0, keepdims=True)
            dW.insert(0, dWi)
            db.insert(0, dbi)
            if i > 0:
                delta = (delta @ self.W[i].T) * self.dact(self.a[i])
        # gradient step
        for i in range(self.n_layers):
            self.W[i] -= lr * dW[i]
            self.b[i] -= lr * db[i]

    def fit(self, X, Y, cfg: TrainConfig):
        last_loss = np.inf
        for epoch in range(1, cfg.max_epochs + 1):
            yhat = self.forward(X)
            loss = self.mse(Y, yhat)
            self.backward(X, Y, yhat, cfg.lr)
            if cfg.print_every and epoch % cfg.print_every == 0:
                acc = accuracy_pm1(Y, yhat)
                print(f"epoch {epoch:5d} loss {loss:.6f} acc {acc:.3f}")
            if abs(last_loss - loss) < cfg.tol:
                break
            last_loss = loss
        return epoch, loss

    def predict(self, X):
        return self.forward(X)
# ---------- Corruption helpers ----------
def flip_bits_pm1(X, nbits, rng=None):
    # Flip 'nbits' positions per sample in ±1-encoded vectors
    if rng is None:
        rng = np.random.default_rng()
    Xc = X.copy()
    n, d = X.shape
    for i in range(n):
        idx = rng.choice(d, size=nbits, replace=False)
        Xc[i, idx] *= -1.0
    return Xc

def corruption_accuracy(model, X, Y, nbits, trials=200, seed=42):
    rng = np.random.default_rng(seed)
    correct = 0
    for t in range(trials):
        Xc = flip_bits_pm1(X, nbits, rng=rng)
        yhat = model.predict(Xc)
        correct += ((sign01(yhat) == Y).all(axis=1)).sum()
    total = X.shape[0] * trials
    return correct / total
# ---------- Experiment harness ----------
def run_experiments():
    # Three architectures (≤ 15 hidden neurons total)
    architectures = [
        [6],     # 1 hidden layer, 6 units
        [12],    # 1 hidden layer, 12 units
        [8, 4],  # 2 hidden layers, 8 + 4 units
    ]
    # Three random initializations
    seeds = [0, 1, 2]
    # Three learning rates: fast, medium, slow
    lrs = [0.2, 0.05, 0.01]
    results = []
    for arch in architectures:
        for lr in lrs:
            for sd in seeds:
                cfg = TrainConfig(lr=lr, max_epochs=20000, tol=1e-7, seed=sd)
                mlp = MLP(input_dim=15, hidden=arch, output_dim=3, seed=sd)
                epochs, loss = mlp.fit(X, Y, cfg)
                yhat = mlp.predict(X)
                acc = accuracy_pm1(Y, yhat)
                results.append({
                    "arch": arch,
                    "lr": lr,
                    "seed": sd,
                    "epochs": epochs,
                    "loss": float(loss),
                    "train_acc": float(acc),
                    "model": mlp,
                })
                print(f"arch={arch} lr={lr} seed={sd} -> epochs={epochs} loss={loss:.6f} acc={acc:.3f}")
    return results

def evaluate_corruption(best_model):
    print("\nCorruption tests with the selected trained model:")
    for nbits, label in [(2, "slight"), (5, "severe")]:
        acc = corruption_accuracy(best_model, X, Y, nbits=nbits, trials=200, seed=123)
        print(f"{label} corruption (flip {nbits} of 15 bits): accuracy={acc:.3f}")

if __name__ == "__main__":
    print("Dataset patterns (5x3):\n")
    for i, vec in enumerate(X):
        print(f"Pattern {i}: target={Y[i].tolist()}")
        print(draw_pattern(vec))
        print()
    results = run_experiments()
    # Choose the best by training accuracy, then lowest loss
    best = sorted(results, key=lambda r: (-r["train_acc"], r["loss"]))[0]
    best_model = best["model"]
    print("\nBest setting:")
    print({k: v for k, v in best.items() if k != "model"})
    evaluate_corruption(best_model)
How this satisfies the assignment
(a) Vary architecture, seeds, and learning rate:
- architectures: [6], [12], and [8,4] provide:
  - different numbers of hidden neurons
  - both 1 and 2 hidden layers
- three sets of initial weights: seeds [0,1,2]
- three learning rates: “fast/medium/slow” = 0.2, 0.05, 0.01
- The harness prints a one-line summary per run and returns a list of results so you can tabulate in your report.
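For the tabulation, a minimal sketch that formats such a results list as an aligned text table; the dict keys match those produced by `run_experiments`, and the demo row is hypothetical, not a real measurement:

```python
def results_table(results):
    # one aligned row per run; keys match the harness's result dicts
    header = f"{'arch':>8} {'lr':>6} {'seed':>4} {'epochs':>7} {'loss':>10} {'acc':>6}"
    rows = [header]
    for r in results:
        rows.append(f"{str(r['arch']):>8} {r['lr']:>6.2f} {r['seed']:>4d} "
                    f"{r['epochs']:>7d} {r['loss']:>10.6f} {r['train_acc']:>6.3f}")
    return "\n".join(rows)

# hypothetical demo row for illustration only
demo = [{"arch": [6], "lr": 0.05, "seed": 0,
         "epochs": 3200, "loss": 1.2e-4, "train_acc": 1.0}]
print(results_table(demo))
```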
What you’ll typically observe (guidance for your summary):
- With tanh and MSE, 1 hidden layer of 6–12 units is already enough to memorize 8 patterns; you should reach 100% training accuracy reliably for medium/slow learning rates and most initializations.
- Too large a learning rate (e.g., 0.2) may sometimes oscillate before converging or converge in very few epochs depending on init.
- Two hidden layers (8,4) also train cleanly. They may need slightly more epochs with small LR but achieve similar accuracy and often slightly lower final MSE due to extra representational capacity.
- Random initialization matters mostly for high LRs; with small LR, different seeds converge to similar solutions but epochs-to-converge can vary.
(b) Robustness to corruption
The function evaluate_corruption runs:
- slight corruption: flip 2 of the 15 input bits
- severe corruption: flip 5 of the 15 input bits
Both are repeated over many random trials to estimate accuracy.
What to expect:
- Slight corruption: usually high accuracy (often 0.8–1.0) because the network has learned the general structure of each letter. Adding L2 regularization or training with a little noise can improve this further.
- Severe corruption: accuracy drops substantially (often near 0.4–0.7 depending on architecture and LR), because 5/15 flipped bits can push some samples across decision boundaries. Wider hidden layers and models trained with small LR tend to be a bit more robust.
How to write up your results
- Report a small table for part (a) with columns: architecture, lr, seed, epochs, final loss, training accuracy. Highlight the fastest-converging and the most reliable settings.
- For part (b), pick the best model from (a). Report mean accuracy for k=2 (slight) and k=5 (severe). Optionally include a plot of accuracy vs number of flipped bits k=0..7 to show the degradation curve.
- Discuss training experiences: sensitivity to learning rate, variability across seeds, and qualitative robustness.
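The degradation-curve idea can be prototyped before your net is even trained. This sketch uses a toy nearest-prototype classifier as a stand-in for the MLP (all names here are illustrative, not from the code above); it sweeps the number of flipped bits k and records accuracy, which is the same loop shape you would run with your trained model:

```python
import numpy as np

# Toy stand-in: 8 prototypes of length 15 whose first 3 entries encode a
# 3-bit label; classify by maximum dot product (min Hamming distance).
protos = -np.ones((8, 15))
for i in range(8):
    for b in range(3):
        if (i >> b) & 1:
            protos[i, b] = 1.0

def classify(x):
    return int(np.argmax(protos @ x))

def accuracy_vs_flips(k, trials=200, seed=0):
    rng = np.random.default_rng(seed)
    correct = 0
    for _ in range(trials):
        for label in range(8):
            x = protos[label].copy()
            idx = rng.choice(15, size=k, replace=False)
            x[idx] *= -1.0  # flip k of the 15 ±1 bits
            correct += classify(x) == label
    return correct / (8 * trials)

curve = [accuracy_vs_flips(k) for k in range(8)]  # accuracy degrades as k grows
```

Plotting `curve` against k gives the degradation curve suggested above; with your trained net, replace `classify` by sign-thresholded `model.predict`.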
Optional extensions you can try (easy tweaks to the code):
- Add momentum or L2 weight decay.
- Train with small input noise (data augmentation) to improve robustness.
- Switch to sigmoid outputs with cross-entropy loss, or keep tanh+MSE; for ±1 targets, tanh+MSE is fine.
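The momentum and weight-decay extension amounts to a couple of lines in the gradient step. A minimal sketch with illustrative names (`vel` is a per-parameter velocity buffer you would keep alongside each weight matrix):

```python
import numpy as np

def sgd_momentum_step(W, dW, vel, lr=0.05, mu=0.9, l2=1e-4):
    # velocity = exponentially decayed sum of past gradients;
    # the l2 term shrinks weights toward zero (weight decay)
    vel = mu * vel - lr * (dW + l2 * W)
    return W + vel, vel

# demo on a toy quadratic loss 0.5*w^2 (its gradient is w itself)
w, v = np.array([1.0]), np.array([0.0])
for _ in range(300):
    w, v = sgd_momentum_step(w, w, v)
```

In the MLP above this would replace the plain `self.W[i] -= lr * dW[i]` update, with one `vel` buffer per layer.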
If you want me to adapt this to PyTorch, change the letters, or emit a ready-made results table for your environment, tell me your Python version and I’ll tailor the code.