Why does a single-layer perceptron converge so slowly without normalization, even when the margin is large?
Question

This question has been completely rewritten after I confirmed my results (the Python notebook can be found here) against a piece of code written by someone else (can be found here). Here is that code, instrumented by me to work with my data and to count epochs until convergence:

import numpy as np
from matplotlib import pyplot as plt
class