ZC-Swish: Stabilizing Deep BN-Free Networks for Edge and Micro-Batch Applications

Researchers propose Zero-Centered Swish, an activation function designed to stabilize training in batch-normalization-free deep networks, addressing a critical pain point in micro-batch regimes like 3D medical imaging and federated learning where standard activations cause gradient collapse.
MentionsZC-Swish · Swish · ReLU · Batch Normalization
Read full story at arXiv cs.LG →(arxiv.org)
Modelwire summarizes — we don’t republish. The full article lives on arxiv.org. If you’re a publisher and want a different summarization policy for your work, see our takedown page.