A new design of a multi-dimensional real-time VLSI convolver is presented. A custom VLSI chip is proposed which, when accompanied by memory buffers, can be used to assemble a convolver of arbitrary dimension and with arbitrary input size. The convolver is optimal with respect to the size of memory and has very small latency. Numerous modifications of the basic design are introduced in a framework of a unified graph-theoretic transformation called retiming. This approach guarantees functional equivalence of the original and modified systems.