The ultimate goal for future telecommunication is highly effective inter-personal information exchange. The effectiveness of telecommunication is greatly enhanced by 3-D telepresence. This requires that visual information is presented in such a way that the viewer is under the impression of actually being physically close to the party with whom the communication takes place. One way to achieve a natural 3-D impression is to encode image sequences using 3-D model objects and animate them again by computer graphic means regarding the observers eye positions. This concept will use a parametric 3-D scene description in order to model a scene. The parameters of the model objects will be estimated from trinocular input image sequences by means of image analysis. This paper starts with an overview on the European ACTS project PANORAMA, in which the above mentioned concept will be realized and evaluated. In the main part the shape initialization of physical objects from a multiview image sequence will be discussed. For this the range information given by three disparity maps from different stereo views is backprojected into 3-D space. The resulting cloud of 3-D points is then approximated by a flexible triangular net by using a technique named discrete smooth interpolation. The discrete smooth interpolation is a particular surface interpolation technique, which is solved by an iterative approach. It allows to generate a surface, defined as a wireframe mesh, that fits (or interpolates) a given set of 3D points by observing, at the same time, some given constraints about the surface characteristics, like roughness, behavior at the boundaries, etc. The finally presented results show the capabilities of this approach in video communication.