In the design of a magazine cover, making a set of decisions regarding the color distribution of the cover image and the colors of other graphical and textual elements is considered to be the concept of color design. This concept addresses a number of subjective challenges, specifically how to determine a set of colors that is aesthetically pleasing yet also contributes to the functionality of the design, the legibility of textual elements, and the stylistic consistency of the class of magazine. Our solution to automatic color design includes the quantification of these challenges by deploying a number of well-known color theories. These color theories span both color harmony and color semantics. The former includes a set of geometric structures that suggest which colors are in harmony together. The latter suggests a higher level of abstraction. Color semantics means to bridge sets of color combinations with color mood descriptors. For automatic design, we aim to deploy these two viewpoints by applying geometric structures for the design of text color and color semantics for the selection of cover images.
Automated publishing requires large databases containing document page layout templates. The number of
layout templates that need to be created and stored grows exponentially with the complexity of the document
layouts. A better approach for automated publishing is to reuse layout templates of existing documents for
the generation of new documents. In this paper, we present an algorithm for template extraction from a docu-
ment page image. We use the cost-optimized segmentation algorithm (COS) to segment the image, and Voronoi
decomposition to cluster the text regions. Then, we create a block image where each block represents a homo-
geneous region of the document page. We construct a geometrical tree that describes the hierarchical structure
of the document page. We also implement a font recognition algorithm to analyze the font of each text region.
We present a detailed description of the algorithm and our preliminary results.
The popularity of tablets in recent years has given rise to a new style of presenting web content, one that mimics the appearance of print magazines. We present a pipeline which automatically harvests and reformats web content from arbitrary sources into a paginated multi-column form and deliver the publication to the reader via print or mobile. We are working to integrate this pipeline into HP’s Scheduled Delivery program.
We present HP SmartPrint, a novel web browser plug-in which automatically suggests print-worthy content within a web
page and provides an intuitive UI for users to make corrections to the initial suggestion, if needed. The resulting prints
contain only user desired content and excludes noise such as ads, thus increasing the desirability of the prints while
minimizing the cost. This solution provides a streamlined web printing experience and will be shipping with most HP
printers starting in 2011.
In this paper, we propose a system for automatic design of magazine covers that quantifies a number of concepts from
art and aesthetics. Our solution to automatic design of this type of media has been shaped by input from professional
designers, magazine art directors and editorial boards, and journalists. Consequently, a number of principles in design
and rules in designing magazine covers are delineated. Several techniques are derived and employed in order to quantify
and implement these principles and rules in the format of a software framework. At this stage, our framework divides the
task of design into three main modules: layout of magazine cover elements, choice of color for masthead and cover lines,
and typography of cover lines. Feedback from professional designers on our designs suggests that our results are
congruent with their intuition.
We present hp2.me, a URL shortener service for improving the mobile web consumption experience. Unlike
other such services, given a short URL, hp2.me returns an image rendered from the salient regions of a web
page. This approach to displaying web content improves mobile web reading experience through reduced
latency and improved clarity. It is faster to load a few large image files over a cellular data network than
many small files, and limited mobile screen real estate can be better utilized to display relevant content. The
hp2.me service is currently in external beta, and we present results to illustrate the advantages of this
Even though technology has allowed us to measure many different aspects of images, it is still a challenge to
objectively measure their aesthetic appeal. A more complex challenge is presented when an arrangement of
images is to be analyzed, such as in a photo-book page. Several approaches have been proposed to measure the
appeal of a document layout that, in general, make use of geometric features such as the position and size of a
single object relative to the overall layout. Fewer efforts have been made to include in a metric the influence of
the content and composition of images in the layout. Many of the aesthetic characteristics that graphic designers
and artists use in their daily work have been either left out of the analysis or only roughly approximated in an
effort to materialize the concepts.
Moreover, graphic design tools such as transparency and layering play an important role in the professional
creation of layouts for documents such as posters and flyers. The main goal of our study is to apply similar
techniques within an automated photo-layout generation tool. Among other design techniques, the tool makes
use of layering and transparency in the layout to produce a professional-looking arrangement of the pictures.
Two series of experiments with people from different levels of expertise with graphic design provided us with the
tools to make the results of our system more appealing. In this paper, we discuss the results of our experiments
in the context of distinct graphic design concepts.
Web article pages usually have hyperlinks (or links) that lead to print-friendly web pages containing mainly the article
content. Content extraction using these print-friendly pages is generally easier and more reliable, but there are many
variations of the print-link representations in HTML that made robust print-link detection more difficult than it first
appears. First, the link can be text-based, image-based, or both. For example, there is a lexicon of phrases used to
indicate print-friendly pages, such as "print", "print article", "print-friendly version", etc. In addition, some links use
printer-resembling image icons with or without a print phrase present. To complicate the matter further, not all the links
no URL is available for extraction. We estimate that there are more than 90% of the Web article pages have print-links,
of which about 35% of them have valid print-friendly URLs, which is a good percentage. Our solution to the print-link
extraction problem takes on two stages: (1) the detection of the print-link, (2) the retrieval of the print-friendly page
URL from the link attributes, including the test for its validity. Experimental results based on roughly 2000 web article
pages suggest our solution is capable of achieving over 99% precision and 97% recall performance measures.
Managing large document databases is an important task today. Being able to automatically com-
pare document layouts and classify and search documents with respect to their visual appearance
proves to be desirable in many applications. We measure single page documents' similarity with
respect to distance functions between three document components: background, text, and saliency.
Each document component is represented as a Gaussian mixture distribution; and distances between
dierent documents' components are calculated as probabilistic similarities between corresponding
distributions. The similarity measure between documents is represented as a weighted sum of the
components' distances. Using this document similarity measure, we propose a browsing mechanism
operating on a document dataset. For these purposes, we use a hierarchical browsing environment
which we call the document similarity pyramid. It allows the user to browse a large document dataset
and to search for documents in the dataset that are similar to the query. The user can browse the
dataset on dierent levels of the pyramid, and zoom into the documents that are of interest.
We describe a cloud-based automated-publishing platform that allows third party developers to embed our software
components into their applications, enabling their users to rapidly create documents for interactive viewing, or
fulfillment via mail or retail printing. We also describe how applications built on this platform can integrate with a
variety of different consumer digital ecosystems, and how we will address the quality and scaling challenges.
Consumers are increasingly using their smart phones to view web pages. However, there is no native operating system
support for printing these web pages. We propose to overcome two barriers to printing from mobile devices - the
inability to connect and transmit to a printer and the typically poor format of printed web pages. Our system includes a
client component that causes the web browser to upload the page (as a URL reference for public pages or the DOM
content for private pages) to a cloud service that extracts the content and formats it for printing. We transfer the printready
content to the HP CloudPrint service and leverage its ability to locate printers and transmit print jobs. We have
built a working system the uses iPhones and Windows Mobile devices clients, but the system can be extended to include
Businesses have traditionally relied on different types of media to communicate with existing and potential customers.
With the emergence of the Web, the relation between the use of print and electronic media has continually evolved. In
this paper, we investigate one possible scenario that combines the use of the Web and print. Specifically, we consider the
scenario where a small- or medium-sized business (SMB) has an existing web site from which they wish to pull content
to create a print piece. Our assumption is that the web site was developed by a professional designer, working in
conjunction with the business owner or marketing team, and that it contains a rich assembly of content that is presented
in an aesthetically pleasing manner. Our goal is to understand the process that a designer would follow to create an
effective and aesthetically pleasing print piece. We are particularly interested to understand the choices made by the
designer with respect to placement and size of the text and graphic elements on the page. Toward this end, we conducted
an experiment in which professional designers worked with SMBs to create print pieces from their respective web pages.
In this paper, we report our findings from this experiment, and examine the underlying conclusions regarding the
resulting document aesthetics in the context of the existing design, and engineering and computer science literatures that
address this topic
We describe an approach for automatically laying out content for high quality printed formats such as magazines or
brochures, producing an aesthetically pleasing layout that correctly conveys the semantic structure of the content and
elicits the desired experiential affect in the reader. The semantic structure of the content includes the reading order graph,
the association of illustrations with referring paragraphs, and the preservation of perceived text hierarchies.
We appropriate a popular conceptual tool used by graphic designers called the grid. A well-designed grid will cause a
pleasing uniformity through all the pages of a publication while still allowing flexibility in the layout of each page.
In the space of different automatic layout systems, our approach is somewhere between template-based techniques and
generative techniques, with the aesthetics determined by the combination of the grid and a generative algorithm
One consequence of using the grid is that it greatly reduces the space of possible layouts from a high dimensional
continuous space to a discrete space. Using a simple greedy algorithm, our first results are promising.