Project Title: Image-Based Information Access and Organization

Project Award Number: ITR-011389


Principal Investigator:

James D. Hollan
Distributed Cognition and HCI Laboratory
Department of Cognitive Science
University of California, San Diego
La Jolla, CA 92093-0515
Phone: 858 534-8156 Fax: 858 822-2476
Email: hollan@cogsci.ucsd.edu
URL: http://hci.ucsd.edu/hollan

Collaborators:

Dan Bauer (Graduate Student)
Department of Cognitive Science
University of California, San Diego
La Jolla, CA 92093-0515
Email: dsbauer@cogsci.ucsd.edu

Pierre Fastrez (Fulbright Postdoctoral Fellow 2002-2003)
Département de communication
Université catholique de Louvain
Ruelle de la Lanterne Magique, 14
1348 Louvain-la-Neuve
Belgique
Email: fastrez@reco.ucl.ac.be

Bill Griswold (Professor)
Department of Computer Science & Engineering
University of California, San Diego
La Jolla, CA 92093-0114
Email: wgg@cs.ucsd.edu

Ryan Sit (Graduate Student)
Department of Computer Science & Engineering
University of California, San Diego
La Jolla, CA 92093-0114
Email: rsit@cs.ucsd.edu

Ron Stanonik
Department of Cognitive Science
University of California, San Diego
La Jolla, CA 92093
Email: stanonik@cogsci.ucsd.edu

Keywords: cognitive ethnography, digital photo organization, Dynapad, graphical interfaces, image browsers, information visualization, interface physics, multiscale interfaces, personal information collections, PDF paper organization, piles

Project Summary

There is an astonishing amount of information on the web and it is constantly increasing. To avoid being overwhelmed by the volume of information available and confused by its uneven quality, people need assistance in efficiently finding task-relevant information and in effectively managing complex dynamic information collections.
Current interfaces primarily employ textual representations for accessing and organizing personal information collections. Access is either via taxonomies or queries to search engines and results are typically organized as lists or hierarchies of web page titles. Given the ability of images to assist memory and the common exploitation of space in everyday problem solving to simplify choice, perception, and mental computation, it is surprising that so little use is made of images and spatial organization to aid information access and organization. In this project we are examining how spatial and temporal organization of images can serve as effective interface components for the design of personal information environments. We have developed a flexible multiscale software system (Dynapad) and an extensible set of region tools to provide various forms of subtask-specific support in local regions of collection-management workspaces. Our software development efforts are driven by ethnographic and experimental studies of spatial and temporal strategies for image-based access and organization of information.

Publications and Products

Dan Bauer, Personal Information Geographies, Proceedings of ACM Conference on Human Factors in Computing Systems, Extended Abstracts, CHI2002, 538-539, 2002.

Dan Bauer, A Multiscale Workspace for Managing and Exploring Personal Digital Libraries, Proceedings of the ACM Symposium on User Interface Software and Technology, UIST’03: Doctoral Symposium, Vancouver, British Columbia, Canada, November 2003.

Daniel Bauer and James D. Hollan, IRYS: A Visualization Tool for Temporal Analysis of Multimodal Interactions, Proceedings of the ACM 5th International Conference on Multimodal Interfaces, Vancouver, British Columbia, Canada, 285-288, 2003.

Daniel Bauer, Pierre Fastrez, and Jim Hollan, Computationally-Enriched “Piles” for Managing Digital Photo Collections, Proceedings of Visual Languages and Human-Centric Computing, Rome, Italy, September 26-29, 2004.

Dan Bauer, Pierre Fastrez, and Jim Hollan, Spatial Tools for Managing Personal Information Collections, Proceedings of the Hawaii International Conference on System Science, Hawaii, January 3-6, 2005.

Ryan Y. Sit, James D. Hollan, and William G. Griswold, Digital Photos as Conversational Anchors, Proceedings of the Hawaii International Conference on System Science, Hawaii, January 3-6, 2005.

Project Impact

Design and Implementation of Dynapad Multiscale Interface and Visualization Software Infrastructure
Dynapad is the third generation of our multiscale interface and visualization software. The Dynapad software infrastructure supports our exploration of spatial tools to assist with managing personal information collections.

One impact of Dynapad is to allow us to begin to generalize a notion of spatially-located physics — to develop an infrastructure of regional tools whose physics both automate the creation of microstructure and guide the management of macrostructure at multiple scales, to create variegated, interactive, task-specific workspaces. But we also realize that no software, however clever, will be able to anticipate the full variety of arrangements and strategies people employ in such reflective and opportunistic interactions. A tool to support, rather than dominate, such activity must afford the user both the authority to override its initiatives and the expressiveness to employ a variety of strategies. The unlocking and reediting of a portrait-collage as discussed below is one example. In short, any automation or physics must be adequately humble.

Region Tools for Managing Personal Information Collections
Our primary research objective is to explore and generalize the notion of a “pile” as the foundation for a versatile suite of region tools that provide unobtrusive assistance for organizational and other sensemaking activities. Our multiscale piles are provided with a cognitively convivial physics that lets them self-adjust their size as elements are added or deleted, lets one zoom naturally to "ideal" levels to interact with individual members or specific piles, and assists navigation in rich multiscale workspaces. While this and other structure-preserving affordances are useful, the most important impact of our effort has been the development of proactive behaviors to assist the user in creating meaningful structure. We have developed a range of tools. For example, a collection of digital photos can be organized along a timeline to show when they were taken, and a collection of iconic representations of PDF files can be organized in terms of the dates the PDFs were created or the dates they entered one's personal collection of papers. We also support linked brushing not just between instances of the same object (to help see that the same object appears in multiple piles), but also between objects related in various ways: for example, files in the same directory, photos taken the same day, papers by the same author, and citation relations. Our software architecture allows any source of metadata to be represented in this way.
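As a rough illustration of linked brushing over arbitrary metadata, the following Python sketch indexes items by their metadata values and, for a brushed item, returns the other items that share a value under each key. It is a minimal sketch under stated assumptions, not Dynapad code: the function names, the dictionary layout, and the metadata keys are all hypothetical.

```python
from collections import defaultdict


def _as_values(value):
    """Treat list-like metadata (e.g. several authors) as several values."""
    return value if isinstance(value, (list, tuple, set)) else [value]


def build_index(items, keys):
    """Map each (key, value) pair to the set of item ids that carry it.

    `items` is a hypothetical dict: item id -> metadata dict.
    """
    index = defaultdict(set)
    for item_id, meta in items.items():
        for key in keys:
            if meta.get(key) is None:
                continue
            for v in _as_values(meta[key]):
                index[(key, v)].add(item_id)
    return index


def brush(item_id, items, index, keys):
    """Return {key: ids of other items sharing a value for that key}."""
    related = {}
    meta = items[item_id]
    for key in keys:
        if meta.get(key) is None:
            continue
        hits = set()
        for v in _as_values(meta[key]):
            hits |= index.get((key, v), set())
        hits.discard(item_id)  # an item is not "related" to itself
        if hits:
            related[key] = hits
    return related
```

Because the index is keyed on (key, value) pairs, adding a new metadata source in this sketch only means adding another key to `keys`; a workspace would then highlight every item id returned for the brushed item.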

Dynamic Multiscale Iconic Representations
Our research emphasizes the value of visual access to information. One content type, photographs, is already visual and is represented in a Dynapad workspace as thumbnails. Other types of content, however, are more challenging to convert to a graphical form. We have adapted Dynapad to support graphical representations of PDF documents, typically focusing on collections of research papers downloaded from the web. Because images from these papers can be effective retrieval cues, we extract and collage them into “portraits” or “enriched thumbnails” of the documents. One of the figures below shows sample portraits of a paper and many of its references. In Dynapad, thumbnails are automatically replaced by high-resolution versions when users zoom into them. In addition, other applications can be accessed via the images (e.g., image editing programs for digital photos and PDF viewers for files associated with the portraits).

Currently, the algorithm we use to generate document portraits is relatively simple: we automatically extract all component images, sort them by file size (which reflects both image size and complexity, and therefore salience), and arrange the top few over a background image of the document’s cover page. We are exploring more sophisticated strategies as well. Of course, no algorithm can always guess correctly what will be the most effective portrait for a given paper; Dynapad uses an evolving set of heuristics to make a best guess, but lets the user edit any portrait-collage. When a portrait is unlocked for editing, all component images may be moved and resized (although they are forced to stay contiguous with the background image). When locked, the collage is cropped to the boundaries of the background, and unused images are “stored” out of sight and may be accessed later if the collage is again unlocked. We are experimenting with including customizable text fragments in the portraits, for example paper title words, keywords, or author names that can be automatically extracted from PDFs. In addition, we are exploring dynamic portraits, whose appearance changes in different contexts (e.g., showing a slideshow of the contained images as the mouse hovers over a portrait) or at different viewing scales (e.g., using “semantic zooming” to show only the largest image when zoomed out or a representative sampling when zoomed in). Finally, in new work to integrate Dynapad with the Stanford Diver system for annotating audio-video source material, we have prototyped portrait-collages for video clips in which heuristically chosen keyframes serve roles similar to images from a PDF file.
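The selection step of the portrait heuristic, and one possible reading of semantic zooming, can be sketched in Python as follows. This is an illustrative sketch only: image extraction and collage layout are omitted, and the `ExtractedImage` type, the function names, the `top_n` default, and the zoom threshold are assumptions rather than Dynapad's actual interface or parameters.

```python
from dataclasses import dataclass


@dataclass
class ExtractedImage:
    """Hypothetical record for one image pulled out of a PDF."""
    name: str
    file_size: int  # bytes; used as a proxy for salience


def choose_portrait_images(images, top_n=3):
    """Return the top_n images by file size, largest first."""
    return sorted(images, key=lambda im: im.file_size, reverse=True)[:top_n]


def images_for_scale(portrait_images, zoom, threshold=1.0):
    """Semantic zoom sketch over a list already sorted largest first.

    Below the (assumed) threshold only the largest image is shown;
    at or above it, the whole selection is shown.
    """
    if not portrait_images:
        return []
    if zoom < threshold:
        return portrait_images[:1]
    return list(portrait_images)
```

In this sketch the images not returned by `choose_portrait_images` would correspond to the unused images that are stored out of sight until the collage is unlocked.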

Digital Photos as Conversational Anchors
During shared viewing, printed photos frequently serve as anchors for conversations. Conversational interactions are less common when images are not printed. People do not commonly gather around a monitor to look at digital images, although the passing around of cell phones and digital cameras displaying images is increasingly common. The congeniality of paper has many advantages for synchronous sharing. Although both printed and digital images can be shared asynchronously, the effort required frequently precludes such sharing. JussPress (www.jusspress.com) is a system designed to minimize the effort involved in sharing digital photos. It enables instant publishing and organization. Automatic temporal organization of the photos is shown to have many advantages. Additionally, by making comments easy to make, read, and check, web-based photos also became anchors for conversations. We have examined the coherence of these conversations, instances of them spanning multiple photos, their organizational impact, and why conversations were infrequent for some users.

Goals, Objectives and Targeted Activities

People spend substantial time maintaining personal collections of varied types of digital information: photos, video clips, web bookmarks, email archives, professional documents, and other files. Our on-going research goal is both to understand the cognitive strategies people use in managing such collections in a visual workspace, and to build a versatile infrastructure of tools to support those strategies. Although diverse content types are often supported by different applications, our premise is that the same basic cognitive strategies likely underlie the activities of exploring and organizing any collection. This may be why spatial arrangements of elements in piles have proven to be such a natural and effective mechanism for managing physical desktops. Considerable research has explored how people make use of space to organize information. Likewise, the utility of image-based and time-based workspaces has drawn increasing interest. Our work continues these themes. When using spatial workspaces, people typically allocate regions of space to play specialized roles within broader activities. For example, piles can be used to categorize items or to function as reservoirs of items yet to be examined. Our work shows that the affordances of physical piles can be dissociated and selectively engineered in digital environments.

While our research explores spatial tools for managing many types of digital information, our objective in this project is to explore tools to assist in managing personal collections of digital photos and iconic representations of PDF documents such as journal articles. A central targeted activity is to demonstrate how a generalization of the “pile” metaphor can serve as a foundation for new types of tools to enrich regions of a collection-management workspace with local task-specific behaviors. Throughout our research effort we have drawn on ethnographic observations of people using developing versions of our tools to organize their own collections of digital photos and research documents.

Area Background

Considerable research has explored how people make use of space to organize information [2, 6, 7, 8]. Likewise, the utility of image-based [1, 4, 5, 10, 11] and time-based [3, 9] workspaces has drawn increasing interest. Our work continues these themes.

Area References and Related Projects

  1. B. B. Bederson. Photomesa: a zoomable image browser using quantum treemaps and bubblemaps. In Proceedings of ACM CHI Conference on Human Factors in Computing Systems, pages 71–80, 2001.
  2. B. B. Bederson, J. D. Hollan, K. Perlin, J. Meyer, D. Bacon, and G. Furnas. Pad++: A zoomable graphical sketchpad for exploring alternate interface physics. Journal of Visual Languages and Computing, 7:3–31, 1996.
  3. S. Fertig, E. Freeman, and D. Gelernter. Lifestreams: an alternative to the desktop metaphor. In Proceedings of ACM SIGCHI Conference, 410–411, 1996.
  4. K. D. Grant, A. Graham, T. Nguyen, A. Paepcke, and T. Winograd. Beyond the shoe box: Foundations for flexibly organizing photographs on a computer. Technical report, Computer Science Department, Stanford University, 2003.
  5. H. Kang and B. Shneiderman. Visualizing methods for personal photocollections: Browsing and searching in the photofinder. In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME2000), 1539–1542, 2000.
  6. D. Kirsh. The intelligent use of space. Artificial Intelligence, 73(1-2):31–68, 1995.
  7. R. Mander, G. Salomon, and Y. Y. Wong. A “pile” metaphor for supporting casual organization of information. In Proceedings of the SIGCHI conference on Human factors in computing systems, 627–634. ACM Press, 1992.
  8. K. Nakakoji, Y. Yamamoto, S. Takada, and B. Reeves. Two dimensional spatial positioning as a means of reflection in design. In Proceedings of ACM DIS’00, pages 145–154, 2000.
  9. J. Rekimoto. Timescape: A time-machine for the desktop environment. In Proceedings of ACM CHI’99 Extended Abstracts, 180–181, 1999.
  10. G. Robertson, M. Czerwinski, K. Larson, D. C. Robbins, D. Thiel, and M. van Dantzich. Data mountain: Using spatial memory for document management. In Proceedings of ACM UIST’98, pages 153–162, 1998.
  11. K. Rodden and K. Wood. How do people manage their digital photographs? In Proceedings of ACM CHI’03, 409–416, April 2003.

Project Websites

Image-Based Information Access Project Website

This web site provides a general overview of the project, selected publications, and a set of images.

Illustrations

Video Example. Dynapad Piles of Digital Photos.
This video shows brief examples of piles and lenses for managing digital photo collections.

Illustration 1. A Dynapad Pile Example.
The image in the center is an iconic representation of one of the papers describing the project. The images around it are icons of the papers it references. Clicking on one of these selects the PDF document from which it was generated.

Illustration 2. Editing a Portrait-Collage of a PDF Document.
Left: An initial locked portrait. Center: Portrait is unlocked for editing, allowing positioning of hidden images in a new arrangement. Right: New arrangement is cropped and locked.

Illustration 3. A Dynapad Timeline Lens.
A timeline lens is positioned over document piles in a Dynapad workspace. The lens provides a temporary chronological ordering (by either PDF-creation date or date of entry into the file system) of the documents. Year and month are indicated along the bottom of the lens.
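The chronological ordering such a lens provides can be approximated by sorting on one of the two date fields and bucketing by year and month. The Python sketch below assumes a hypothetical document record (a dict with `name`, `created`, and `added` entries); it is not the Dynapad lens implementation and leaves the underlying collection untouched, in keeping with the temporary nature of the lens.

```python
from datetime import date
from itertools import groupby


def timeline_order(documents, date_field="created"):
    """Return [((year, month), [names...]), ...] in chronological order.

    `documents` is a list of dicts; `date_field` selects which date to use
    (here "created" or "added", both hypothetical field names). Documents
    lacking that date are skipped. The input list is not modified.
    """
    dated = [d for d in documents if d.get(date_field) is not None]
    dated.sort(key=lambda d: d[date_field])

    def month_of(d):
        return (d[date_field].year, d[date_field].month)

    return [(ym, [d["name"] for d in group])
            for ym, group in groupby(dated, key=month_of)]
```

Switching `date_field` between the two dates corresponds to the two orderings the caption mentions: by PDF-creation date or by date of entry into the file system.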

Online Software

We are in the process of making the Dynapad software available for other researchers. It will be announced on this website when it is available. Feel free to contact hollan@cogsci.ucsd.edu for information about the current status of the software.