Analysis of “Perceptually-Supported Image Editing of Text and Graphics”
Comments Made Elsewhere:
Discussion:
Seek to create an scanned document, image editing application, particularly for doodles and sketches. Present new ways for segmenting and selecting elements. One mechanism is a combination rectangle/lasso selection that is inferred from the user’s mouse movement. The selection becomes a new image. All images can be grouped on the parent image, but not in a hierarchy (allows multiple groupings). Subsequent clicks on an image will cycle through the groups.
Also create algorithm for keying out the background automatically. Discussing how confusing it was to “anchor” images when a user wanted to make a subselection.
Uses “perceptual organization” principles for fragmenting into strokes and blogs and then grouping together. Strokes based on curvilinear smootness and closure. Blogs based on spatial proximity and curvilinear alignment to look like words. Uses this for automatic segmentation as invoked by the user.
Are working on an interface that just uses a pen next.
Summary:
There combination rectangle/lasso idea is a good one.
I have no problem with the checkboard pattern to signify transparency. It gives a sense of depth, that you can see past the opaque parts.
I’m also interested in how to seamless switch between editing/drawing pen input. With a explicit modal switching device, all you have is time, gesture, pressure, and angle. No one wants to perform “holds” or start a stroke with a certain segment in order to get to the correct mode. Perhaps by tapping an arrangement of dots? But then why not give them a button?
Good paper, though, and I’m always a fan of cool UI techniques.