Applying computational tools and theories to vision

Seeing is such a big part of everyday life that about half the brain is involved in the process. Shockingly, however, the part of vision with the highest fidelity occupies only 1% of our visual field. The remaining 99% falls on our peripheral vision. Our brain must use sophisticated “algorithms” to piece together vision despite not having all the information needed. Dr. Ruth Rosenholtz, of Massachusetts Institute of Technology, is fundamentally rethinking vision from the ground up, with the goal of gaining a better understanding of both underlying mechanisms and the strengths and limitations of our visual processing. For most of us, vision is a big determinant of how well we do a wide range of daily tasks; it affects our quality of life and ability to be independent. Dr. Rosenholtz’s research may eventually help better understand disorders such as dyslexia, age-related macular degeneration, and cognitive impairments as we age. In addition, her basic research aids in increasing our knowledge of how normal vision works, and therefore may also lead to the better design of tools for those with normal vision and cognition, such as maps, web pages, cell phones, and GPS systems.

While in the last decades neuroscientists and psychologists have learned more about vision than about many other neural processes, the field has still been stuck due to a lack of computational tools and theories. Dr. Rosenholtz’s commitment to computational thinking helps to answer many of the questions that remain unanswered. She and her team use state-of-the-art computational techniques from artificial intelligence, computer vision, machine learning, and image processing to understand human vision. This movement towards computational tools has allowed for the development of new models that explain vision and the brain in ways that are both precise and testable. Thus, Dr. Rosenholtz is pushing the fields of neuroscience and psychology past old models of the brain and towards novel and exciting research that makes clear predictions and true progress.

Current research includes:

  • Peripheral Vision: While peripheral vision is worse than central vision, its capabilities are incredibly important determinants of how well we can perform visual tasks. Peripheral vision seems to encode its inputs in a way that makes it impossible to perfectly reconstruct the information coming into the eye. Dr. Rosenholtz is working to makes sense of the implications of this encoding and the loss of information for real-world visual tasks like navigating, recognizing our surroundings, reading, or using a map.

  • Paying Attention: What does it mean to “pay attention”? Dr. Rosenholtz is studying the impact of paying attention on performance of visual tasks. What tasks require you to pay attention, and what are easy to do even when distracted?

  • Decision-Making: Dr. Rosenholtz is interested in elucidating a new understanding from computer and human vision on how the brain might represent the scene in front of it. She and her team are using this understanding to shed light on a diversity of cognitive processes, such as visual memory and decision-making.

At just ten years old, Dr. Rosenholtz found herself fascinated by the vision and illusions chapter of a used Psychology textbook. After attending a talk about optical image processing to improve the quality of images from outer space, she was hooked on the problems of processing and understanding images. For many years, she worked on computer vision and image processing -- trying to get computers to see as well as we can, or maximizing the quality of images while minimizing their size on the computer or the bandwidth required to transmit them. However, she was torn between being an engineer and doing more basic science. Over time, Dr. Rosenholtz came to realize that she was more interested in the science; in solving the puzzles of how human vision works, and in building predictive models to capture that understanding in a concise way. Nonetheless, her time designing computer vision systems and understanding what it takes for a computer to efficiently encode real world images, has been invaluable in how she thinks about human vision, which likely operates on related computational principles.

Aside from research, in her free time, Dr. Rosenholtz enjoys reading, cooking, and spending time with her children. In fact, recently she and her children wired electrical wands inspired by Harry Potter!   

Website: http://persci.mit.edu/mongrels/index.html

 

University of California at Berkeley Eliahu Jury Award for excellence in control, systems, and signal processing research, 1995

Methods and systems for generating enhanced thumbnails

R. Rosenholtz, A. Woodruff, & A. Faulring, U.S. patent #7,069,506 (2006).

Methods and systems for document navigation using enhanced thumbnails

R. Rosenholtz, A. Woodruff, & A. Faulring, U.S. patent #6,993,726 (2006).

Methods and systems for generating enhanced thumbnails usable for document navigation

R. Rosenholtz, A. Woodruff, & A. Faulring, U.S. patent #6,883,138 (2005).

Systems and method for automatically choosing visual characteristics to highlight a target against a background

R. Rosenholtz, U.S. patent #7,130,461 (2006).

Methods and systems for transitioning between thumbnails and documents based upon thumbnail appearance

R. Rosenholtz, A. Woodruff, & A. Faulring, U.S. patent #7,337,396 (2008)

Visual identifiers for digital data

J. P. Lewis, R. Rosenholtz, N. Fong, & U. Neumann. Pending