Camera-based system designed by Israeli start-up gives visually-impaired ability to ‘read’

OrCam device clips on to the wearer’s glasses with a small magnet and reads aloud the words or object pointed to by the user

The Orcam device clips on to the wearer’s glasses and is designed to both recognise and speak “text in the wild”, a term used to describe newspaper articles as well as bus numbers, and objects as diverse as landmarks, traffic lights and the faces of friends. Photograph: Orcam via The New York Times
The Orcam device clips on to the wearer’s glasses and is designed to both recognise and speak “text in the wild”, a term used to describe newspaper articles as well as bus numbers, and objects as diverse as landmarks, traffic lights and the faces of friends. Photograph: Orcam via The New York Times

Liat Negrin, an Israeli who has been visually impaired since childhood, walked into a grocery store here recently, picked up a can of vegetables and easily read its label using a simple and unobtrusive camera attached to her glasses.

Negrin, who has coloboma, a birth defect that perforates a structure of the eye and afflicts about one in 10,000 people, is an employee at OrCam, an Israeli start-up that has developed a camera-based system intended to give the visually impaired the ability to both “read” easily and move freely.

Until now, reading aids for the visually impaired and the blind have been cumbersome devices that recognise text in restricted environments, or, more recently, have been software applications on smartphones that have limited capabilities.

In contrast, the OrCam device is a small camera worn in the style of Google Glass, connected by a thin cable to a portable computer designed to fit in the wearer's pocket.

READ SOME MORE

The system clips on to the wearer’s glasses with a small magnet and uses a bone-conduction speaker to offer clear speech as it reads aloud the words or object pointed to by the user.

The system is designed to both recognise and speak “text in the wild,” a term used to describe newspaper articles as well as bus numbers, and objects as diverse as landmarks, traffic lights and the faces of friends.

It currently recognises English-language text and beginning this week will be sold through the company's website for $2,500, about the cost of a mid-range hearing aid. It is the only product, so far, of the privately held company, which is part of the high-tech boom in Israel.

The device is quite different from other technology that has been developed to give some vision to people who are blind, such as the artificial retina system called Argus II, made by Second Sight Medical Products. That system, which was approved by the Food and Drug Administration in February, allows visual signals to bypass a damaged retina and be transmitted to the brain.

The OrCam device is also dramatically different from Google Glass, which also offers the wearer a camera but is designed for people with normal vision and has limited visual recognition and local computing power.

OrCam was founded several years ago by Amnon Shashua, a well-known researcher who is a computer science professor at Hebrew University in Jerusalem.

It is based on computer vision algorithms that he has pioneered with another faculty member, Shai Shalev-Shwartz, and one of his former graduate students, Yonatan Wexler.

"What is remarkable is that the device learns from the user to recognise a new product," said Tomaso Poggio, a computer scientist at MIT, who is a computer vision expert and with whom Shashua studied as a graduate student. "This is more complex than it appears and, as an expert, I find it really impressive."

The advance is the result of both rapidly improving computing processing power that can now be carried comfortably in a wearer’s pocket and the computer vision algorithm developed by the scientists.

On a broader technology level, the OrCam system is representative of a wide range of rapid improvements being made in the field of artificial intelligence, in particular with vision systems for manufacturing as well as fields like autonomous motor vehicles. (Shashua previously founded Mobileye, a corporation that supplies camera technology to the automobile industry that can recognise objects such as pedestrians and cyclists and can keep a car in a lane on a freeway.)

Speech recognition is now routinely used by tens of millions of people on both iPhones and Android smartphones. Moreover, natural language processing is making it possible for computer systems to “read” documents, which is having a significant impact in the legal field, among others.

There are now at least six competing approaches in the field of computer vision. For example, researchers at Google and elsewhere have begun using what are known as “deep learning” techniques that attempt to mimic biological vision systems. However, they require vast computing resources for accurate recognition.


Minimum control
In contrast, the OrCam technique, which was described in a technical paper in 2011 by the Hebrew University researchers, offers a reasonable trade-off between recognition accuracy and speed.

The technique, known as Shareboost, is distinguished by the fact that as the number of objects it needs to recognise grows, the system minimises the amount of additional computer power required.

“The challenges are huge,” said Wexler, a co-author of the paper and vice president of research and development at OrCam. “People who have low vision will continue to have low vision, but we want to harness computer science to help them.”

Additionally, the OrCam system is designed to have a minimal control system, or user interface. To recognise an object or text, the wearer simply points at it with his or her finger, and the device then interprets the scene.

The system recognises a pre-stored set of objects and allows the user to add to its library – for example, text on a label or billboard, or a stop light or street sign – by simply waving his or her hand, or the object, in the camera’s field of view.

One of the key challenges, Shashua said, was allowing quick optical character recognition in a variety of lighting conditions as well as on flexible surfaces.

“The professional optical character readers today will work very well when the image is good, but we have additional challenges – we must read text on flexible surfaces like a hand-held newspaper,” he said.


Visual impairment
Although the system is usable by the blind, OrCam is initially planning to sell the device to people in the United States who are visually impaired, which means that their vision cannot be adequately corrected with glasses.

In the US, there are now 21.2 million people above the age of 18 who have some kind of visual impairment, including age-related conditions, diseases and birth defects, according to the 2011 National Health Survey by the US National Center for Health Statistics.

OrCam said that worldwide there are 342 million adults with significant visual impairment, and of them 52 million have middle-class incomes. – (New York Times News Service)