ARTIFICIAL INTELLIGENCE
Teaching a Robot to Read
By: COLLEEN McCRETTON | AUGUST 18, 2021
Over the last several years, one of the FCAT AI teams - code-named “RoboReader” - has been working on document processing: extracting needed information from unstructured text and transforming it into structured data that the business can use. In the course of this work, we have noticed parallels between the way we are “teaching” the system and the way we read as humans.

When reading for work, most of us skim or scan the contents looking for words, phrases, or formatting that provide clues that something might be important to us. Information Foraging Theory,1 a concept that emerged in 1993 and compares the behavior of humans looking for information to that of animals looking for food, offers two reasons for this: (a) we want to maximize our reward (in the form of information or food) relative to our effort, and (b) as a result, we have developed learned behaviors that help us find what we are looking for quickly when reading for informational purposes.2 When we skim, our goal is to get the general gist of the information we seek, often focusing on indexes or tables of contents, titles, subtitles and headings, bulleted lists, bold or underlined words, tables, charts, and pictures. We also scan to find specific information, e.g., looking for specific words or phrases, ordering, or formatting on a page.3

In our project work, we found evidence of our business users employing these methods. In one use case, users always flipped to the last few pages of a document for the information they needed. In another, the important information was always in a bulleted list, and in yet another it was always in a table.

We used these observed behaviors when training our AI models. When teaching the system to process tabular data, we used image processing techniques to “visually” scan for lines indicative of a table. When teaching it to look for requests, which usually arrive as bulleted lists, we interpreted the formatting metadata that indicates such lists. We taught the system to recognize key:value pairs based on location and formatting cues, and to find monetary amounts, dates, addresses, and ID numbers the same way. Within paragraphs, we used leading and trailing language markers and letter case to teach it to identify names of people and companies and other specific relevant terms.
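To make the idea concrete, the sketch below shows one simplified way such formatting and location cues could be expressed in code. This is an illustrative Python example, not the RoboReader implementation; the regular expressions and the extract_cues helper are assumptions chosen for demonstration, and a production system would rely on trained models and far more robust patterns.

```python
import re

# Illustrative only: naive pattern-based stand-ins for the kinds of
# formatting cues described above (monetary amounts, dates, key:value
# pairs, and capitalized name candidates). These patterns are assumptions
# for demonstration, not the RoboReader team's actual rules.
MONEY_RE = re.compile(r"\$\s?\d[\d,]*(?:\.\d{2})?")         # e.g. $1,250.00
DATE_RE = re.compile(r"\b\d{1,2}/\d{1,2}/\d{2,4}\b")         # e.g. 8/18/2021
KEY_VALUE_RE = re.compile(r"^\s*([A-Z][\w /]+?):\s*(.+)$")   # e.g. "Account: 12345"
NAME_RE = re.compile(r"\b(?:[A-Z][a-z]+\s+){1,3}[A-Z][a-z]+\b")  # adjacent capitalized words


def extract_cues(text: str) -> dict:
    """Scan text line by line, much as a human scans a page for formatting cues."""
    results = {"amounts": [], "dates": [], "pairs": {}, "name_candidates": []}
    for line in text.splitlines():
        results["amounts"].extend(MONEY_RE.findall(line))
        results["dates"].extend(DATE_RE.findall(line))
        kv = KEY_VALUE_RE.match(line)
        if kv:
            # A line shaped like "Label: value" is treated as a key:value pair.
            results["pairs"][kv.group(1).strip()] = kv.group(2).strip()
        # Runs of capitalized words are only candidates; a real system would
        # use surrounding language markers to confirm they are names.
        results["name_candidates"].extend(NAME_RE.findall(line))
    return results


if __name__ == "__main__":
    sample = (
        "Requested By: Jane Smith\n"
        "Amount: $1,250.00 due 8/18/2021\n"
        "Please transfer the funds to Acme Holdings."
    )
    print(extract_cues(sample))
```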

Yes, it is possible to teach a robot to read. It starts with understanding how we humans learn to read and transferring those same skills and techniques to our robot assistants. All of which perfectly illustrates the fact that, for humans and robots to be successful in their work, reading is fundamental.

Colleen McCretton is Director, User Experience Design, in FCAT
