U Michigan's Sirius Sees Things Siri Is Blind to

Virtual intelligent personal assistants (IPA) — think Siri, Cortana or Now — are about to go open source. A team of researchers at the University of Michigan has developed an open standalone speech- and vision-based IPA service similar to what Apple, Microsoft and Google have introduced into their products.

Sirius, as it's named, incorporates speech recognition, image matching, natural language processing and a question-and-answer system. It was developed by the university's Clarity Lab and is the subject of a paper that will be formaly published in a few days.at the 2015 International Conference on Architectural Support for Programming Languages and Operating Systems in Istanbul.

"Now the core technology is out of the bag, and we all have access to it," said Jason Mars, assistant professor of computer science and engineering and co-director of the lab. "Instead of making an app to run on the Apple Watch, for example, maybe I could make my own watch. We're very excited to see what the world comes together to build and learn with Sirius as a starting point."

In its initial release, users can enter their query solely by talking to a device or by using speech and capturing an image to pose a question such as, "When does this place close?" According to the researchers that kind of functionality isn't offered currently in commercial systems.

"What we've done with Sirius is pushed the limits of the traditional intelligent personal assistant," said Johann Hauswald, a doctoral student in the lab. "Not only can you interact with your voice but you can also ask questions about what you're seeing, which is a new way to interact with this type of device."
U Michigan researchers review Sirius. From left, Lingjia Tang, Jason Mars, Johann Hauswald and Yiping Kang. Source: Joseph Xu, Michigan Engineering, Communications & Marketing.

A demo version of the technology is loaded with a static version of Wikipedia, allowing users to ask questions that would be found there. However, the team noted, that knowledge base could be switched out with other types of information. For example, focused assistants could be built for specific fields — cooking, auto repair or health. Right now, the university is working with IBM to create an assistant that would be useful for academic advising.

Sirius was built by "stitching together" several other open source projects that the researchers said tap into techniques and algorithms that resemble those in commercial offerings. Those include:

Aside from interest in the IPA itself, Mars said he anticipates that organizations will use Sirius to study how computing infrastructure will stand up under voice-enabled queries, which are a major part of wearable technologies. The work of converting questions into code that can be understood by the program is done in the cloud. The research team has found that the process can be a hundred times more computationally intensive than a straightforward text query. "We have to think of new ways to redesign our cloud platforms to support this type of workload," Mars said.

About the Author

Dian Schaffhauser is a former senior contributing editor for 1105 Media's education publications THE Journal, Campus Technology and Spaces4Learning.

Featured

  • glowing brain, connected circuits, and abstract representations of a book and graduation cap on a light gray gradient background

    Snowflake Launches Program to Upskill 100,000 People in Data and AI

    Cloud data platform Snowflake is embarking on an effort to train and certify more than 100,000 users on its AI Data Cloud by 2027. The One Million Minds + One Platform program will provide Snowflake-delivered courses, training materials, and free access to Snowflake software, at no cost to learners.

  • two abstract humanoid figures made of interconnected lines and polygons, glowing slightly against a dark gradient background

    Microsoft Introduces Copilot Chat Agents for Education

    Microsoft recently announced Microsoft 365 Copilot Chat, a new pay-as-you-go offering that adds AI agents to its existing free chat tool for Microsoft 365 education customers.

  • hand touching glowing connected dots

    Registration Now Open for Tech Tactics in Education: Thriving in the Age of AI

    Tech Tactics in Education has officially opened registration for its May 7 virtual conference on "Thriving in the Age of AI." The annual event, brought to you by the producers of Campus Technology and THE Journal, offers hands-on learning and interactive discussions on the most critical technology issues and practices across K–12 and higher education.

  • Three cubes of noticeably increasing sizes are arranged in a straight row on a subtle abstract background

    A Sense of Scale

    Gardner Campbell explores the notion of scale in education and shares some of his own experience "playing with scale" — scaling up and/or scaling down — in an English course at VCU.