U Michigan's Sirius Sees Things Siri Is Blind to

Virtual intelligent personal assistants (IPA) — think Siri, Cortana or Now — are about to go open source. A team of researchers at the University of Michigan has developed an open standalone speech- and vision-based IPA service similar to what Apple, Microsoft and Google have introduced into their products.

Sirius, as it's named, incorporates speech recognition, image matching, natural language processing and a question-and-answer system. It was developed by the university's Clarity Lab and is the subject of a paper that will be formaly published in a few days.at the 2015 International Conference on Architectural Support for Programming Languages and Operating Systems in Istanbul.

"Now the core technology is out of the bag, and we all have access to it," said Jason Mars, assistant professor of computer science and engineering and co-director of the lab. "Instead of making an app to run on the Apple Watch, for example, maybe I could make my own watch. We're very excited to see what the world comes together to build and learn with Sirius as a starting point."

In its initial release, users can enter their query solely by talking to a device or by using speech and capturing an image to pose a question such as, "When does this place close?" According to the researchers that kind of functionality isn't offered currently in commercial systems.

"What we've done with Sirius is pushed the limits of the traditional intelligent personal assistant," said Johann Hauswald, a doctoral student in the lab. "Not only can you interact with your voice but you can also ask questions about what you're seeing, which is a new way to interact with this type of device."
U Michigan researchers review Sirius. From left, Lingjia Tang, Jason Mars, Johann Hauswald and Yiping Kang. Source: Joseph Xu, Michigan Engineering, Communications & Marketing.

A demo version of the technology is loaded with a static version of Wikipedia, allowing users to ask questions that would be found there. However, the team noted, that knowledge base could be switched out with other types of information. For example, focused assistants could be built for specific fields — cooking, auto repair or health. Right now, the university is working with IBM to create an assistant that would be useful for academic advising.

Sirius was built by "stitching together" several other open source projects that the researchers said tap into techniques and algorithms that resemble those in commercial offerings. Those include:

Aside from interest in the IPA itself, Mars said he anticipates that organizations will use Sirius to study how computing infrastructure will stand up under voice-enabled queries, which are a major part of wearable technologies. The work of converting questions into code that can be understood by the program is done in the cloud. The research team has found that the process can be a hundred times more computationally intensive than a straightforward text query. "We have to think of new ways to redesign our cloud platforms to support this type of workload," Mars said.

About the Author

Dian Schaffhauser is a former senior contributing editor for 1105 Media's education publications THE Journal, Campus Technology and Spaces4Learning.

Featured

  • glowing digital brain-shaped neural network surrounded by charts, graphs, and data visualizations

    Google Releases Advanced AI Model for Complex Reasoning Tasks

    Google has released Gemini 2.5 Deep Think, an advanced artificial intelligence model designed for complex reasoning tasks.

  • abstract pattern of cybersecurity, ai and cloud imagery

    OpenAI Report Identifies Malicious Use of AI in Cloud-Based Cyber Threats

    A report from OpenAI identifies the misuse of artificial intelligence in cybercrime, social engineering, and influence operations, particularly those targeting or operating through cloud infrastructure. In "Disrupting Malicious Uses of AI: June 2025," the company outlines how threat actors are weaponizing large language models for malicious ends — and how OpenAI is pushing back.

  • cybersecurity book with a shield and padlock

    NIST Proposes New Cybersecurity Guidelines for AI Systems

    The National Institute of Standards and Technology has unveiled plans to issue a new set of cybersecurity guidelines aimed at safeguarding artificial intelligence systems, citing rising concerns over risks tied to generative models, predictive analytics, and autonomous agents.

  • magnifying glass highlighting a human profile silhouette, set over a collage of framed icons including landscapes, charts, and education symbols

    AWS, DeepBrain AI Launch AI-Generated Multimedia Content Detector

    Amazon Web Services (AWS) and DeepBrain AI have introduced AI Detector, an enterprise-grade solution designed to identify and manage AI-generated content across multiple media types. The collaboration targets organizations in government, finance, media, law, and education sectors that need to validate content authenticity at scale.