MIT Researchers Advance Lecture Capture with Search Capabilities

Researchers in MIT's Computer Science and Artificial Intelligence Laboratory (CSAIL) have developed a new Web-based technology that's designed to take recorded classroom lectures to the next level. The technology, developed by a team led by MIT's Regina Barzilay and James Glass, provides search functionality for classroom video recordings. At present, the prototype only works with MIT's online lectures made available to the public through the university's OpenCourseWare initiative, but it may be made available to other institutions in the future.

The functionality of MIT's new technology shares a little in common with traditional search technology from a user perspective. That is, if you enter in a search term, the system calls up a list of results, along with relevant details, such as the title of the lecture, running time, highlighted search terms within the results, etc. And that's where the similarity ends. From this point, clicking on any word within the results calls up not only the relevant lecture, but takes the user to the precise point in the lecture where that term is used.

The screen shot below shows this, although you'd get a better idea by performing a search yourself.

Click image to enlarge

Along with the running video and audio of the lecture, a transcript of the lecture also appears, which scrolls with the lecture and underlines words in the lecture as they're spoken. The text transcripts are created using speech recognition software with a little manual help from the researchers themselves.

"Our goal is to develop a speech and language technology that will help educators provide structure to these video recordings, so it's easier for students to access the material," said Glass, who is also head of CSAIL's Spoken Language Systems Group, in a release issued this week by MIT's News Office.

The system isn't without its flaws. In the sample above, the phrase spoken by the lecturer "V cosine theta," for example, shows up textually as "v co signed fatal," but, in large part, the text follows the lecture with a fair degree of accuracy. The researchers plug technical terms into the computer to help improve that accuracy, and, according to MIT, the system is getting about 80 percent of the words right.

Following the creation of the transcript, another program breaks up the lecture into chunks of text about 100 words long each, which are "compared with each other using a mathematical formula that calculates the number of overlapping words between the text blocks," according to MIT. "Each word is weighted so that repetition of key terms has more weight than less important words, and chunks with the most similar words are grouped into sections."

The researchers said they hope to add new functionality to the system in the future, including lecture summarization and collaboration features that would allow users to make corrections to the transcripts and also add lecture notes.

Read More:

About the Author

David Nagel is the former editorial director of 1105 Media's Education Group and editor-in-chief of THE Journal, STEAM Universe, and Spaces4Learning. A 30-year publishing veteran, Nagel has led or contributed to dozens of technology, art, marketing, media, and business publications.

He can be reached at [email protected]. You can also connect with him on LinkedIn at https://www.linkedin.com/in/davidrnagel/ .


Featured

  • white desk with an open digital tablet showing AI-related icons like gears and neural networks

    Elon University and AAC&U Release Student Guide to AI

    A new publication from Elon University 's Imagining the Digital Future Center and the American Association of Colleges and Universities offers students key principles for navigating college in the age of artificial intelligence.

  • glowing blue nodes connected by thin lines in an abstract network on a dark gray to black gradient background

    Report: Generative AI Taking Over SD-WAN Management

    In a few years, nearly three quarters of network operators will use generative AI for SD-WAN management, according to a new report from research firm Gartner.

  • landscape photo with an AI rubber stamp on top

    California AI Watermarking Bill Garners OpenAI Support

    ChatGPT creator OpenAI is backing a California bill that would require tech companies to label AI-generated content in the form of a digital "watermark." The proposed legislation, known as the "California Digital Content Provenance Standards" (AB 3211), aims to ensure transparency in digital media by identifying content created through artificial intelligence. This requirement would apply to a broad range of AI-generated material, from harmless memes to deepfakes that could be used to spread misinformation about political candidates.

  • file folders floating in the clouds, with glowing AI circuitry and data lines intertwined

    OneDrive Update Adds AI Agents, Copilot Interactions

    Microsoft has announced new enterprise capabilities in its OneDrive cloud storage service, many of which leverage the company's Copilot AI technologies.