Microsoft Steers OOXML into Apache POI Project

Microsoft this week disclosed some collaborative efforts that will help Java applications read Microsoft Office file formats. The company is working with Sourcesense, a European open source systems integrator, on the Apache POI project.

The project was established by the open source Apache Software Foundation to create Java libraries that can read the various binary file formats used in Microsoft Office applications. Microsoft and Sourcesense are "contributing to a new version of Apache POI," according to an announcement issued by Microsoft.

In addition, Microsoft donated code to the Apache Software Foundation, according to Sam Ramji, Microsoft's senior director of platform technology strategy -- although exactly what code was donated wasn't specified in Microsoft's news release.

The news comes not long after another open source milestone for Microsoft, of sorts, when Ramji announced Microsoft's work with the Eclipse Foundation to enable Eclipse tools support for Microsoft's Windows Presentation Foundation.

Microsoft's collaboration with Sourcesense aims to get Apache POI project support for the Ecma Office Open XML File Formats, which is the Ecma designation for the file formats used in Microsoft Office 2007 applications, such as Excel, PowerPoint and Word. The standard, also known as Office Open XML (OOXML), has been ratified as Ecma 376, but it was voted down as an ISO/IEC open international standard in September of last year.

The Apache POI project is working on application programming interfaces (APIs) that can work with Microsoft's OLE2 Compound Document formats (.doc, .ppt and .xls).
"POI" apparently stands for "Poor Obfuscation Implementation," according to a Wikipedia definition, because the Microsoft Office file formats were deemed to be "obfuscated" but not enough to prevent the reverse engineering of them.

The Apache Foundation plans to release Apache POI support for OOXML sometime during the second quarter of 2008, according to Microsoft's release.

The OOXML file format standard is not the only one out there. The Open Document Format, originally developed by Sun Microsystems, has already been published as an open ISO/IEC international standard. ODF is used in alternative office productivity suites such as OpenOffice.org.

Some in the open source communities cried foul as Microsoft's OOXML was fast-tracked in ISO/IEC. However, Microsoft has slowly been gaining new momentum. The U.S. government signaled its support for modified OOXML standards. Microsoft Office Program Manager Brian Jones points to recent "Yes" decisions for OOXML as an ISO standard by the Czech Republic and Germany in his blog.

Final ISO/IEC international votes on the proposed OOXML standard will be tallied on March 29.

About the Author

Kurt Mackie is online news editor, Enterprise Group, at 1105 Media Inc.

Featured

  • two large brackets facing each other with various arrows, circles, and rectangles flowing between them

    1EdTech Partners with DXtera to Support Ed Tech Interoperability

    1EdTech Consortium and DXtera Institute have announced a partnership aimed at improving access to learning data in postsecondary and higher education.

  • Abstract geometric shapes including hexagons, circles, and triangles in blue, silver, and white

    Google Launches Its Most Advanced AI Model Yet

    Google has introduced Gemini 2.5 Pro Experimental, a new artificial intelligence model designed to reason through problems before delivering answers, a shift that marks a major leap in AI capability, according to the company.

  •  laptop on a clean desk with digital padlock icon on the screen

    Study: Data Privacy a Top Concern as Orgs Scale Up AI Agents

    As organizations race to integrate AI agents into their cloud operations and business workflows, they face a crucial reality: while enthusiasm is high, major adoption barriers remain, according to a new Cloudera report. Chief among them is the challenge of safeguarding sensitive data.

  • stylized AI code and a neural network symbol, paired with glitching code and a red warning triangle

    New Anthropic AI Models Demonstrate Coding Prowess, Behavior Risks

    Anthropic has released Claude Opus 4 and Claude Sonnet 4, its most advanced artificial intelligence models to date, boasting a significant leap in autonomous coding capabilities while simultaneously revealing troubling tendencies toward self-preservation that include attempted blackmail.