Reading Coreproperties Keywords From Docx File With Python-docx

From the script here I see how to set document keywords with the coreproperties function of python-docx. I want to look at the keywords already in a document written by someone el

Solution 1:

The python-docx library doesn't have support for core properties as of v0.5.0. But as it happens, that should be relatively easy to remedy.

The python-pptx sister project has support for core properties, as explained here: http://python-pptx.readthedocs.org/en/latest/api/presentation.html#coreproperties-objects

Since the two projects are based on the same architecture, that code should be reusable essentially as-is. It turns out the core-properties bits are common to the Open Packaging Convention, which is the same for all three of the MS Office XML file formats.

If you'll add an issue on the GitHub issue tracker I'll see how soon we can get to it. https://github.com/python-openxml/python-docx/issues
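
In the meantime, since the core properties live in the package itself rather than in anything Word-specific, a rough workaround is to read them straight out of the .docx zip with the standard library. This is only a sketch under some assumptions: it assumes the core-properties part is stored at docProps/core.xml (the conventional location, but a package is free to put it elsewhere), and `read_keywords` and `part_name` are names invented for this example, not part of python-docx.

```python
import zipfile
import xml.etree.ElementTree as ET

# XML namespace used by the OPC core-properties part (the "cp:" prefix).
CP_NS = "http://schemas.openxmlformats.org/package/2006/metadata/core-properties"


def read_keywords(path_or_file, part_name="docProps/core.xml"):
    """Return the keywords string from a package's core properties, or None.

    `part_name` is an assumption: docProps/core.xml is the conventional
    location, but it is not guaranteed for every package.
    """
    with zipfile.ZipFile(path_or_file) as package:
        try:
            xml_bytes = package.read(part_name)
        except KeyError:
            # The package has no part at the assumed location.
            return None

    root = ET.fromstring(xml_bytes)
    element = root.find("{%s}keywords" % CP_NS)
    return element.text if element is not None else None
```

Calling `read_keywords("report.docx")` (the filename is hypothetical) returns whatever is in the keywords field as a single string, or `None` if the part or the element is missing. Splitting that string into individual keywords is left to the caller, since the separator depends on how the author typed them.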
