Reading Coreproperties Keywords From Docx File With Python-docx
From the script here I see how to set document keywords with the coreproperties function of python-docx. I want to look at the keywords already in a document written by someone el
Solution 1:
The python-docx library doesn't have support for core properties as of v0.5.0. But as it happens, that should be relatively easy to remedy.
The python-pptx sister project has support for core properties, as explained here:
http://python-pptx.readthedocs.org/en/latest/api/presentation.html#coreproperties-objects
Since the two projects are based on the same architecture, that code should be reusable essentially as-is. It turns out the core-properties bits are common to the Open Packaging Convention, which is the same for all three of the MS Office XML file formats.
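In the meantime, because the core properties live in an ordinary XML part inside the package, the keywords can be read with nothing but the standard library. The following is a minimal sketch, not anything python-docx provides: the function name read_keywords is made up for illustration, and it assumes the core-properties part sits at the conventional docProps/core.xml location rather than resolving it from the package relationships.

```python
import zipfile
import xml.etree.ElementTree as ET

# Namespace of the OPC core-properties part.
CP_NS = "http://schemas.openxmlformats.org/package/2006/metadata/core-properties"
# Assumption: the part is at its conventional location; a fully general
# reader would look it up through _rels/.rels instead.
CORE_PART = "docProps/core.xml"


def read_keywords(package_file):
    """Return the keywords string from an Office package, or None if absent.

    package_file may be a path or a binary file-like object.
    """
    with zipfile.ZipFile(package_file) as package:
        if CORE_PART not in package.namelist():
            return None
        root = ET.fromstring(package.read(CORE_PART))
    element = root.find("{%s}keywords" % CP_NS)
    return element.text if element is not None else None
```

Calling read_keywords("some-document.docx") (a placeholder path) returns whatever string the author stored in the keywords field. Since the part is defined at the packaging level, the same function should work for .pptx and .xlsx files as well.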
If you'll add an issue on the GitHub issue tracker I'll see how soon we can get to it. https://github.com/python-openxml/python-docx/issues