Requested feature
Presenter notes are a valuable part of a PowerPoint presentation and are worth extracting. Docling uses the python-pptx library to parse PowerPoint .pptx files, and python-pptx supports reading presenter notes, which are stored as notes slides. The code to read the notes is fairly trivial, but I'm not sure how the core team would like to see provenance handled: notes have no bounding boxes to use, yet the model will not accept `None`. Should we use an empty bbox? How would you want something like this handled?
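For illustration, here is a minimal sketch of reading the notes with python-pptx. It relies on `slide.has_notes_slide`, `slide.notes_slide`, and `notes_text_frame` from python-pptx; the helper names `iter_presenter_notes` and `read_presenter_notes` are hypothetical and not part of Docling or python-pptx. It deliberately leaves the provenance question open and only shows the extraction step.

```python
from typing import Iterator, List, Tuple


def iter_presenter_notes(presentation) -> Iterator[Tuple[int, str]]:
    """Yield (1-based slide number, notes text) for slides with non-empty notes.

    Hypothetical helper: `presentation` is expected to be a python-pptx
    Presentation (anything exposing `.slides` with the same attributes works).
    """
    for slide_no, slide in enumerate(presentation.slides, start=1):
        # Check first: accessing slide.notes_slide creates an empty notes
        # slide when none exists yet.
        if not slide.has_notes_slide:
            continue
        text_frame = slide.notes_slide.notes_text_frame
        # notes_text_frame is None when the notes slide has no notes placeholder.
        if text_frame is None:
            continue
        text = text_frame.text.strip()
        if text:
            yield slide_no, text


def read_presenter_notes(path: str) -> List[Tuple[int, str]]:
    """Open a .pptx file and collect its presenter notes (hypothetical helper)."""
    # Imported lazily so iter_presenter_notes can be used without python-pptx.
    from pptx import Presentation

    return list(iter_presenter_notes(Presentation(path)))
```

Calling `read_presenter_notes("deck.pptx")` (the file name is a placeholder) would return a list of `(slide_number, notes_text)` pairs. How each pair should be attached to the document, and with what provenance, is the open question above.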
Alternatives
Since python-pptx, which Docling already uses, supports this feature, no alternatives are necessary and it can be integrated as-is.