ID persistence #19

pajot · 2019-10-13T13:26:56Z

We copy tags from shape.text to shape.name at template instantiation to make them resistant to user mischief :)

Add _get_all_shapes internal method. The test simply checks to see if the attributes we expect to use the shape for are present and whether they can be coerced to strings.

Copy tags at template instantiation from shape.text to shape.name to make them resistant to user clobbering.

pptx_blueprint/__init__.py

Co-Authored-By: Tim Hoffmann <[email protected]>

pptx_blueprint/__init__.py

…nto id_persistence

into id_persistence

Use the regex match group for shape.name. This way we also include the curly braces and ignore non-matching text in shape.text.

cmahr · 2019-10-16T19:38:43Z

pptx_blueprint/__init__.py

        return matched_shapes

+    def _get_all_shapes(self) -> Iterable[BaseShape]:
+        # Do we need all the shapes? Perhaps we should filter on tags here.


We need to iterate over each shape at least once and apply the re.match to identify whether there's some work for us to do. It's a decision of style if you want to first get all shapes and then match and work, or if you get all matching slides and then do some work. For a (extremely) big presentation, however, generating a list of all slides may be quite memory expensive and possibly slow. Both issues could be eliminated if you'd just replace the square brackets with regular brackets, thereby changing the list comprehension into a generator expression.

Also, please inline...

Also, please inline...

Do you mean the comment? Isn't it a bit long to be inlined? (PEP8 says "use inline comments sparingly ;) )

Inlining means to eliminate the function and use the code of the function body instead of the function call. This is reasonable in this particular case because

the function is only used once (and unlikely to be needed multiple times)

the inline version is almost equally short and equally descriptive.
instead of
all_shapes = self._get_all_shapes()
just use
all_shapes = (shape for slide in self._presentation.slides for shape in slide.shapes)

Thanks for clarifying my comment, Tim. I now see that I could have been a little more verbose :-)

cmahr · 2019-10-16T19:39:45Z

pptx_blueprint/__init__.py

+        return all_shapes
+
+    def _copy_tags_to_name(self) -> None:
+        all_shapes = self._get_all_shapes()


please inline

cmahr · 2019-10-16T19:44:25Z

pptx_blueprint/__init__.py

+    def _copy_tags_to_name(self) -> None:
+        all_shapes = self._get_all_shapes()
+        # This regex matches on tags
+        regex_tag = re.compile(r'^\s*(\{\w+\})\s*$')


Could this be defined as a class constant?

Further, why are the curly brackets part of the group? Do we really want to set the shape.name to e.g. '{location}' instead of 'location'?

Further, why are the curly brackets part of the group? Do we really want to set the shape.name to e.g. '{location}' instead of 'location'?

Tim and I had discussed this, yes. Leaving the curly braces in makes it clear that this is a tag.

Could this be defined as a class constant?

Could do, but if so, please private. I don't want to make this user-configurable as of now. Strictly speaking, it would even be sufficient not to explicitly compile the regexp and just use inline if re.match(r'^\s*(\{\w+\})\s*$', shape.text):. re caches the last few (I belive to remember 100?) expressions. Therefore, the overhead for this basic call vs. the compiled regexp is just a dict lookup, which is negligible.

Do we really want to set the shape.name to e.g. '{location}' instead of 'location'?

I tend to say yes. While the curly brace feels a bit bulky here, it serves to highly reduce the likelihood of a name collision between a tag and an existing shape name.

cmahr · 2019-10-16T19:49:35Z

tests/test_template.py

+def test_get_all_shapes(template):
+    shapes = template._get_all_shapes()
+    for shape in shapes:
+        assert str(shape.text) and str(shape.name)


So how does this test that I 'get all the shapes'? I'm afraid that you'd have to mock the _presentation to verify this...

I simply named the test after the function it is testing. But no, it is not actually testing that I have all the shapes. But code to test this is necessarily going to be as complex as the code that performs it.

At the very least, the code checks that what was obtained has at least some properties we would expect to find in slides :)

Yes, the test would be rather complex. At least you'd have to monkey patch the Presentation class (because we don't use dependency injection) and configure the mock (which seems tedious). Further, there are people out there that say "don't test private methods, just test the public interface", i.e., you would not write a test for _get_all_shapes() at all, but only test it indirectly by testing the public method it is used in.

Would probably the simplest to just give the test a more descriptive (cool slang for "longer") name ;-)

cmahr · 2019-10-16T19:53:15Z

tests/test_template.py

+    regex_tag = re.compile(r'\{[\s\w]*\}')
+    for shape in all_shapes:
+        if regex_tag.match(shape.text):
+            assert shape.text == shape.name


Is this really what you wan't to test? Taking a look at the regex you defined in the implementation, for a shape having the text ' {location} ', you would set the name as '{location}'. Hence, in this case, shape.text != shape.name.

You're right, this test needs to be fixed :)

lysnikolaou and others added 7 commits October 12, 2019 15:00

Fix typo in .travis.yml

001766f

Update .travis.yml

2cd9987

Merge branch 'master' of https://github.com/timhoffm/pptx-blueprint

52dc9d2

Merge branch 'master' of https://github.com/timhoffm/pptx-blueprint

206735f

Merge remote-tracking branch 'upstream/master'

26f4554

Add _get_all_shapes internal method

8df7afc

Add _get_all_shapes internal method. The test simply checks to see if the attributes we expect to use the shape for are present and whether they can be coerced to strings.

Copy tags to shape.name attribute for persistence

853eee7

Copy tags at template instantiation from shape.text to shape.name to make them resistant to user clobbering.

timhoffm reviewed Oct 13, 2019

View reviewed changes

pptx_blueprint/__init__.py Outdated Show resolved Hide resolved

pptx_blueprint/__init__.py Outdated Show resolved Hide resolved

pajot and others added 2 commits October 14, 2019 21:06

Update pptx_blueprint/__init__.py

03465f9

Co-Authored-By: Tim Hoffmann <[email protected]>

Update pptx_blueprint/__init__.py

f3a5392

Co-Authored-By: Tim Hoffmann <[email protected]>

timhoffm reviewed Oct 14, 2019

View reviewed changes

pptx_blueprint/__init__.py Outdated Show resolved Hide resolved

timhoffm mentioned this pull request Oct 14, 2019

Add _get_all_shapes private method #16

Closed

pajot added 3 commits October 15, 2019 10:55

Merge branch 'master' of https://github.com/timhoffm/pptx-blueprint i…

814b5e3

…nto id_persistence

Merge branch 'id_persistence' of https://github.com/pajot/pptx-blueprint

159c00f

into id_persistence

Use the regex match group for shape.name

b7ede7a

Use the regex match group for shape.name. This way we also include the curly braces and ignore non-matching text in shape.text.

cmahr suggested changes Oct 16, 2019

View reviewed changes

ID persistence #19

Are you sure you want to change the base?

ID persistence #19

Uh oh!

Conversation

pajot commented Oct 13, 2019

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

pajot Oct 16, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

timhoffm Oct 16, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

pajot Oct 16, 2019 •

edited

Loading

timhoffm Oct 16, 2019 •

edited

Loading