Update string format representation used in instrument file output #70

g5t · 2023-06-08T11:52:07Z

Background

When a parameter needs to be written into a McStas/McXtrace .instr file McStasScript makes use of printf-style format specifiers, notably %d, %s, and %G.
Importantly, the last is used for floating point values which seems reasonable since, quoting the Wikipedia entry for %g/%G

double in either normal or exponential notation, whichever is more appropriate for its magnitude. g uses lower-case letters, G uses upper-case letters. This type differs slightly from fixed-point notation in that insignificant zeroes to the right of the decimal point are not included. Also, the decimal point is not included on whole numbers.

Problem

Unfortunately in some cases this style of string formatting can lead to truncation for floating point values, e.g.,

>>> x = 1.234567
>>> '%G' % x
'1.23457'

where the last digit has been truncated and the resulting string representation is rounded up.
This behaviour led to the discovery of a bug in Elliptic_guide_gravity.comp where the lengths of a number of guide segments were written by McStasScript into a DECLARE array and their total length was written by McStasScript as a component parameter. Truncation of both the individual element lengths and their total length produced a C instrument file where the sum of the segment lengths and the input total length were no longer in agreement.

Solution

This PR corrects this problem by moving to f-string formatting and dropping the format specifier for str and float values, since Python does a good job of accurately representing values via str() already.
For example, the value above is no longer truncated

>>> x = 1.234567
>>> f'{x}'
'1.234567'

and this works for numbers with large-magnitude exponents as well

>>> x = 1.23456789e29
>>> f'{x}'
'1.23456789e+29'

Side effects

A benefit of f-strings in Python is that the resulting code is easier to read, with the position of values directly related to their position in the resulting string.
For example,

fo.write(f"{self.type} {self.name} = {self.value}; {self.comment}")

or

fo.write(f"AT {tuple(self.AT_data[:3])}")

g5t · 2023-06-09T08:35:11Z

My latest commit is due to changes I made, visible in the 'side effects' section above, that caused test cases to fail.

I did not expect self.comment to include a space before the one-line comment marker, e.g., `' //...'.
Python's string representation of a tuple is a parenthesized list separated by commas and spaces, but the old behavior did not include spaces, i.e., old (0,0,0) new (0, 0, 0).

For the first case I changed the formatting lines to remove the space, ...{self.value};{self.comment}.
For the second I changed the test cases to add spaces since it makes no difference on the C side and I think the Python representation is easier to read. This can be changed back if the old behavior is important for some reason.

g5t added 2 commits June 8, 2023 13:26

[Apply][Fix] On top of the up-to-date master

2614d29

[Fix] white space issues for tests

4366b32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update string format representation used in instrument file output #70

Update string format representation used in instrument file output #70

g5t commented Jun 8, 2023

g5t commented Jun 9, 2023 •

edited

Loading

Update string format representation used in instrument file output #70

Are you sure you want to change the base?

Update string format representation used in instrument file output #70

Conversation

g5t commented Jun 8, 2023

Background

Problem

Solution

Side effects

g5t commented Jun 9, 2023 • edited Loading

g5t commented Jun 9, 2023 •

edited

Loading