Chapter 6 - Test File #1

safeisrisky · 2020-08-08T07:09:50Z

Hi,

I think there is a bug in the test file for Chapter 6 - tiny_python_projects/06_wc/test.py/

The following is a test function

def test_more():
    """Test on more than one file"""
    rv, out = getstatusoutput(f'{prg} {fox} {sonnet}')
    expected = ('       1       9      45 ../inputs/fox.txt\n'
                '      17     118     661 ../inputs/sonnet-29.txt\n'
                '      18     127     706 total')
    assert rv == 0
    assert out.rstrip() == expected

I think the above test function should have been

def test_more():
    """Test on more than one file"""
    rv, out = getstatusoutput(f'{prg} {fox} {sonnet}')
    expected = ('       1       9      45 ../inputs/fox.txt\n'
                '      17     118     669 ../inputs/sonnet-29.txt\n'
                '      18     127     714 total')
    assert rv == 0
    assert out.rstrip() == expected

The text was updated successfully, but these errors were encountered:

AlbertUlysses · 2021-01-29T23:19:13Z

Yes, I can also confirm this. I think the method the python script uses disregards some bytes. I double checked using the Linux's wc command
@kyclark
do you have any input on this?

AlbertUlysses · 2021-01-29T23:44:36Z

I just saw in the book the solution is correct on page 111, when it uses the linux's wc command it has the 714 total, but the test.py in the repo is wrong.
However, when we enter solution.py into the test.py in the repo it passes all the test.
when we compare the two:

>>> import os
>>> len(open('../inputs/sonnet-29.txt').read())
661
>>> os.path.getsize('../inputs/sonnet-29.txt')
669
>>>

We can see that the len of the file's string isn't the same as a byte size.
Basically the different between the two is explained here:

A. To count number of characters in str object, you can use len() function:

>>> print(len('please anwser my question'))
25

B. To get memory size in bytes allocated to store str object, you can use sys.getsizeof() function

>>> from sys import getsizeof
>>> print(getsizeof('please anwser my question'))
50

source: https://stackoverflow.com/questions/4967580/how-to-get-the-size-of-a-string-in-python

kyclark · 2021-01-30T03:32:29Z

Well, first off, my apologies for taking so long to address the original bug @safeisrisky. Yes, I can see there is a discrepancy between how I'm counting bytes using the length of a string and the actual size of the string. At this point, I can't fix the book, so I think it will just have to remain "wrong" where here "wrong" means "not the same as wc." It's unfortunate, but the point of the exercise is to get the reader to think about lines, words, and characters. I hate that I missed the distinction between characters and memory! Thanks for pointing this out.

AlbertUlysses · 2021-02-04T05:54:23Z

Hi!
I wanted to say one last thing, which is mostly for anyone curious about this. There isn't anything "wrong" with the code. The "issue" occurs because Python strings are in latin1, and the text file (sonnet-29.txt) has UTF8 characters (the apostrophes). So when we read the string, it returns a slightly smaller number. The easiest way to "fix" this is to change one variable in the solution code to:

num_bytes += len(line.encode('UTF8'))

With this change, there should be a change in the test.py

Also, thank you @kyclark for all the work you did on this book. I'm really enjoying it.

Fixes kyclark#1 Use map function when assigning low, high in read_csv to split the string at the '-' character, and convert the strings to integers using int() function, the first map argument. Fixes kyclark#2 Initiate empty list as wod. Use a foor loop of random.sample(exercises, ...) which creates a list of tuples. Assign 3 variables for each iteration of the loop: exercise, low, and high. Pick a random number between low and high, and append the tuple to wod. If the easy flag is given as argument, divide reps by two. Use the int() function to convert the result to an integer.

ZakiRucker pushed a commit to ZakiRucker/tiny_python_projects that referenced this issue Sep 12, 2021

completed and updated projects kyclark#1-3

8364db7

NowHappy pushed a commit to NowHappy/tiny_python_projects that referenced this issue Oct 1, 2021

kyclark#1 joy.y

90e2cd6

NowHappy pushed a commit to NowHappy/tiny_python_projects that referenced this issue Oct 1, 2021

kyclark#1 joy.y

7d819f3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Chapter 6 - Test File #1

Chapter 6 - Test File #1

safeisrisky commented Aug 8, 2020

AlbertUlysses commented Jan 29, 2021 •

edited

Loading

AlbertUlysses commented Jan 29, 2021 •

edited

Loading

kyclark commented Jan 30, 2021

AlbertUlysses commented Feb 4, 2021 •

edited

Loading

Chapter 6 - Test File #1

Chapter 6 - Test File #1

Comments

safeisrisky commented Aug 8, 2020

AlbertUlysses commented Jan 29, 2021 • edited Loading

AlbertUlysses commented Jan 29, 2021 • edited Loading

kyclark commented Jan 30, 2021

AlbertUlysses commented Feb 4, 2021 • edited Loading

AlbertUlysses commented Jan 29, 2021 •

edited

Loading

AlbertUlysses commented Jan 29, 2021 •

edited

Loading

AlbertUlysses commented Feb 4, 2021 •

edited

Loading