I wasn't able to load the sudoku image properly using iio.im

Hi <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="

Thanks for the quick reply ! I have now installed <code class="notra

Which OS are you running? Were you able to follow <a href="https://datacarpentry.org/i

I can reproduce this. <a target="_blank" rel="noopener noreferrer no

Well done <a class="user-mention notranslate" data-hovercard-type="user" data-hovercar

Ep3: iio.imread is unreliable about image-processing HOT 8 CLOSED

gcapes commented on September 10, 2024

Ep3: iio.imread is unreliable

from image-processing.

Comments (8)

mkcor commented on September 10, 2024

Hi @gcapes,

There is a sentence related to this in episode 5 and I realize it should be moved to episode 3 instead, when using mode="L" first appears. Sorry!

"The first argument to iio.imread() is the filename of the image. The second argument mode="L" defines the type and depth of a pixel in the image (e.g., an 8-bit pixel has a range of 0-255). This argument is forwarded to the pillow backend, for which mode “L” means 8-bit pixels and single-channel (i.e., grayscale). pillow is a Python imaging library; which backend is used by iio.imread() may be specified (to use pillow, you would pass this argument: plugin="pillow"); if unspecified, iio.imread() determines the backend to use based on the image type."

Do you have pillow installed?

from image-processing.

gcapes commented on September 10, 2024

Thanks for the quick reply !

I have now installed pillow into my venv but that doesn't look to be the issue. mode="L" doesn't look to be the problem either. If I remove that argument, I get a completely white image instead of completely black.

from image-processing.

mkcor commented on September 10, 2024

Which OS are you running? Were you able to follow https://datacarpentry.org/image-processing/setup/? I guess the test at the end will give you a blank image at this point...

from image-processing.

gcapes commented on September 10, 2024

I'm on Linux Mint. I'm not using Anaconda, but have set up a venv with all the required packages as far as I can tell, including scikit-image version 0.19.3 using python 3.10.

The test gives me what looks to be success:

I've worked round this problem it for the tutorial, so I'm only really reporting it as something which might affect other learners. It only seems to affect that png image. JPG and TIFF are loading ok.

Feel free to close if this isn't reproducible. Thanks!

from image-processing.

mkcor commented on September 10, 2024

Strange... I cannot reproduce it on my end, indeed (the PNG image is displaying fine for me); I'm running Python 3.9.15 on Debian GNU/Linux 11 (bullseye).

from image-processing.

tobyhodges commented on September 10, 2024

I can reproduce this.

(Aside: there is also a small error in the solution block, where we have forgotten to import imageio as iio.)

The immediate issue is coming from the way that mode='L' will return 8-bit integer values (rather than floating point intensities between 0 and 1 as produced by skimage.color.rgb2gray):

The rest of the solution assumes that the image array contains values between 0 and 1 but really they are between 0 and 255.

@gcapes here is a solution that should work for the exercise:

import imageio.v3 as iio
import matplotlib.pyplot as plt

image = iio.imread(uri="data/sudoku.png", mode="L")
image[image > 190] = 190
plt.imshow(image, cmap='gray', vmin=0, vmax=255)

Moving onto the question of how to fix this, it is made quite complicated by the fact that the sudoku image has an alpha channel. So skimage.color.rgb2gray returns an error:

import skimage.color
image = iio.imread(uri="data/sudoku.png")
gray_image = skimage.color.rgb2gray(image)

ValueError: the input array must have size 3 along `channel_axis`, got (900, 900, 4)

I can propose four options from here:

We find a way to load the image as floating point intensity values
We adjust the rest of the exercise to work with 8-bit integer values, and highlight the difference between the outputs of skimage.color.rgb2gray and iio.imread(mode='L') in the Converting colour images to grayscale section above the exercise.
We adopt the skimage.color.rgb2gray approach to solve this exercise, and talk through the existence of the alpha channel and how to deal with it.
We remove the alpha channel from the image, and provide a three-channel version as the example file in the data folder, then use the rgb2gray approach.

Here are my thoughts on each of these:

I do not know if this is possible. A quick review of the Pillow documentation did not seem to show me a mode we could use to achieve it.
We should add the note about the different outputs to the Converting colour images to grayscale section.
I would really prefer to avoid a discussion of alpha channels at this stage of the lesson, and the middle of an exercise is definitely not the right place for it regardless.
This would be easy enough to do, even if it does mean creating a new version of the dataset on FigShare. But I wonder if really we want to be recommending mode='L' over rgb2gray in general.

Thoughts very welcome from anyone with more working experience of image processing @mkcor @bobturneruk @K-Meech @quist00 @uschille

from image-processing.

gcapes commented on September 10, 2024

Well done @tobyhodges for getting to the bottom of this!
I would suggest option 4. It requires the least extra explaining and keeps to the learning objectives.

from image-processing.

mkcor commented on September 10, 2024

Oh, sorry, I had skipped the vmin=0, vmax=1 arguments in the plotting function... Mystery solved.

I agree that using a conversion function such as rgb2gray is a more robust practice and it is more transparent to the learner.

from image-processing.

Ep3: iio.imread is unreliable about image-processing HOT 8 CLOSED

Comments (8)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent