vrooje / gzcandels_datapaper
Collaborative paper writing. Data release paper for Galaxy Zoo: CANDELS.
License: MIT License
It feels a little glossy right now.
GZ references, of course, but also non-GZ references.
Hoping some of the CANDELS team will also point out where I've accidentally omitted seminal references to their papers.
There isn't one, and there should be. At what redshifts are we resolving what features? What constraints does a "smooth" classification put on feature sizes? For extended sources, what does a plot of size versus p_artifact look like?
I'm thinking we should start the "Use of classifications in practice" section with this, as in "this is overall, but it contains no appropriate selections on redshift, luminosity, mass, etc.; here's what you do if you want to do a specific study".
What do you think? @willettk
e.g. "clean" sample of disks selected via various disk features, but with clean-smooth B/Tot < 0.5 as well?
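A sketch of what that selection might look like in practice. The column names and thresholds below are placeholders for illustration, not the paper's actual clean-sample cuts:

```python
import numpy as np

# Hypothetical vote-fraction columns and thresholds -- the real "clean disk"
# cuts would come from the paper's clean-sample definitions, not this sketch.
def clean_disk_mask(p_features, p_not_edgeon, b_tot, f_thresh=0.7, bt_max=0.5):
    """Boolean mask for a 'clean' disk sample: high featured fraction,
    not edge-on, and bulge-to-total ratio below bt_max."""
    return (p_features >= f_thresh) & (p_not_edgeon >= f_thresh) & (b_tot < bt_max)

# Toy values for four galaxies:
p_features   = np.array([0.90, 0.80, 0.30, 0.95])
p_not_edgeon = np.array([0.80, 0.90, 0.90, 0.75])
b_tot        = np.array([0.20, 0.60, 0.10, 0.40])
mask = clean_disk_mask(p_features, p_not_edgeon, b_tot)
# -> only galaxies 0 and 3 pass all three cuts
```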
Check that we haven't used any of the following terms wrongly (with what we should use in parentheses):
Also check to make sure all these easily-confused terms are being used correctly:
In a way that still looks like a coherent design but preserves the icons.
Send emails asking people for their acknowledgments
I have an Einstein acknowledgment now...
At the end of each row of images showing various examples of p_values for different responses, we should show small histograms plotting the distribution of those responses. It wouldn't take up much more real estate and it would add a lot of value.
Something like @CKrawczyk's node tree but one diagram for the whole population.
Coleman, how hard would this be for you to produce?
it shouldn't be that many, but verify it.
pick one and stick with it.
Now that we have new weighted classifications to T00, check the definition of "clean" samples to see if they need to be re-defined.
Spearman's is more robust when the data are not well-behaved (e.g. not Gaussian-distributed). And since the values are nearly identical, report the one that's marginally more appropriate.
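For reference, the two statistics differ only in that Spearman's rho operates on ranks, which a few lines of numpy make concrete. Minimal sketch only; a real analysis would use scipy.stats, which also handles tied values:

```python
import numpy as np

def pearson(x, y):
    """Pearson correlation coefficient."""
    return np.corrcoef(x, y)[0, 1]

def spearman(x, y):
    """Spearman's rho: the Pearson correlation of the ranks.
    (Sketch assumes no tied values; scipy.stats.spearmanr uses
    average ranks to handle ties properly.)"""
    rank = lambda a: np.argsort(np.argsort(a))
    return pearson(rank(x), rank(y))

# A monotone but strongly non-linear relation: Spearman's sees perfect
# rank agreement, while Pearson's is penalised for the non-linearity.
x = np.arange(1.0, 11.0)
y = x ** 3
rho_p = pearson(x, y)   # < 1
rho_s = spearman(x, y)  # exactly 1
```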
Early on, Fig 4 is referenced as showing the colour images. It doesn't. Also, you can't point to Fig 4 before Figs 1, 2, and 3; MNRAS won't allow it.
I have spot-checked my way through several consistency calculations, but am I really sure of this? The consistency distribution has 5 people with consistency < 0.2 and otherwise cuts off really sharply just above 0.3. GZ2 didn't seem to do that. That's troubling.
Would it make sense to add a histogram of number of classifications per user in Section 3.3? I like to show those in talks....
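If we do add it, the counts are cheap to compute. A toy sketch (made-up user IDs; roughly log-spaced bins, as is usual since a few volunteers contribute orders of magnitude more than the median):

```python
import numpy as np

# Toy classification log: one row per classification, labelled by user ID.
# (The real input would be the GZ: CANDELS classification table.)
user_ids = np.array([3, 1, 2, 1, 1, 3, 2, 1, 3, 3, 3, 1])

# Number of classifications contributed by each distinct user...
_, counts = np.unique(user_ids, return_counts=True)

# ...binned for the histogram (roughly log-spaced bin edges):
bins = np.array([1, 2, 4, 8, 16])
hist, _ = np.histogram(counts, bins=bins)
```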
What's the ultimate release format of the data? The paper says it'll be on http://data.galaxyzoo.org, which we should do, but I want to include it in more formats for posterity's sake.
Possible options:
And K14 --> K15 in the text.
The current F6 is not successful at convincing people of the quality of fits to \Delta f_value as a function of f_value and surface brightness. A simple fix is to just plot the planes as two line plots for each response (and probably include fewer responses), but we could explore having interactive 3D plots (which apparently MNRAS supports). Does anyone know how to do this? @willettk @rjsmethurst @chrislintott @CKrawczyk etc?
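For the 2D fallback, the fitted plane just gets sliced at a few fixed surface brightnesses, turning the surface into overplottable curves. A sketch with entirely made-up coefficients (the real ones come from the F6 fits):

```python
import numpy as np

# Hypothetical plane fit Delta_f = a*f + b*mu + c for one response.
# Coefficients here are invented for illustration only.
a, b, c = -0.15, 0.02, 0.1
f = np.linspace(0.0, 1.0, 50)

# Slice the fitted plane at two representative surface brightnesses mu,
# reducing the 3D surface to two 2D line plots:
for mu in (22.0, 25.0):
    delta_f = a * f + b * mu + c
    # plt.plot(f, delta_f, label=f"mu = {mu}")  # one curve per slice
```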
If not I'm tempted to just do the 2 2D plots because I don't want to get bogged down in this.
I suspect the referee will dislike Table 5 as much as I do now that I'm in the post-submission clarity phase. Since we aren't publishing B/Tot ratios in this paper (those are for @BorisHaeussler to publish as he sees fit), those galaxies should just be identified via a flag in the main catalog.
Did we look at the CANDELS classifications for signs of errant/non-human behavior aside from the star/artifact question? If so, would it be quick to run? I think it'd be good if we had the ability to write a sentence in 3.4 akin to "We have also analyzed the percentages of the remaining top-level categories (smooth and features/disk) for all users and find no/some/lots of evidence for bot-based classifications".
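The check itself should be quick: compute each user's top-level response fractions and flag anyone answering the same way essentially every time. A toy sketch with a made-up flagging threshold:

```python
import numpy as np

# Toy per-classification log: user ID and top-level response
# (0 = smooth, 1 = features/disk). Real input: the GZ: CANDELS tables.
users     = np.array([1, 1, 1, 1, 2, 2, 2, 2])
responses = np.array([0, 1, 0, 1, 1, 1, 1, 1])

# Fraction of "features/disk" answers per user; a user giving the same
# answer ~100% of the time over many classifications is bot-like.
uids = np.unique(users)
frac_featured = np.array([responses[users == u].mean() for u in uids])
suspicious = (frac_featured > 0.95) | (frac_featured < 0.05)  # made-up cut
```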
\begin{table*} etc. doesn't help as the number of line breaks is set by the number of responses. It needs to be split across 2 columns.
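One possible layout (sketch only): split the responses into two halves and set them side by side as minipages inside the table* environment, repeating the header in each half:

```latex
% Sketch: long response table split into two parallel halves.
\begin{table*}
  \centering
  \begin{minipage}{0.48\textwidth}
    \begin{tabular}{lrr}
      \hline
      Response & $N$ & $f$ \\
      \hline
      % ... first half of the responses ...
      \hline
    \end{tabular}
  \end{minipage}\hfill
  \begin{minipage}{0.48\textwidth}
    \begin{tabular}{lrr}
      \hline
      Response & $N$ & $f$ \\
      \hline
      % ... second half of the responses ...
      \hline
    \end{tabular}
  \end{minipage}
  \caption{...}
\end{table*}
```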
Need to check that I've written up the correct details of how the subject images were created (linearity, stretch, etc.). Those are in Section 2.1.
Also check with Jeyhan the status of the UDS classification paper (listed as in-prep in S4).
I'd like to see more on what Wisnioski found about the dynamics of high z galaxies in the section on Smooth discs.
Pick ranges of dM* and compute smooth vs featured fractions to add to the text describing the sankey diagram... could even be a new plot. Depends what the referee says.
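The binned fractions are only a few lines of numpy. A sketch with made-up values, using the mean weighted vote fraction per bin as a stand-in for the smooth/featured split:

```python
import numpy as np

# Toy inputs (real ones: stellar-mass offsets and weighted vote fractions
# from the GZ: CANDELS catalogue).
dmstar   = np.array([-0.8, -0.3, 0.1, 0.4, 0.9, -0.1])
p_smooth = np.array([0.9, 0.7, 0.4, 0.2, 0.1, 0.6])

# Assign each galaxy to a dM* bin, then average p_smooth within each bin
# (the featured fraction follows as ~1 - smooth in this two-way split).
bins = np.array([-1.0, -0.5, 0.0, 0.5, 1.0])
idx = np.digitize(dmstar, bins) - 1
f_smooth = np.array([p_smooth[idx == i].mean() if np.any(idx == i) else np.nan
                     for i in range(len(bins) - 1)])
```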
Use Willett et al. (2013) and Lackner & Gunn (2012) to add the z=0 comparison to the figure.
I don't suppose @willettk might want to take this on?
(I have the Lackner & Gunn tables if they're not easily accessible online.)
As in Willett et al. (2013), we should show a couple of example subjects in a partial table. In addition to #30.
For ease of use of the classifications, some CANDELS team members have pointed out it would be helpful to add some discussion to S4 on how to translate between classification systems, and when this is and is not a good idea.
This might be a good opportunity to go into further detail on e.g. how to use both together to select merging systems, and how the differences in clumpy classifications might be used to do interesting science.
Relate the smooth disks to kinematic downsizing via S14?
Also there's a SINS ref I should include too.
(NT)
(now that we have new weighted classifications for task T00), re-check numbers, particularly in:
When compiling, I get this error:
LaTeX Warning: File `consistencies_iterations2.eps' not found on input line 516
now that we have a new set of weighted classifications for task T00, double-check (or just re-make) Figures:
We quote a lot of p-values in Section 4 for the CANDELS team-GZ comparisons. I think an upper limit (p < 2e-16) is more appropriate than putting p~0, since posterity won't necessarily know what our machine precision was.
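A tiny helper makes the convention mechanical. The 2e-16 floor sits just below double precision's machine epsilon (~2.2e-16), which is why values reported below it aren't meaningful:

```python
# float64 cannot resolve p-values much below machine epsilon (~2.2e-16),
# so a computed p ~ 0 really means "below what we can resolve".  Quoting
# an explicit floor keeps the statement meaningful for posterity:
P_FLOOR = 2e-16

def report_p(p):
    """Format a p-value for the text, capping at the precision floor."""
    return f"p < {P_FLOOR:.0e}" if p < P_FLOOR else f"p = {p:.2g}"
```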
I think we should label the rows in Fig 4 - perhaps in the white space in each histogram - with shorthand of the question being answered.
which... is a lot. Fun!
Suggestions for investigations e.g. mergers, clumpy galaxies, further investigation of smooth disks?
In Section 3.8 we say that we do not have wide-field depth classifications for 8130 subjects with deep exposures. We could collect them.... Should we?
Make space in the introduction to note the work GZ has done with Hubble data already. Melvin+, Simmons+, Cheung+.
Mostly this is covered by other items, but just double-check to make sure there's none left before submitting.
Not sure how that happened, but I've double-checked and it's fine in the data itself... just the table & figure are wrong.
(In addition to the full-size plot - zoom in somewhere to show convergence.)
Side note: I've always been sort of uncomfortable with the way the convergence here is quite sudden - the consistencies change a lot between the second and third iteration and then there is mostly no change between 3-4 and 4-5. I've double-checked it all and if something went wrong, I can't find it... so I think it looks real.
Just noting this, though, in case someone can a) find the trouble, or b) reassure me...
Dan McIntosh pointed out that there are now some parameters that add value to the K15 visual classification raw fractions, e.g. p_merger that combines the various merger votes into one value that goes from 0 to 1, a p_diskiness and also an artifact metric. These would be really interesting comparisons and could resolve some of the issues we had with combining parameters. It's worth exploring adding them as an additional plot. (Dan has sent me the info as I think some of these are currently unpublished).