Quantcast

Plotting question

classic Classic list List threaded Threaded
8 messages Options
Reply | Threaded
Open this post in threaded view
|  
Report Content as Inappropriate
star

Plotting question

Andrew McCulloch
Hi,

I use R to draw my graphs. I have 100 points on a simple xy-plot. The points are
distinguished by a third variable which is categorical with 10 levels. I have
been plotting x against y and using gray scales to distinguish the level of the
categorical variable for each point. It looks ok to me but a journal reviewer
says this is not any use. I cannot afford to pay for colour prints. Any ideas on
what is the best way to distinguish 10 groups on an xy scatter plot?



If all else fails I can just remove the graph and give them a table of
regression coefficients.


Thanks.

Yours Sincerely
Andrew McCulloch

______________________________________________
[hidden email] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
Reply | Threaded
Open this post in threaded view
|  
Report Content as Inappropriate
star

Re: Plotting question

Duncan Murdoch-2
On 11-08-01 5:44 AM, Andrew McCulloch wrote:
> Hi,
>
> I use R to draw my graphs. I have 100 points on a simple xy-plot. The points are
> distinguished by a third variable which is categorical with 10 levels. I have
> been plotting x against y and using gray scales to distinguish the level of the
> categorical variable for each point. It looks ok to me but a journal reviewer
> says this is not any use. I cannot afford to pay for colour prints. Any ideas on
> what is the best way to distinguish 10 groups on an xy scatter plot?

Plot digits or letters or other symbols.

Duncan Murdoch

>
>
>
> If all else fails I can just remove the graph and give them a table of
> regression coefficients.
>
>
> Thanks.
>
> Yours Sincerely
> Andrew McCulloch
>
> ______________________________________________
> [hidden email] mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.

______________________________________________
[hidden email] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
Reply | Threaded
Open this post in threaded view
|  
Report Content as Inappropriate
star

Re: Plotting question

Gene Leynes
In reply to this post by Andrew McCulloch
plot(1:10, pch=letters[1:10])

On Mon, Aug 1, 2011 at 4:44 AM, Andrew McCulloch <[hidden email]>wrote:

> Hi,
>
> I use R to draw my graphs. I have 100 points on a simple xy-plot. The
> points are
> distinguished by a third variable which is categorical with 10 levels. I
> have
> been plotting x against y and using gray scales to distinguish the level of
> the
> categorical variable for each point. It looks ok to me but a journal
> reviewer
> says this is not any use. I cannot afford to pay for colour prints. Any
> ideas on
> what is the best way to distinguish 10 groups on an xy scatter plot?
>
>
>
> If all else fails I can just remove the graph and give them a table of
> regression coefficients.
>
>
> Thanks.
>
> Yours Sincerely
> Andrew McCulloch
>
> ______________________________________________
> [hidden email] mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
> http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>

        [[alternative HTML version deleted]]

______________________________________________
[hidden email] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
Reply | Threaded
Open this post in threaded view
|  
Report Content as Inappropriate
star

Re: Plotting question

Bert Gunter
In reply to this post by Duncan Murdoch-2
IMHO:

On Mon, Aug 1, 2011 at 7:51 AM, Duncan Murdoch <[hidden email]> wrote:

> On 11-08-01 5:44 AM, Andrew McCulloch wrote:
>>
>> Hi,
>>
>> I use R to draw my graphs. I have 100 points on a simple xy-plot. The
>> points are
>> distinguished by a third variable which is categorical with 10 levels. I
>> have
>> been plotting x against y and using gray scales to distinguish the level
>> of the
>> categorical variable for each point. It looks ok to me but a journal
>> reviewer
>> says this is not any use. I cannot afford to pay for colour prints. Any
>> ideas on
>> what is the best way to distinguish 10 groups on an xy scatter plot?
>
> Plot digits or letters or other symbols.
>
> Duncan Murdoch
>
No, this does not work. See Cleveland's books (e.g. "Visualizing
Data"). 10 is too many symbols to constantly refer to a legend to keep
straight, and digits or letters do not allow you to readily perceive
the pattern. (Caveat: If "most" of the data are only 2 or 3 of the
symbols, then these can work).

I think the OP's idea of using gray scales was better. I would dispute
the reviewer and refer them to appropriate references. Alternatively,
thermometer plots (aka "filled rectangle" plots) would be best. Again,
Cleveland's books provide scientific justification rather than merely
the (possibly uninformed) aesthetic opinion of a reviewer. Presumably,
the journal editor would accept hard data and psychological research
in preference to opinions.

>>
>>
>>
>> If all else fails I can just remove the graph and give them a table of
>> regression coefficients.

No. I think your attempt to use a graph is a much better way to go.
Try to resist poor practices such as just publishing summary
statistics.

Cheers,
Bert

>>
>>
>> Thanks.
>>
>> Yours Sincerely
>> Andrew McCulloch
>>
>> ______________________________________________
>> [hidden email] mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide
>> http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
>
> ______________________________________________
> [hidden email] mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>



--
"Men by nature long to get on to the ultimate truths, and will often
be impatient with elementary studies or fight shy of them. If it were
possible to reach the ultimate truths without the elementary studies
usually prefixed to them, these would not be preparatory studies but
superfluous diversions."

-- Maimonides (1135-1204)

Bert Gunter
Genentech Nonclinical Biostatistics

______________________________________________
[hidden email] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
Reply | Threaded
Open this post in threaded view
|  
Report Content as Inappropriate
star

Re: Plotting question

Duncan Murdoch-2
On 11-08-01 11:48 AM, Bert Gunter wrote:

> IMHO:
>
> On Mon, Aug 1, 2011 at 7:51 AM, Duncan Murdoch<[hidden email]>  wrote:
>> On 11-08-01 5:44 AM, Andrew McCulloch wrote:
>>>
>>> Hi,
>>>
>>> I use R to draw my graphs. I have 100 points on a simple xy-plot. The
>>> points are
>>> distinguished by a third variable which is categorical with 10 levels. I
>>> have
>>> been plotting x against y and using gray scales to distinguish the level
>>> of the
>>> categorical variable for each point. It looks ok to me but a journal
>>> reviewer
>>> says this is not any use. I cannot afford to pay for colour prints. Any
>>> ideas on
>>> what is the best way to distinguish 10 groups on an xy scatter plot?
>>
>> Plot digits or letters or other symbols.
>>
>> Duncan Murdoch
>>
> No, this does not work.

You have amazing perception to know that it doesn't work in Andrew's
graph.  But then you go on to suggest that sometimes it does, and then
suggest using symbols.

Obviously you need to see the graph to know what works.  If the 10
categories are ordered, then something like thermometer plots would
work.  If they are grouped into a small number of variations on a small
number of groups, then digits or letters combined with shading might
work, especially if the groups are well separated, or there are clear
patterns.

I'd agree with the reviewer than 10 levels of shading is probably too
many to distinguish, and I'd agree with you that digits 0-9 in equal
quantities in an unstructured scatterplot are probably not a good
presentation, but I wouldn't want to give specific advice about plotting
a dataset without seeing it.

Duncan Murdoch

See Cleveland's books (e.g. "Visualizing

> Data"). 10 is too many symbols to constantly refer to a legend to keep
> straight, and digits or letters do not allow you to readily perceive
> the pattern. (Caveat: If "most" of the data are only 2 or 3 of the
> symbols, then these can work).
>
> I think the OP's idea of using gray scales was better. I would dispute
> the reviewer and refer them to appropriate references. Alternatively,
> thermometer plots (aka "filled rectangle" plots) would be best. Again,
> Cleveland's books provide scientific justification rather than merely
> the (possibly uninformed) aesthetic opinion of a reviewer. Presumably,
> the journal editor would accept hard data and psychological research
> in preference to opinions.
>
>>>
>>>
>>>
>>> If all else fails I can just remove the graph and give them a table of
>>> regression coefficients.
>
> No. I think your attempt to use a graph is a much better way to go.
> Try to resist poor practices such as just publishing summary
> statistics.
>
> Cheers,
> Bert
>>>
>>>
>>> Thanks.
>>>
>>> Yours Sincerely
>>> Andrew McCulloch
>>>
>>> ______________________________________________
>>> [hidden email] mailing list
>>> https://stat.ethz.ch/mailman/listinfo/r-help
>>> PLEASE do read the posting guide
>>> http://www.R-project.org/posting-guide.html
>>> and provide commented, minimal, self-contained, reproducible code.
>>
>> ______________________________________________
>> [hidden email] mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
>>
>
>
>

______________________________________________
[hidden email] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
Reply | Threaded
Open this post in threaded view
|  
Report Content as Inappropriate
star

Re: Plotting question

Bert Gunter
Well stated, Duncan, and I plead guilty, though I did try to weasel
out with caveats.

Perhaps I may plead down to a lesser sentence or probation by saying
that I was offering what I still believe to be appropriate advice for
a general strategy for handling this sort of plotting issue; but that
as always, one's mileage may vary depending on the specifics.

Extra inline comments below.

Cheers,
Bert

And to return to R, note that any of these options is easy to
implement in any of at least 3 different graphics frameworks (base,
trellis, and ggplot). So Duncan and I can spar over what we'd like to
do without being limited by what software **allows** us to do.


On Mon, Aug 1, 2011 at 1:48 PM, Duncan Murdoch <[hidden email]> wrote:

> On 11-08-01 11:48 AM, Bert Gunter wrote:
>>
>> IMHO:
>>
>> On Mon, Aug 1, 2011 at 7:51 AM, Duncan Murdoch<[hidden email]>
>>  wrote:
>>>
>>> On 11-08-01 5:44 AM, Andrew McCulloch wrote:
>>>>
>>>> Hi,
>>>>
>>>> I use R to draw my graphs. I have 100 points on a simple xy-plot. The
>>>> points are
>>>> distinguished by a third variable which is categorical with 10 levels. I
>>>> have
>>>> been plotting x against y and using gray scales to distinguish the level
>>>> of the
>>>> categorical variable for each point. It looks ok to me but a journal
>>>> reviewer
>>>> says this is not any use. I cannot afford to pay for colour prints. Any
>>>> ideas on
>>>> what is the best way to distinguish 10 groups on an xy scatter plot?
>>>
>>> Plot digits or letters or other symbols.
>>>
>>> Duncan Murdoch
>>>
>> No, this does not work.
>
> You have amazing perception to know that it doesn't work in Andrew's graph.
>  But then you go on to suggest that sometimes it does, and then suggest
> using symbols.
>
> Obviously you need to see the graph to know what works.  If the 10
> categories are ordered, then something like thermometer plots would work.
>  If they are grouped into a small number of variations on a small number of
> groups, then digits or letters combined with shading might work, especially
> if the groups are well separated, or there are clear patterns.
>
> I'd agree with the reviewer than 10 levels of shading is probably too many
> to distinguish,

But for ordered categories you may not wish to distinguish so much as
give an overall gestalt, for which a gray scale with 10 levels could
work quite well. So it depends on the specifics of what's being
plotted, no?

and I'd agree with you that digits 0-9 in equal quantities

> in an unstructured scatterplot are probably not a good presentation, but I
> wouldn't want to give specific advice about plotting a dataset without
> seeing it.
>
> Duncan Murdoch
>
> See Cleveland's books (e.g. "Visualizing
>>
>> Data"). 10 is too many symbols to constantly refer to a legend to keep
>> straight, and digits or letters do not allow you to readily perceive
>> the pattern. (Caveat: If "most" of the data are only 2 or 3 of the
>> symbols, then these can work).
>>
>> I think the OP's idea of using gray scales was better. I would dispute
>> the reviewer and refer them to appropriate references. Alternatively,
>> thermometer plots (aka "filled rectangle" plots) would be best. Again,
>> Cleveland's books provide scientific justification rather than merely
>> the (possibly uninformed) aesthetic opinion of a reviewer. Presumably,
>> the journal editor would accept hard data and psychological research
>> in preference to opinions.
>>
>>>>
>>>>
>>>>
>>>> If all else fails I can just remove the graph and give them a table of
>>>> regression coefficients.
>>
>> No. I think your attempt to use a graph is a much better way to go.
>> Try to resist poor practices such as just publishing summary
>> statistics.
>>
>> Cheers,
>> Bert
>>>>
>>>>
>>>> Thanks.
>>>>
>>>> Yours Sincerely
>>>> Andrew McCulloch
>>>>
>>>> ______________________________________________
>>>> [hidden email] mailing list
>>>> https://stat.ethz.ch/mailman/listinfo/r-help
>>>> PLEASE do read the posting guide
>>>> http://www.R-project.org/posting-guide.html
>>>> and provide commented, minimal, self-contained, reproducible code.
>>>
>>> ______________________________________________
>>> [hidden email] mailing list
>>> https://stat.ethz.ch/mailman/listinfo/r-help
>>> PLEASE do read the posting guide
>>> http://www.R-project.org/posting-guide.html
>>> and provide commented, minimal, self-contained, reproducible code.
>>>
>>
>>
>>
>
>



--
"Men by nature long to get on to the ultimate truths, and will often
be impatient with elementary studies or fight shy of them. If it were
possible to reach the ultimate truths without the elementary studies
usually prefixed to them, these would not be preparatory studies but
superfluous diversions."

-- Maimonides (1135-1204)

Bert Gunter
Genentech Nonclinical Biostatistics

______________________________________________
[hidden email] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
Reply | Threaded
Open this post in threaded view
|  
Report Content as Inappropriate
star

Re: Plotting question

Indrajit Sen Gupta
In reply to this post by Andrew McCulloch
Why can't you simply draw a 3 - D plot. The third variable being categorical can expressed as a bar.

Regards,
Indrajit



________________________________
From: Andrew McCulloch <[hidden email]>
To: [hidden email]
Sent: Monday, August 1, 2011 3:14 PM
Subject: [R] Plotting question

Hi,

I use R to draw my graphs. I have 100 points on a simple xy-plot. The points are
distinguished by a third variable which is categorical with 10 levels. I have
been plotting x against y and using gray scales to distinguish the level of the
categorical variable for each point. It looks ok to me but a journal reviewer
says this is not any use. I cannot afford to pay for colour prints. Any ideas on
what is the best way to distinguish 10 groups on an xy scatter plot?



If all else fails I can just remove the graph and give them a table of
regression coefficients.


Thanks.

Yours Sincerely
Andrew McCulloch

______________________________________________
[hidden email] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
        [[alternative HTML version deleted]]


______________________________________________
[hidden email] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
Reply | Threaded
Open this post in threaded view
|  
Report Content as Inappropriate
star

Re: Plotting question

Karl Ove Hufthammer
In reply to this post by Andrew McCulloch
Andrew McCulloch wrote:

> I use R to draw my graphs. I have 100 points on a simple xy-plot. The
> points are distinguished by a third variable which is categorical with 10
> levels. I have been plotting x against y and using gray scales to
> distinguish the level of the categorical variable for each point. It looks
> ok to me but a journal reviewer says this is not any use. I cannot afford
> to pay for colour prints. Any ideas on what is the best way to distinguish
> 10 groups on an xy scatter plot?

How about having *10* scatterplots + an identical grid in each plot? Try

  example(coplot)

for an idea about it could look (ignore the marginal plots). Of course, do
use the lattice or the ggplot2 package, not the coplot function.

Too bad you have 10 groups and not 9 (or 12), BTW ... :-/

--
Karl Ove Hufthammer

______________________________________________
[hidden email] mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
Loading...