Question for statisticians > General Discussion

Posted: 11/12/2012 9:50:28 PM EDT

I'm doing a chi-square contingency analysis on an 8x19 matrix. My result is huge, which is to be expected, with p < 0.001, phi = 0.91 and an effect size of 0.27. So obviously I have a very strong difference between the observed and expected frequencies (f[o] and f[e]) somewhere in the data. Now, I can eyeball where the major differences are, but I would really rather quantify them.

So my question is this: is there some kind of post-hoc test that I can run to come up with significant-difference thresholds for variable-on-variable comparisons? I was thinking about running individual goodness-of-fit tests, using the expected values derived from the contingency test, but I'm not sure if this would be mathematically valid.

If it helps, the research question is based on 8 states and 19 categories of industry: are certain states preferred HQ locations for certain industrial sectors?

Are there any resources in SPSS for this, or do I just need to eyeball it?
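
For anyone following along outside SPSS, here is a minimal Python sketch of the kind of test described above. The 3x4 table is made up (the real 8x19 data is not shown in the thread). Expected counts use the usual row total x column total / N rule and phi is computed as sqrt(chi2 / N). Cramér's V is included only as one common effect-size choice, since the post does not say which measure produced the 0.27.

```python
import math

# Hypothetical counts: rows = states, columns = industry sectors.
observed = [
    [30, 10, 5, 15],
    [12, 25, 8, 15],
    [8, 15, 27, 10],
]


def chi_square_independence(table):
    """Return (chi2, df, expected) for a chi-square test of independence."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    # Expected count for each cell under independence.
    expected = [[r * c / n for c in col_totals] for r in row_totals]
    chi2 = sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(table, expected)
        for o, e in zip(obs_row, exp_row)
    )
    df = (len(table) - 1) * (len(table[0]) - 1)
    return chi2, df, expected


chi2, df, expected = chi_square_independence(observed)
n = sum(map(sum, observed))
phi = math.sqrt(chi2 / n)
# Cramér's V rescales by the smaller of (rows - 1) and (columns - 1).
k = min(len(observed) - 1, len(observed[0]) - 1)
cramers_v = math.sqrt(chi2 / (n * k))

print(f"chi2={chi2:.2f} df={df} phi={phi:.2f} V={cramers_v:.2f}")
```

The `expected` table returned here is the same set of f[e] values the contingency test uses internally, which is what any follow-up cell-by-cell comparison would start from.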

Posted: 11/12/2012 10:43:20 PM EDT

[#1]

87?

Posted: 11/13/2012 3:06:11 AM EDT

[#2]

Quoted:

87?

That answer + your avatar = hilarious.

Posted: 11/13/2012 5:06:07 AM EDT

[#3]

ARFCOM can't even tell you what 48/2(9+3) is or whether or not you should (always) take what's behind the other curtain...and you ask that here?

Posted: 11/13/2012 5:09:54 AM EDT

[#4]

Quoted:
ARFCOM can't even tell you what 48/2(9+3) is or whether or not you should (always) take what's behind the other curtain...and you ask that here?

that would be 2

Posted: 11/13/2012 5:25:41 AM EDT

[#5]

Quoted:

Quoted:
ARFCOM can't even tell you what 48/2(9+3) is or whether or not you should (always) take what's behind the other curtain...and you ask that here?

that would be 288
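
The two answers come from reading the implied multiplication differently. Python has no implied multiplication, so this small sketch has to spell out each reading explicitly; it does not claim either grouping is the "right" one.

```python
# Reading 1: divide first, then multiply, strictly left to right.
left_to_right = 48 / 2 * (9 + 3)

# Reading 2: treat 2(9+3) as a single term in the denominator.
grouped = 48 / (2 * (9 + 3))

print(left_to_right, grouped)  # 288.0 2.0
```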

Posted: 11/13/2012 5:35:10 AM EDT

[#6]

Quoted:

ARFCOM can't even tell you what 48/2(9+3) is or whether or not you should (always) take what's behind the other curtain...and you ask that here?

bah, we have a ton of legit math guys on the board, as well as a few professional statisticians.

Posted: 11/13/2012 6:25:43 PM EDT

[#7]

i'm going to give this one bump, then let it die if no one has any suggestions.

Posted: 11/13/2012 6:27:06 PM EDT

[#8]

87 percent of all statistics are made up.

Posted: 11/13/2012 6:29:01 PM EDT

[#9]

My statistical knowledge is restricted to insurance, but it seems to me like you could build a linear model by state and by category and do significance testing on the state factors.

Posted: 11/13/2012 6:32:53 PM EDT

[#10]

If I could rephrase your question thusly:
"Given a certain industry, are the HQs more likely to be in certain states than others?"
-You could do a separate chi-squared analysis by industry.
-You could do a t-test by industry and see if any of the states are outliers

Posted: 11/13/2012 6:43:36 PM EDT

[#11]

Quoted:

If I could rephrase your question thusly:

"Given a certain industry, are the HQs more likely to be in certain states than others?"

-You could do a separate chi-squared analysis by industry.

-You could do a t-test by industry and see if any of the states are outliers

that's actually the direction i was thinking, but i wasn't sure how to derive expected frequencies for the individual chi-square GoF tests. i had also considered doing one-sample t-tests by state, but i was having trouble setting up the null hypothesis.

my question was sloppy; your formulation is much better. thanks!

Posted: 11/13/2012 7:53:02 PM EDT

[#12]

Quoted:

If I could rephrase your question thusly:
"Given a certain industry, are the HQs more likely to be in certain states than others?"
-You could do a separate chi-squared analysis by industry.
-You could do a t-test by industry and see if any of the states are outliers

that's actually the direction i was thinking, but i wasn't sure how to derive expected frequencies for the individual chi-square GoF tests. i had also considered doing one-sample t-tests by state, but i was having trouble setting up the null hypothesis.

my question was sloppy; your formulation is much better. thanks!

The expected frequency for the individual chi-squared tests would be the average frequency across that industry, wouldn't it? If your data looks like this:
Industry 1
AL: 5
CA: 10
TX: 15
The expected would be 10 in each state.

I'm pretty rusty on t-tests, but as a sort of "soft" test, I guess you could determine, say, the 95% confidence interval for the frequency in each state, and then see what percentage of states actually fall outside that range and compare that percentage to 5%.
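
The AL/CA/TX example above can be worked through as a goodness-of-fit test against equal expected counts. This is a minimal sketch under that equal-frequency assumption. The helper name `gof_uniform` is invented for illustration, and the p-value uses the closed form exp(-chi2 / 2), which holds only for 2 degrees of freedom (three states).

```python
import math

# Counts from the example above.
counts = {"AL": 5, "CA": 10, "TX": 15}


def gof_uniform(observed):
    """Chi-square goodness of fit against equal expected counts."""
    expected = sum(observed) / len(observed)
    chi2 = sum((o - expected) ** 2 / expected for o in observed)
    df = len(observed) - 1
    return expected, chi2, df


expected, chi2, df = gof_uniform(list(counts.values()))

# Survival function of a chi-square with exactly 2 degrees of freedom.
p_value = math.exp(-chi2 / 2)

print(expected, chi2, df, round(p_value, 3))  # 10.0 5.0 2 0.082
```

With these counts the statistic is 5.0 on 2 degrees of freedom, p of about 0.082, so this toy example would not reach the 0.05 level.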

Posted: 11/13/2012 8:23:25 PM EDT

[#13]

Quoted:

Quoted:

that's actually the direction i was thinking, but i wasn't sure how to derive expected frequencies for the individual chi-square GoF tests.

The expected frequency for the individual chi-squared tests would be the average frequency across that industry, wouldn't it? If your data looks like this:

Industry 1

AL: 5

CA: 10

TX: 15

The expected would be 10 in each state.

that makes sense. i'll need to sit down with my data tomorrow and confirm that that matches up with my derived expected freqs from the contingency test, because i'm going to get murdered if i present two different expected freqs. that's why i was thinking about going to the t-test, so that i could just present a t-score in results. but then i would have to compare a value to an average, instead of a mean to mu.

thanks for taking the time with this. you might be amused to know that my professor's father wrote a "stats for insurance professionals" textbook.
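
On the worry about presenting two different sets of expected frequencies: the equal-split expectation and the contingency-table expectation generally agree only when every state has the same overall total. A minimal sketch with a made-up table shows the two side by side for one industry. Industry 1 reuses the AL/CA/TX example from earlier in the thread; Industry 2 and the helper names are hypothetical.

```python
states = ["AL", "CA", "TX"]

# Industry 1 reuses the thread's example; Industry 2 is invented.
table = {
    "Industry 1": [5, 10, 15],
    "Industry 2": [15, 30, 15],
}

# Column (state) totals and the grand total across all industries.
state_totals = [sum(col) for col in zip(*table.values())]
grand_total = sum(state_totals)


def expected_uniform(row):
    """Equal split of the industry's total across states."""
    return [sum(row) / len(row)] * len(row)


def expected_from_margins(row):
    """Contingency-style expectation: row total x state total / N."""
    return [sum(row) * s / grand_total for s in state_totals]


row = table["Industry 1"]
uniform = expected_uniform(row)
marginal = expected_from_margins(row)

for state, o, u, m in zip(states, row, uniform, marginal):
    print(f"{state}: observed={o} equal-split={u:.2f} from-margins={m:.2f}")
```

Using the margin-based expectations for each per-industry comparison keeps them identical to the expected counts from the full contingency table, and each industry's (O - E)^2 / E terms are then exactly its share of the overall chi-square statistic.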

[ARCHIVED THREAD] - Question for statisticians