
There are 4 groups (A, B, C, D), each containing a different number of males and females:

male_A = 46
male_B = 241
male_C = 202
male_D = 113

female_A = 43
female_B = 134
female_C = 100
female_D = 53

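How can I identify the groups whose proportions of males and females are statistically different? Suggestions using MATLAB would be much appreciated.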

Possible solution (please check):

% 1st row: male
% 2nd row: female
cont = [46   241   202   113;
        43   134   100    53]

mychi(cont)


% Compute the chi-square statistic of a contingency table
function mychi(cont)
    % Append the row totals as a last column and the column totals as a last row
    cont = [cont, sum(cont,2)];
    cont = [cont; sum(cont,1)];

    nRows = size(cont,1) - 1;
    nCols = size(cont,2) - 1;
    O_E_2 = zeros(nRows, nCols);   % per-cell contributions (O - E)^2 / E

    for i = 1 : nRows
        for j = 1 : nCols
            Observed = cont(i,j);
            % Expected count = row total * column total / grand total
            Expected = cont(i,end)*cont(end,j)/cont(end,end);
            O_E_2(i,j) = (Observed - Expected)^2/Expected;
        end
    end

    DOF = (nRows-1)*(nCols-1)
    CHI = sum(O_E_2(:))
end

The returned CHI should be compared with the critical chi-square value for p < 0.05, which can be found here.

In my case:

DOF =

     3


CHI =

    8.0746

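CHI > 7.815 (the upper-tail critical value for DOF = 3 at p = 0.05; 0.352 is the lower-tail value and is not the one to compare against), so the proportions of males and females differ between these groups...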
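Instead of looking the statistic up in a table, the p-value can be computed directly. The following is a minimal sketch, assuming only base MATLAB: it uses the identity that the upper tail of a chi-square distribution equals the upper regularized incomplete gamma function, `gammainc(x/2, dof/2, 'upper')`. The variable names `expected`, `chi2stat`, `dof` and `pval` are illustrative and not part of the original post.

```matlab
% Observed counts: row 1 = male, row 2 = female; columns = groups A..D
cont = [46 241 202 113;
        43 134 100  53];

% Expected counts under independence: row total * column total / grand total
expected = sum(cont, 2) * sum(cont, 1) / sum(cont(:));

% Pearson chi-square statistic and degrees of freedom
chi2stat = sum((cont(:) - expected(:)).^2 ./ expected(:));
dof      = (size(cont, 1) - 1) * (size(cont, 2) - 1);

% Upper-tail probability of the chi-square distribution
pval = gammainc(chi2stat / 2, dof / 2, 'upper');

fprintf('chi2 = %.4f, dof = %d, p = %.4f\n', chi2stat, dof, pval);
```

If the Statistics and Machine Learning Toolbox is available, `1 - chi2cdf(chi2stat, dof)` should give the same p-value, and `chi2inv(0.95, dof)` the upper-tail critical value. Note that this test only says whether the proportions differ somewhere across the groups, not which group is responsible.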


2 Answers


I'm not sure what comparison you are looking for, but you can obtain the ratios as follows:

p = 0.05;   % significance level for the subsequent analysis

ratio_A = male_A ./ (male_A + female_A);
ratio_B = male_B ./ (male_B + female_B);
ratio_C = male_C ./ (male_C + female_C);
ratio_D = male_D ./ (male_D + female_D);

% Once you have the ratios, you can analyze them by following the instructions at
% http://au.mathworks.com/help/stats/hypothesis-testing.html
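As one possible next step with these ratios, the sketch below attaches an approximate 95% confidence interval to each group's male ratio and checks whether the pooled ratio falls inside it. It assumes the normal (Wald) approximation with z = 1.96, which is my choice for illustration and not something the linked documentation prescribes; the vectors `males`, `females` and the other names are hypothetical. Treat it as an informal check rather than a formal test.

```matlab
% Counts per group, in the order A, B, C, D
males   = [46 241 202 113];
females = [43 134 100  53];
labels  = 'ABCD';

n     = males + females;            % group sizes
ratio = males ./ n;                 % male ratio per group

% Wald standard error and approximate 95% confidence interval
se     = sqrt(ratio .* (1 - ratio) ./ n);
z      = 1.96;
ciLow  = ratio - z * se;
ciHigh = ratio + z * se;

% Pooled male ratio over all groups
pooled = sum(males) / sum(n);

for k = 1:numel(n)
    inside = pooled >= ciLow(k) && pooled <= ciHigh(k);
    fprintf('%s: ratio = %.3f, CI = [%.3f, %.3f], contains pooled (%.3f): %d\n', ...
            labels(k), ratio(k), ciLow(k), ciHigh(k), pooled, inside);
end
```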

Hope this helps.

Answered 2015-08-24T07:52:25.913

I suggest arranging your data in a matrix and using the appropriate indexing for what you need. Here is an example:

male_A = 46;
male_B = 241;
male_C = 202;
male_D = 113;

female_A = 43;
female_B = 134;
female_C = 100;
female_D = 53;

matrix = [male_A female_A;
    male_B female_B;
    male_C female_C;
    male_D female_D];

groups = ['A', 'B', 'C', 'D'];

total = (matrix(:,1)+matrix(:,2));
male_percentage = matrix(:,1)./total*100
female_percentage = matrix(:,2)./total*100

threshold = 65;   % Example threshold: 65%
male_above_threshold = groups(male_percentage>threshold)
female_above_threshold = groups(female_percentage>threshold)

maximum_male_ratio = groups(male_percentage==max(male_percentage))
maximum_female_ratio = groups(female_percentage==max(female_percentage))

For your example, this gives:

male_percentage =

   51.6854
   64.2667
   66.8874
   68.0723


female_percentage =

   48.3146
   35.7333
   33.1126
   31.9277


male_above_threshold =

CD


female_above_threshold =

   Empty string: 1-by-0



maximum_male_ratio =

D


maximum_female_ratio =

A

Finding the groups that are statistically different is a separate problem. You would need to provide more information for that.
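One common way to approach that separate problem is a post-hoc look at the contingency table's adjusted standardized residuals. The sketch below is one possible approach under stated assumptions, not the method of either poster: it assumes a Bonferroni correction over the four groups at an overall level of 0.05 and a normal approximation for the residuals, and it uses only base MATLAB (`erfcinv` for the two-sided normal critical value). All variable names are illustrative.

```matlab
% Observed counts: row 1 = male, row 2 = female; columns = groups A..D
cont   = [46 241 202 113;
          43 134 100  53];
groups = 'ABCD';

N        = sum(cont(:));
rowTot   = sum(cont, 2);                 % 2-by-1
colTot   = sum(cont, 1);                 % 1-by-4
expected = rowTot * colTot / N;          % 2-by-4 expected counts

% Adjusted standardized residuals: (O - E) / sqrt(E * (1 - row/N) * (1 - col/N))
adjRes = (cont - expected) ./ ...
         sqrt(expected .* ((1 - rowTot / N) * (1 - colTot / N)));

% With two rows, the male and female residuals of a group are opposite
% in sign, so testing the male row covers each group once
maleRes = adjRes(1, :);

% Bonferroni-adjusted two-sided critical value of the standard normal
alphaAdj = 0.05 / numel(groups);
zCrit    = sqrt(2) * erfcinv(alphaAdj);

differing = groups(abs(maleRes) > zCrit)
```

A group is flagged when the absolute residual exceeds `zCrit`; a negative male residual means fewer males than expected under equal proportions. Other corrections (for example Holm) or pairwise 2-by-2 tests would be equally reasonable choices.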

Answered 2015-08-24T10:26:35.650