13

我在matlab中有逻辑回归成本的代码:

function [J, grad] = costFunction(theta, X, y)

m = length(y); % number of training examples
thetas = size(theta,1);
features = size(X,2);
steps = 100;
alpha = 0.1;

J = 0;
grad = zeros(size(theta));


sums = [];
result = 0;

for i=1:m

%    sums = [sums; (y(i))*log10(sigmoid(X(i,:)*theta))+(1-y(i))*log10(1-sigmoid(X(i,:)*theta))]

    sums = [sums; -y(i)*log(sigmoid(theta'*X(i,:)'))-(1-y(i))*log(1-sigmoid(theta'*X(i,:)'))];

    %use log simple not log10, mistake
end

result = sum(sums);
J = (1/m)* result;


%gradient one step

tempo = [];
thetas_update = 0;
temp_thetas = [];


grad = temp_thetas;

for i = 1:size(theta)
    for j = 1:m
        tempo(j) = (sigmoid(theta'*X(j,:)')-y(j))*X(j,i);
    end
    temp_thetas(i) = sum(tempo);
    tempo = [];
end

grad = (1/m).*temp_thetas;

% =============================================================

end

我需要矢量化它,但我不知道它是怎么做的,为什么?我是一名程序员,所以我喜欢for's。但是要矢量化它,我是空白的。有什么帮助吗?谢谢。

4

2 回答 2

37
function [J, grad] = costFunction(theta, X, y)
hx = sigmoid(X * theta);
m = length(X);

J = (-y' * log(hx) - (1 - y')*log(1 - hx)) / m;
grad = X' * (hx - y) / m;

end
于 2013-11-12T19:56:35.993 回答
2

是代码应该是

    function [J, grad] = costFunction(theta, X, y)

    hx = sigmoid(X' * theta);
    m = length(X);

    J = sum(-y * log(hx) - (1 - y)*log(1 - hx)) / m;
    grad = X' * (hx - y) / m;


    end
于 2016-03-18T09:13:38.937 回答