ocaml - OCaml 中的逻辑回归

Question

我试图在 OCaml 中使用逻辑回归。我需要将它用作我正在解决的另一个问题的黑盒。我找到了以下网站：

http://math.umons.ac.be/anum/en/software/OCaml/Logistic_Regression/

我将此站点中的以下代码（进行了一些修改 - 我定义了自己的 iris_features 和 iris_label）粘贴到名为logistic_regression.ml 的文件中：

open Scanf
open Format
open Bigarray
open Lacaml.D

let log_reg ?(lambda=0.1) x y =
  (* [f_df] returns the value of the function to maximize and store
     its gradient in [g]. *)
  let f_df w g =
    let s = ref 0. in
    ignore(copy ~y:g w); (* g ← w *)
    scal (-. lambda) g;  (* g = -λ w *)
    for i = 0 to Array.length x - 1 do
      let yi = float y.(i) in
      let e = exp(-. yi *. dot w x.(i)) in
      s := !s +. log1p e;
      axpy g ~alpha:(yi *. e /. (1. +. e)) ~x:x.(i);
    done;
    -. !s -. 0.5 *. lambda *. dot w w
  in
  let w = Vec.make0 (Vec.dim x.(0)) in
  ignore(Lbfgs.F.max f_df w);
  w


let iris_features = [1 ; 2 ; 3] ;;
let iris_labels = 2 ;;

let proba w x y = 1. /. (1. +. exp(-. float y *. dot w x))
let () =
  let sol = log_reg iris_features iris_labels in
  printf "w = %a\n" Lacaml.Io.pp_fvec sol;
  let nwrongs = ref 0 in
  for i = 0 to Array.length iris_features - 1 do
    let p = proba sol iris_features.(i) iris_labels.(i) in
    printf "Label = %i prob = %g => %s\n" iris_labels.(i) p
      (if p > 0.5 then "correct" else (incr nwrongs; "wrong"))
  done;
  printf "Number of wrong labels: %i\n" !nwrongs

我有以下问题：

在尝试编译代码时，我收到错误消息：“ Error: Unbound module Lacaml”。我已经安装了 Lacaml；做了几次 opam init，试图提供一个标志 -package = Lacaml ；我不知道如何解决这个问题？
如您所见，我已经定义了自己的 iris_features 和 iris_labels 版本——类型是否正确，即函数 log_reg 中的类型是 x int 列表，而 y 的类型是 int？

score 1 · Accepted Answer

iris_features和iris_labels是 OCaml 中的数组和数组字面量都用 , 样式括号分隔，[|例如|]，


let iris_features = [|(* I don't know what to put here*)|]
let iris_labels = [|2|]

该iris_features数组有 type vec array，即一个向量数组，而不是一个整数数组，而且我没有挖得太深不知道该放什么，但语法如下，

let iris_features =[|
  Vec.of_list [1.; 2.; 3.;];
  Vec.of_list [4.; 5.; 6.;];
|]

自从编写代码以来，Lacaml 接口发生了一些变化，axpy不再接受带标签的~x参数（x 和 y 向量现在都是位置的）所以你需要删除~x并修复顺序（我假设它x.(i)在x表达式中a*x + y并且g对应于y，例如,

 axpy ~alpha:(yi *. e /. (1. +. e)) x.(i) g;

这段代码也依赖于lbfgs，所以你也需要安装它，

opam depext --install lbfgs

我建议您使用dune作为您的默认构建系统，但对于快速原型设计，您可以使用ocamlbuild. 将您的代码放入一个名为的文件中的空文件夹中regress.ml（您可以选择其他名称，只需相应地更新构建指令），现在您可以将其构建为本机可执行文件，如

ocamlbuild -pkg lacaml -pkg lbfgs regress.native

运行它

./regress.native

如果您在 OCaml 顶层（也称为解释器，即在ocaml解释器中运行代码）中进行游戏，您可以加载lacaml并lbfgs使用以下两个指令：

#use "topfind";;
#require "lacaml.top";;
#require "lbfgs";;

（#不是提示，而是指令语法的一部分，所以不要忘记键入它）。

现在您可以将代码复制粘贴到解释器中并使用它。

奖励轨道 - 用沙丘建造

创建一个空文件夹并放在regress.ml那里。
remove open Bigarrayand open Scanfas dune 对警告非常严格，并将它们变成错误（它会在这些行上警告你，因为它们实际上是未使用的）
创建沙丘项目

dune init exe regress --libs lacaml,lbfgs

构建并运行

dune exec ./regress.exe

ocaml - OCaml 中的逻辑回归

1 回答 1

奖励轨道 - 用沙丘建造

Related

Reference