11

我正在尝试whalebot使用 tar 文件配置爬虫whalebot-0.02.00.tar.gz。我已经正确提取了它:

root@Admin1:~/dls# tar xvzf whalebot-0.02.00.tar.gz

之后我想配置它:

root@Admin1:~/dls/whalebot# ./configure

它给了我错误:

bash: ./configure: 没有这样的文件或目录

我也运行了命令:

root@Admin1:~/dls/whalebot# cmake ./

它给了我以下结果:

root@Admin1:~/dls/whalebot# cmake ./
-- The C compiler identification is GNU
-- The CXX compiler identification is GNU
-- Check for working C compiler: /usr/bin/gcc
-- Check for working C compiler: /usr/bin/gcc -- works
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - done
-- Check for working CXX compiler: /usr/bin/c++
-- Check for working CXX compiler: /usr/bin/c++ -- works
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Boost version: 1.44.0
-- Found the following Boost libraries:
-- filesystem
-- system
-- thread
-- program_options
-- date_time
CMake Warning (dev) at webspider/CMakeLists.txt:25 (link_directories):
This command specifies the relative path

../statsem_string/bin

as a link directory.

Policy CMP0015 is not set: link_directories() treats paths relative to the
source dir. Run "cmake --help-policy CMP0015" for policy details. Use the
cmake_policy command to set the policy and suppress this warning.
This warning is for project developers. Use -Wno-dev to suppress it.

CMake Warning (dev) at webspider/CMakeLists.txt:25 (link_directories):
This command specifies the relative path

../3dparty/google-url

as a link directory.

Policy CMP0015 is not set: link_directories() treats paths relative to the
source dir. Run "cmake --help-policy CMP0015" for policy details. Use the
cmake_policy command to set the policy and suppress this warning.
This warning is for project developers. Use -Wno-dev to suppress it.

CMake Error: The following variables are used in this project, but they are set to NOTFOUND.
Please set them or make sure they are set and tested correctly in the CMake files:
HTMLCXX_LIBRARY
linked by target "whalebot" in directory /root/dls/whalebot/webspider

-- Configuring incomplete, errors occurred!

我该如何进行?

4

2 回答 2

8

CMake 似乎无法找到 htmlcxx 库。

whalebot 文档中,htmlcxx 被列为依赖项。


你需要下载 htmlcxx,解压,然后安装:

cd <path to unzipped htmlcxx>
./configure --enable-static=on --enable-shared=off
make
sudo make install

您可能需要添加#include <cstddef>到顶部html/tree.h才能使其成功构建。它将usr/local/默认安装到。


如果您还没有安装icu , 您还需要安装它:

sudo apt-get install libicu-dev


最后,您现在可以构建和安装whalebot。同样,如果你有一个相当最新的 boost 安装,make 可能会失败。

在第 57 行中webspider/src/webspider_options.cpp,您需要替换boost::filesystem::initial_path().native_directory_string()boost::filesystem::initial_path().string(). 然后你应该很好地构建和安装:

cd <path to unzipped whalebot>
mkdir build
cd build
cmake .. -DCMAKE_BUILD_TYPE=Release
make
sudo make install

这也将usr/local/默认安装到。

于 2012-05-23T10:43:39.753 回答
2

检查您是否在 CMakeLists 中编写了find_library(..)find_path(.),然后将其替换为find_package(..).

它解决了我的错误。

于 2020-08-26T08:23:24.587 回答