“portia”的相关标签问题_Stack Overflow中文网

0 投票

1 回答

719 浏览

python - Portia，如何将数据保存到数据库？

在portia中，我想将数据保存到Mysql之类的数据库中或者做一些事情来清理数据，但是我不知道该怎么做，你能给我一些建议吗？我是scrapy的新手，在线等，非常感谢！

2014-12-21T14:55:59.313

0 投票

1 回答

494 浏览

javascript - 如何在 portia 中呈现 javascript 页面？

我正在使用 portia 来使用 scrapinghub/splash 中间件渲染 JavaScript 页面。但在 portia 中加载作业页面时似乎出现以下错误。

错误：

您的网络浏览器必须启用 JavaScript 才能正确显示此应用程序。

平台：portia-scrapy + scrapinghub/splash。

请让我知道如何解决 mozila firefox 中的此错误。

注意：我也尝试过以下说明：

javascript python-2.7 scrapy portia scrapinghub

user4443904

2015-01-19T13:56:23.900

0 投票

1 回答

209 浏览

python-2.7 - 如何在 Portia scrapy 下拉列表中添加默认字段名称？

我已经从（https://github.com/scrapinghub/portia）下载了 Portia 并在我的 Windows 机器上安装了 Portia，同时启动 Portia 我可以注释页面。

如何添加默认字段下拉列表

我可以使用创建新选项根据需要选择字段并添加名称。

我的问题是我们如何添加默认字段名称，这样我就可以从下拉框中选择它而不是输入名称，而且它也是通用的。

例如，

在下拉列表中，我需要字段名称列表，例如，

职位名称、职位描述、职位位置

谁能帮助我，如何默认添加归档名称而不是创建新选项。

提前致谢。

python-2.7 web-scraping scrapy web-crawler portia

2015-01-20T05:42:06.220

0 投票

1 回答

535 浏览

python-2.7 - How to use regex in Portia visual scrapy?

I can able to annotate the web pages using Portia web crawler, my question is how can use the Regex while extracting the data.

For Example,

I have extracted Location filed from a page

Output looks like,

Location : Location xyz,abc

enter image description here

But I need only the xyz,abc values.

I have googled for solutions, but not getting more information.

Could you explain about regex in Portia scrapy?

python-2.7 web-crawler scrapy-spider portia

2015-01-21T16:18:38.327

0 投票

1 回答

945 浏览

macos - 尝试在 OSX 或 Ubuntu 上安装 Portia

有人可以帮助我吗？我一遍又一遍地安装 Portia。一切都很顺利，直到我使用了 twistd 命令并得到了这个：

(portia)Matts-Mac-mini:slyd matt$ twistd -n slyd Traceback (most> 最近调用最后一次): File "/Users/matt/portia/bin/twistd", line 14, in run() File "/Users /matt/portia/lib/python2.7/site-packages/twisted/scripts/twistd.py”，第 27 行，在运行 app.run(runApp, ServerOptions) 文件“/Users/matt/portia/lib/python2. 7/site-packages/twisted/application/app.py”，第 642 行，在运行 runApp(config) 文件“/Users/matt/portia/lib/python2.7/site-packages/twisted/scripts/twistd.py ”，第 23 行，runApp _SomeApplicationRunner(config).run() 文件“/Users/matt/portia/lib/python2.7/site-packages/twisted/application/app.py”，第 376 行，运行 self。 application = self.createOrGetApplication() 文件“/Users/matt/portia/lib/python2.7/site-packages/twisted/application/app.py”，第 436 行，在 createOrGetApplication ser = plg.makeService(self.config.subOptions) 文件“/Users/matt/portia/portia/slyd/slyd/tap.py”，第 74 行，在 makeService root = create_root(config) 文件“/Users/matt/portia/portia/ slyd/slyd/tap.py”，第 41 行，在 create_root from .projectspec 导入 create_project_resource 文件“/Users/matt/portia/portia/slyd/slyd/projectspec.py”，第 5 行，从 slybot.validation.schema 导入get_schema_validator

ImportError：没有名为 slybot.validation.schema 的模块。

我还注意到，即使我在正确的目录（[virtualenv-name]/portia/slyd）中尝试执行“pip install -r requirements.txt”，requirements.txt 文件不在 slyd 目录中，但是在 portia 目录中。

我在这里发疯了，非常感谢任何帮助。

macos python-2.7 ubuntu portia

2015-02-01T06:17:33.623

0 投票

1 回答

413 浏览