我正在尝试创建一个混合风暴拓扑,它使用基于 Java 的 spout 和基于 python 的 bolt。
对于基于 python 的螺栓,我编写了一个 Java 包装器:
class PythonBolt extends ShellBolt implements IRichBolt {
public PythonBolt() {
super("python", "C:\\somepath\\sample.py");
}
@Override
public void declareOutputFields(OutputFieldsDeclarer outputFieldsDeclarer) {
}
@Override
public Map<String, Object> getComponentConfiguration() {
return null;
}
}
这就是我的 sample.py 的样子:
进口风暴
class SplitSentenceBolt(storm.BasicBolt):
def process(self, tup):
print("Python rocks!")
words = tup.values[0].split(" ")
print(tup.values[0])
SplitSentenceBolt().run()
然后我将它们放在一起并尝试通过以下代码段运行:
public class SampleBolt {
public static void main(String[] args) throws Exception {
TopologyBuilder builder = new TopologyBuilder();
builder.setSpout("Hello", new RawDataLevelSpout(), 12);
builder.setBolt("World", new PythonBolt(), 12);
Config config = new Config();
config.setDebug(true);
LocalCluster cluster = new LocalCluster();
cluster.submitTopology("Hello-World-BaiJian", config, builder.createTopology());
Utils.sleep(100000);
cluster.killTopology("Hello-World-BaiJian");
cluster.shutdown();
}
}
这一切都正确启动,但是,我得到以下异常:
import storm
ModuleNotFoundError: No module named 'storm'
at org.apache.storm.utils.ShellProcess.launch(ShellProcess.java:94) ~[storm-client-2.0.0-SNAPSHOT.jar:2.0.0-SNAPSHOT]
at org.apache.storm.task.ShellBolt.prepare(ShellBolt.java:154) ~[storm-client-2.0.0-SNAPSHOT.jar:2.0.0-SNAPSHOT]
at org.apache.storm.executor.bolt.BoltExecutor.init(BoltExecutor.java:84) ~[storm-client-2.0.0-SNAPSHOT.jar:2.0.0-SNAPSHOT]
at org.apache.storm.executor.bolt.BoltExecutor.call(BoltExecutor.java:93) ~[storm-client-2.0.0-SNAPSHOT.jar:2.0.0-SNAPSHOT]
at org.apache.storm.executor.bolt.BoltExecutor.call(BoltExecutor.java:45) ~[storm-client-2.0.0-SNAPSHOT.jar:2.0.0-SNAPSHOT]
at org.apache.storm.utils.Utils$2.run(Utils.java:329) ~[storm-client-2.0.0-SNAPSHOT.jar:2.0.0-SNAPSHOT]
关于如何克服这个问题的任何提示?如何安装那个 python Storm 包?是否可以通过 Anaconda 安装它(我找不到包)?