我正在测量 kafka 生产者生产者的表现。目前我遇到了两个配置和使用略有不同的客户:
常见的:
def buildKafkaConfig(hosts: String, port: Int): Properties = {
val props = new Properties()
props.put("metadata.broker.list", brokers)
props.put("serializer.class", "kafka.serializer.StringEncoder")
props.put("producer.type", "async")
props.put("request.required.acks", "0")
props.put("queue.buffering.max.ms", "5000")
props.put("queue.buffering.max.messages", "2000")
props.put("batch.num.messages", "300")
props
}
第一个客户:
"org.apache.kafka" % "kafka_2.11" % "0.8.2.2"
用法:
val kafkaConfig = KafkaUtils.buildKafkaConfig("kafkahost", 9092)
val producer = new Producer[String, String](new ProducerConfig(kafkaConfig))
// ... somewhere in code
producer.send(new KeyedMessage[String, String]("my-topic", data))
第二个客户:
"org.apache.kafka" % "kafka-clients" % "0.8.2.2"
用法:
val kafkaConfig = KafkaUtils.buildKafkaConfig("kafkahost", 9092)
val producer = new KafkaProducer[String, String](kafkaConfig)
// ... somewhere in code
producer.send(new ProducerRecord[String, String]("my-topic", data))
我的问题是:
- 2个客户有什么区别?
- 对于大规模应用程序,我应该配置哪些属性,以实现最佳的高重写入性能?