Kafka Installation
yangyp8110 edited this page Jan 17, 2018 · 1 revision
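- Official reference: kafka
- server1: 192.168.74.129 (Kafka is installed here)
- server2: 192.168.74.131 (Kafka is installed here)
- A ZooKeeper cluster that is already installed and running
a. Download from http://kafka.apache.org/downloads.html; the official site recommends the Scala 2.11 build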
[root@yyp softpackages]# wget http://mirror.bit.edu.cn/apache/kafka/0.11.0.0/kafka_2.11-0.11.0.0.tgz
b. Extract the archive (the target directory must already exist)
[root@yyp softpackages]# tar zxvf kafka_2.11-0.11.0.0.tgz -C /usr/local/install/kafka/
c. Kafka directory layout
[root@yyp kafka]# cd kafka_2.11-0.11.0.0/
[root@yyp kafka_2.11-0.11.0.0]# ll
total 48
drwxr-xr-x. 3 root root 4096 Jun 22 18:09 bin ## executable scripts for operating Kafka, including Windows scripts
drwxr-xr-x. 2 root root 4096 Jun 22 18:09 config ## configuration files
drwxr-xr-x. 2 root root 4096 Jul 16 03:24 libs ## dependency libraries
-rw-r--r--. 1 root root 28824 Jun 22 18:06 LICENSE
-rw-r--r--. 1 root root 336 Jun 22 18:06 NOTICE
drwxr-xr-x. 2 root root 46 Jun 22 18:10 site-docs
[root@yyp kafka_2.11-0.11.0.0]#
d. Looking through config and libs shows that Kafka ships with its own ZooKeeper; the bundled ZooKeeper is not used here
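The extract step fails when the target directory is missing, because tar -C does not create it. A minimal sketch of a wrapper that creates the directory first; extract_to is a hypothetical helper name, not something Kafka provides:

```shell
# extract_to ARCHIVE DEST
# Create DEST if needed, then unpack the gzip-compressed ARCHIVE into it.
extract_to() {
  local archive="$1" dest="$2"
  mkdir -p "$dest"               # tar -C does not create the directory itself
  tar zxf "$archive" -C "$dest"
}

# Example, using the paths from the steps above:
#   extract_to kafka_2.11-0.11.0.0.tgz /usr/local/install/kafka/
```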
a. Edit /etc/profile and append the following configuration at the end
# kafka
export KAFKA_HOME=/usr/local/install/kafka/kafka_2.11-0.11.0.0
export PATH=$PATH:$KAFKA_HOME/bin
# kafka
b. Apply the change immediately
source /etc/profile
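If this setup is repeated on several machines, appending to /etc/profile by hand can leave duplicate entries. A small sketch that adds the same three lines only once; add_kafka_env is a hypothetical helper name, and the file and install path are passed in as arguments:

```shell
# add_kafka_env PROFILE_FILE KAFKA_HOME_DIR
# Append the Kafka environment variables to PROFILE_FILE unless they are already there.
add_kafka_env() {
  local profile="$1" kafka_home="$2"
  # Nothing to do if an earlier run already wrote the export line.
  if grep -q '^export KAFKA_HOME=' "$profile" 2>/dev/null; then
    return 0
  fi
  {
    echo '# kafka'
    echo "export KAFKA_HOME=$kafka_home"
    echo 'export PATH=$PATH:$KAFKA_HOME/bin'   # kept literal so it expands at login
  } >> "$profile"
}

# Example:
#   add_kafka_env /etc/profile /usr/local/install/kafka/kafka_2.11-0.11.0.0
#   source /etc/profile
```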
c. Copy the Kafka directory to server2
[root@yyp kafka]# scp -P 1003 -r kafka_2.11-0.11.0.0/ root@192.168.74.131:/usr/local/install/
The authenticity of host '[192.168.74.131]:1003 ([192.168.74.131]:1003)' can't be established.
ECDSA key fingerprint is b8:cd:43:2b:f9:bc:01:a0:00:bd:83:39:ac:86:cb:c1.
Are you sure you want to continue connecting (yes/no)? y
Please type 'yes' or 'no': yes
Warning: Permanently added '[192.168.74.131]:1003' (ECDSA) to the list of known hosts.
root@192.168.74.131's password:
Configure server.properties
Reference
- Set the broker ID
broker.id is the broker's unique identifier; no two brokers in the cluster may share an ID
############################# Server Basics #############################
# The id of the broker. This must be set to a unique integer for each broker.
# Unique identifier of this broker; IDs must not repeat within the cluster
broker.id=0
# Switch to enable topic deletion or not, default value is false
#delete.topic.enable=true
- Change the log directory (first create a kafka-log-server folder under the Kafka installation directory)
############################# Log Basics #############################
# A comma separated list of directories under which to store log files
log.dirs=/usr/local/install/kafka/kafka_2.11-0.11.0.0/kafka-log-server
# The default number of log partitions per topic. More partitions allow greater
# parallelism for consumption, but this will also result in more files across
# the brokers.
num.partitions=1
# The number of threads per data directory to be used for log recovery at startup and flushing at shutdown.
# This value is recommended to be increased for installations with data dirs located in RAID array.
num.recovery.threads.per.data.dir=1
- Enable the listener port
############################# Socket Server Settings #############################
# The address the socket server listens on. It will get the value returned from
# java.net.InetAddress.getCanonicalHostName() if not configured.
# FORMAT:
# listeners = listener_name://host_name:port
# EXAMPLE:
# listeners = PLAINTEXT://your.host.name:9092
# Uncomment the following line
listeners=PLAINTEXT://:9092
- Modify zookeeper.connect
Point it at the ZooKeeper cluster
############################# Zookeeper #############################
# Zookeeper connection string (see zookeeper docs for details).
# This is a comma separated host:port pairs, each corresponding to a zk
# server. e.g. "127.0.0.1:3000,127.0.0.1:3001,127.0.0.1:3002".
# You can also append an optional chroot string to the urls to specify the
# root directory for all kafka znodes.
# Changed to the ZooKeeper cluster set up earlier
zookeeper.connect=192.168.74.129:2181,192.168.74.129:2182,192.168.74.129:2183
# Timeout in ms for connecting to zookeeper
zookeeper.connection.timeout.ms=6000
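Configure server.properties for Kafka on server2 in the same way. broker.id must be unique across the cluster; log.dirs only has to differ when several brokers run on the same machine.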
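Editing the copied file by hand on each broker is easy to get wrong, so the per-broker values can also be set with a small script. This is a sketch under stated assumptions: set_property is a hypothetical helper, and it assumes the values contain no "|" or "&" characters, since those are special in the sed replacement.

```shell
# set_property FILE KEY VALUE
# Replace an existing KEY line (even if commented out with a leading "#"),
# or append KEY=VALUE when the key is absent.
set_property() {
  local file="$1" key="$2" value="$3"
  local pattern="${key//./\\.}"            # escape dots so they match literally
  if grep -qE "^#?${pattern}=" "$file"; then
    sed -E "s|^#?${pattern}=.*|${key}=${value}|" "$file" > "$file.tmp" &&
      mv "$file.tmp" "$file"
  else
    echo "${key}=${value}" >> "$file"
  fi
}

# Example on server2 (values are illustrative):
#   set_property "$KAFKA_HOME/config/server.properties" broker.id 1
#   set_property "$KAFKA_HOME/config/server.properties" listeners PLAINTEXT://192.168.74.131:9092
```

The comment goes on its own line in the generated file because a properties file treats "#" as a comment marker only at the start of a line; text after a value on the same line becomes part of the value.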
- Start ZooKeeper first
- Start Kafka
- [root@yyp kafka_2.11-0.11.0.0]# bin/kafka-server-start.sh config/server.properties
or
- [root@yyp kafka_2.11-0.11.0.0]# bin/kafka-server-start.sh config/server.properties&
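- The trailing "&" runs the command in the background. The broker's log messages are still printed to the console unless the output is redirected; kafka-server-start.sh also accepts a -daemon option for a detached start.
- Startup output (if startup fails, see the error handling at the end of this page)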
……
-server/meta.properties (kafka.server.BrokerMetadataCheckpoint)
[2017-07-16 11:14:13,267] INFO Kafka version : 0.11.0.0 (org.apache.kafka.common.utils.AppInfoParser)
[2017-07-16 11:14:13,267] INFO Kafka commitId : cb8625948210849f (org.apache.kafka.common.utils.AppInfoParser)
[2017-07-16 11:14:13,267] INFO [Kafka Server 1], started (kafka.server.KafkaServer)
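To keep the broker running after logout and keep its output off the console, the background start can be combined with nohup and a redirect. A minimal sketch; start_kafka_background is a hypothetical helper name and the log file location is an arbitrary choice:

```shell
# start_kafka_background KAFKA_HOME_DIR LOG_FILE
# Start the broker in the background, send stdout and stderr to LOG_FILE,
# and print the PID of the background process.
start_kafka_background() {
  local kafka_home="$1" log_file="$2"
  nohup "$kafka_home/bin/kafka-server-start.sh" \
    "$kafka_home/config/server.properties" > "$log_file" 2>&1 &
  echo $!
}

# Example:
#   start_kafka_background "$KAFKA_HOME" /tmp/kafka-server.out
#   tail -f /tmp/kafka-server.out    # watch for the "started" line
```

The -daemon option of kafka-server-start.sh is an alternative to this wrapper.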
- Command to create a topic:
bin/kafka-topics.sh --create --zookeeper 192.168.74.129:2181 --replication-factor 2 --partitions 1 --topic testTopic
- The replication factor is set to 2 (server1 and server2); specifying 3 fails because only two brokers are available
- Command to list topics:
bin/kafka-topics.sh --list --zookeeper 192.168.74.129:2181
- Run on server2 to create the topic:
[root@localhost kafka_2.11-0.11.0.0]# bin/kafka-topics.sh --create --zookeeper 192.168.74.129:2181 --replication-factor 2 --partitions 1 --topic testTopic
Created topic "testTopic".
[root@localhost kafka_2.11-0.11.0.0]# bin/kafka-topics.sh --list --zookeeper 192.168.74.129:2181
testTopic
[root@localhost kafka_2.11-0.11.0.0]#
- Log in to server1 and list the topics
[root@yyp kafka]# cd kafka_2.11-0.11.0.0/
[root@yyp kafka_2.11-0.11.0.0]# bin/kafka-topics.sh --list --zookeeper 192.168.74.129:2181
testTopic
[root@yyp kafka_2.11-0.11.0.0]#
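Listing the topic on both servers shows that the metadata is shared, but not that messages flow. A sketch of a round trip using the console producer and consumer that ship in bin; send_and_read is a hypothetical helper name, and the broker address assumes the listener on port 9092 configured above:

```shell
# send_and_read KAFKA_HOME_DIR BROKER TOPIC MESSAGE
# Publish MESSAGE to TOPIC, then read the topic from the start and stop
# after the first message.
send_and_read() {
  local kafka_home="$1" broker="$2" topic="$3" message="$4"
  # The console producer reads messages line by line from standard input.
  echo "$message" | "$kafka_home/bin/kafka-console-producer.sh" \
    --broker-list "$broker" --topic "$topic"
  "$kafka_home/bin/kafka-console-consumer.sh" \
    --bootstrap-server "$broker" --topic "$topic" \
    --from-beginning --max-messages 1
}

# Example:
#   send_and_read "$KAFKA_HOME" 192.168.74.129:9092 testTopic "hello kafka"
```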
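broker.id: the unique identifier of each broker within the cluster; must be a non-negative integer
listeners (replaces host.name and port from earlier versions): be sure to bind a host name, otherwise puzzling errors can occur, such as consumers being unable to find the broker. This host name is the machine name of the Kafka server and is registered in ZooKeeper. Example: listeners=PLAINTEXT://192.168.74.129:9092
log.dirs: where Kafka stores its data. Separate multiple directories with commas; spreading them across different disks can improve read/write performance
log.retention.hours: how long data files are retained. Data older than this is cleaned up according to the policy set by log.cleanup.policy
zookeeper.connect: the ZooKeeper connect string in hostname:port form. Multiple servers are separated by commas, e.g. hostname1:port1,hostname2:port2,hostname3:port3. A path may be appended, e.g. hostname1:port1,hostname2:port2,hostname3:port3/kafka; in that case the /kafka node must be created in ZooKeeper beforehand, otherwise this error is reported: java.lang.IllegalArgumentException: Path length must be > 0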
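The zookeeper.connect format (a comma-separated server list, then one optional path that applies to the whole list) can be illustrated by splitting a connect string into its parts. A sketch only; zk_hosts and zk_chroot are hypothetical helper names:

```shell
# zk_hosts CONNECT_STRING: print each host:port entry on its own line.
zk_hosts() {
  local hosts="${1%%/*}"        # drop the optional path and everything after it
  echo "$hosts" | tr ',' '\n'
}

# zk_chroot CONNECT_STRING: print the appended path, or "/" when none is given.
zk_chroot() {
  case "$1" in
    */*) echo "/${1#*/}" ;;     # keep everything after the first "/"
    *)   echo "/" ;;
  esac
}

# Example:
#   zk_hosts  "192.168.74.129:2181,192.168.74.129:2182,192.168.74.129:2183/kafka"
#   zk_chroot "192.168.74.129:2181,192.168.74.129:2182,192.168.74.129:2183/kafka"
```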
Error description:
java.net.UnknownHostException: yyp.localdomain: yyp.localdomain: Name or service not known
at java.net.InetAddress.getLocalHost(InetAddress.java:1505)
at kafka.server.KafkaHealthcheck$$anonfun$1.apply(KafkaHealthcheck.scala:60)
at kafka.server.KafkaHealthcheck$$anonfun$1.apply(KafkaHealthcheck.scala:58)
at scala.collection.TraversableLike$$anonfun$map$1.apply(TraversableLike.scala:234)
at scala.collection.TraversableLike$$anonfun$map$1.apply(TraversableLike.scala:234)
at scala.collection.mutable.ResizableArray$class.foreach(ResizableArray.scala:59)
at scala.collection.mutable.ArrayBuffer.foreach(ArrayBuffer.scala:48)
at scala.collection.TraversableLike$class.map(TraversableLike.scala:234)
at scala.collection.AbstractTraversable.map(Traversable.scala:104)
at kafka.server.KafkaHealthcheck.register(KafkaHealthcheck.scala:58)
at kafka.server.KafkaHealthcheck.startup(KafkaHealthcheck.scala:50)
at kafka.server.KafkaServer.startup(KafkaServer.scala:280)
at kafka.server.KafkaServerStartable.startup(KafkaServerStartable.scala:38)
at kafka.Kafka$.main(Kafka.scala:65)
at kafka.Kafka.main(Kafka.scala)
Solution:
- The error says the host cannot be found. Make sure /etc/hosts maps the IP address to the machine's host name: edit /etc/hosts and add the host entry
192.168.74.129 master yyp.localdomain
- Alternatively, add the IP address to the listeners setting
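Before starting the broker, it can help to check whether the hosts file already lists the machine's host name. A minimal sketch; hosts_has_name is a hypothetical helper name, and it only inspects the given file, not DNS:

```shell
# hosts_has_name HOSTS_FILE NAME
# Succeed if NAME appears as a host name or alias on a non-comment line.
hosts_has_name() {
  local hosts_file="$1" name="$2"
  awk -v name="$name" '
    { sub(/#.*/, "") }                                        # drop comments
    { for (i = 2; i <= NF; i++) if ($i == name) found = 1 }   # field 1 is the IP
    END { exit (found ? 0 : 1) }
  ' "$hosts_file"
}

# Example:
#   hosts_has_name /etc/hosts "$(hostname)" || echo "add an entry for $(hostname)"
```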