Skip to content

kafka安装

yangyp8110 edited this page Jan 17, 2018 · 1 revision
  • 官方参考资料:kafka

环境

  • server1:192.168.74.129(用来安装kafka)
  • server2:192.168.74.131(用来安装kafka)
  • 已经安装启动好的zookeeper集群

安装

a. 下载地址:http://kafka.apache.org/downloads.html 官方推荐2.11版本

[root@yyp softpackages]# wget http://mirror.bit.edu.cn/apache/kafka/0.11.0.0/kafka_2.11-0.11.0.0.tgz

b. 解压缩

[root@yyp softpackages]# tar zxvf kafka_2.11-0.11.0.0.tgz -C /usr/local/install/kafka/

c. kafka文件目录

[root@yyp kafka]# cd kafka_2.11-0.11.0.0/
[root@yyp kafka_2.11-0.11.0.0]# ll
total 48
drwxr-xr-x. 3 root root  4096 Jun 22 18:09 bin  ## 操作kafka的可执行脚本,还包含windows下脚本
drwxr-xr-x. 2 root root  4096 Jun 22 18:09 config  ## 配置文件所在目录
drwxr-xr-x. 2 root root  4096 Jul 16 03:24 libs  ## 依赖库目录
-rw-r--r--. 1 root root 28824 Jun 22 18:06 LICENSE
-rw-r--r--. 1 root root   336 Jun 22 18:06 NOTICE
drwxr-xr-x. 2 root root    46 Jun 22 18:10 site-docs
[root@yyp kafka_2.11-0.11.0.0]# 

d. 查看conf和lib,发现kafka自带了zookeeper,这里不使用自带的zookeeper

配置

配置环境变量

a. 编辑/etc/profile,末尾加入如下配置

# kafka
export KAFKA_HOME=/usr/local/install/kafka/kafka_2.11-0.11.0.0
export PATH=$PATH:$KAFKA_HOME/bin
# kafka

b. 立即生效

source /etc/profile

将kafka解压缩文件同步到server2

[root@yyp kafka]# scp -P 1003 -r kafka_2.11-0.11.0.0/ root@192.168.74.131:/usr/local/install/
The authenticity of host '[192.168.74.131]:1003 ([192.168.74.131]:1003)' can't be established.
ECDSA key fingerprint is b8:cd:43:2b:f9:bc:01:a0:00:bd:83:39:ac:86:cb:c1.
Are you sure you want to continue connecting (yes/no)? y
Please type 'yes' or 'no': yes
Warning: Permanently added '[192.168.74.131]:1003' (ECDSA) to the list of known hosts.
root@192.168.74.131's password: 

配置server.properties参考

  • 配置 broker 的ID

broker.id 是 broker 的唯一标示,集群中不能有重复的ID


############################# Server Basics #############################

# The id of the broker. This must be set to a unique integer for each broker.
broker.id=0     # broker 的唯一标示,集群中不能有重复的ID

# Switch to enable topic deletion or not, default value is false
#delete.topic.enable=true
  • 修改 log 的目录(先在kafka安装目录下新建 kafka-log-server 文件夹)
############################# Log Basics #############################

# A comma seperated list of directories under which to store log files
log.dirs=/usr/local/install/kafka/kafka_2.11-0.11.0.0/kafka-log-server

# The default number of log partitions per topic. More partitions allow greater
# parallelism for consumption, but this will also result in more files across
# the brokers.
num.partitions=1

# The number of threads per data directory to be used for log recovery at startup and flushing at shutdown.
# This value is recommended to be increased for installations with data dirs located in RAID array.
num.recovery.threads.per.data.dir=1
  • 打开监听端口
############################# Socket Server Settings #############################

# The address the socket server listens on. It will get the value returned from
# java.net.InetAddress.getCanonicalHostName() if not configured.
#   FORMAT:
#     listeners = listener_name://host_name:port
#   EXAMPLE:
#     listeners = PLAINTEXT://your.host.name:9092
listeners=PLAINTEXT://:9092     # 取消这一行的注释
  • 修改 zookeeper.connect

配置zookeeper集群

############################# Zookeeper #############################

# Zookeeper connection string (see zookeeper docs for details).
# This is a comma separated host:port pairs, each corresponding to a zk
# server. e.g. "127.0.0.1:3000,127.0.0.1:3001,127.0.0.1:3002".
# You can also append an optional chroot string to the urls to specify the
# root directory for all kafka znodes.
zookeeper.connect=192.168.74.129:2181,192.168.74.129:2182,192.168.74.129:2183   # 修改成为我们搭建的zookeeper集群

# Timeout in ms for connecting to zookeeper
zookeeper.connection.timeout.ms=6000

按相同的方式配置server2的kafka的server.properties。保证 broker.id 和 log.dirs 的唯一性就可以了。

启动kafka

  • 先启动zookeeper
  • 启动kafka
    • [root@yyp kafka_2.11-0.11.0.0]# bin/kafka-server-start.sh config/server.properties

    或者

    • [root@yyp kafka_2.11-0.11.0.0]# bin/kafka-server-start.sh config/server.properties&
      • 命令后面加的 ”&“ 是表示后台运行,不在控制台输出一系列信息。
  • 启动结果 如果启动报错,参考文章结尾错误处理
……
-server/meta.properties (kafka.server.BrokerMetadataCheckpoint)[2017-07-16 11:14:13,267] INFO Kafka version : 0.11.0.0 (org.apache.kafka.common.utils.AppInfoParser)
[2017-07-16 11:14:13,267] INFO Kafka commitId : cb8625948210849f (org.apache.kafka.common.utils.AppInfoParser)
[2017-07-16 11:14:13,267] INFO [Kafka Server 1], started (kafka.server.KafkaServer)

命令

  • 创建topic命令:bin/kafka-topics.sh --create --zookeeper 192.168.74.129:2181 --replication-factor 2 --partitions 1 --topic testTopic
    • 指定了replication为2(server1、server2),指定3会报错
  • 查看topic命令:bin/kafka-topics.sh --list --zookeeper 192.168.74.129:2181

存储消息

  • 在server2执行存入消息:
[root@localhost kafka_2.11-0.11.0.0]# bin/kafka-topics.sh --create --zookeeper 192.168.74.129:2181 --replication-factor 2 --partitions 1 --topic testTopic
Created topic "testTopic".
[root@localhost kafka_2.11-0.11.0.0]# bin/kafka-topics.sh --list --zookeeper 192.168.74.129:2181
testTopic
[root@localhost kafka_2.11-0.11.0.0]#
  • 登陆server1查看topic
[root@yyp kafka]# cd kafka_2.11-0.11.0.0/
[root@yyp kafka_2.11-0.11.0.0]# bin/kafka-topics.sh --list --zookeeper 192.168.74.129:2181
testTopic
[root@yyp kafka_2.11-0.11.0.0]# 

server.property

broker.id:          每一个broker在集群中的唯一表示,要求是正数
listeners(效果同之前的版本的host.name及port):注意绑定host.name,否则可能出现莫名其妙的错误如consumer找不到broker。这个host.name是Kafka的server的机器名字,会注册到Zookeeper中。如配置监听(listeners=PLAINTEXT://192.168.74.129:9092)
log.dirs:           kafka数据的存放地址,多个地址的话用逗号分割,多个目录分布在不同磁盘上可以提高读写性能
log.retention.hours:    数据文件保留多长时间, 存储的最大时间超过这个时间会根据log.cleanup.policy设置数据清除策略
zookeeper.connect:     指定ZooKeeper的connect string,以hostname:port的形式,可有多个以逗号分隔,如hostname1:port1,hostname2:port2,hostname3:port3,还可有路径,如:hostname1:port1,hostname2:port2,hostname3:port3/kafka,注意要事先在zk中创建/kafka节点,否则会报出错误:java.lang.IllegalArgumentException: Path length must be > 0

错误处理

UnknownHostException

错误描述:

java.net.UnknownHostException: yyp.localdomain: yyp.localdomain: Name or service not known
	at java.net.InetAddress.getLocalHost(InetAddress.java:1505)
	at kafka.server.KafkaHealthcheck$$anonfun$1.apply(KafkaHealthcheck.scala:60)
	at kafka.server.KafkaHealthcheck$$anonfun$1.apply(KafkaHealthcheck.scala:58)
	at scala.collection.TraversableLike$$anonfun$map$1.apply(TraversableLike.scala:234)
	at scala.collection.TraversableLike$$anonfun$map$1.apply(TraversableLike.scala:234)
	at scala.collection.mutable.ResizableArray$class.foreach(ResizableArray.scala:59)
	at scala.collection.mutable.ArrayBuffer.foreach(ArrayBuffer.scala:48)
	at scala.collection.TraversableLike$class.map(TraversableLike.scala:234)
	at scala.collection.AbstractTraversable.map(Traversable.scala:104)
	at kafka.server.KafkaHealthcheck.register(KafkaHealthcheck.scala:58)
	at kafka.server.KafkaHealthcheck.startup(KafkaHealthcheck.scala:50)
	at kafka.server.KafkaServer.startup(KafkaServer.scala:280)
	at kafka.server.KafkaServerStartable.startup(KafkaServerStartable.scala:38)
	at kafka.Kafka$.main(Kafka.scala:65)
	at kafka.Kafka.main(Kafka.scala)

解决办法:

  • 提示的是Host找不到,在/etc/hosts中映射的ip名称和主机名一致就可以了,编辑/etc/hosts加入host配置
192.168.74.129 master yyp.localdomain
  • 或在监听配置的地方加上ip地址
Clone this wiki locally