项目作者: evolution-gaming

项目描述 :
Scala wrapper for kafka consumer and producer
高级语言: Scala
项目地址: git://github.com/evolution-gaming/skafka.git
创建时间: 2018-02-01T15:49:17Z
项目社区:https://github.com/evolution-gaming/skafka

开源协议:MIT License

下载


Skafka

Build Status
Coverage Status
Codacy Badge
Version
License: MIT

Scala wrapper for kafka-clients v3.4.0

Motivation

Kafka provides an official Java client out of the box, which could be used from
Scala code without any additional modifications.

The main disadvantage of using an official client directly is that it implies
a very specific threading model to the application. I.e. the consumer is not
thread safe and also expects a rebalance listener to do the operations in the
same thread.

This makes wrapping a client with Cats Effect
classes a bit more complicated than just calling IO { consumer.poll() } unless
this is the only call, which is expected to be used.

Skafka does exactly that: a very thin wrapper over official Kafka client to
provide a ready-made Cats Effect API and handle some corner cases concerning
ConsumerRebalanceListener calls.

Comparing to more full-featured libraries such as
FS2 Kafka, it might be a little bit more
reliable, because there is little code/logic to hide the accidenital bugs in.

To summarize:

  1. If it suits your goals (i.e. you only ever need to do consumer.poll()
    without acting on rebalance etc.) then using an official Kafka client directly,
    optionally, wrapping all the calls with cats.effect.IO, is a totally fine idea.
  2. If more complicated integration to Cats Effect is required, i.e.
    ConsumerRebalanceListener is going to be used then consider using Skafka.
  3. If streaming with FS2 is required or any other features
    the library provides then FS2 Kafka could be a good choice. Note, that it is
    less trivial then Skafka and may contain more bugs on top of the official
    Kafka client.

Key features

  1. It provides null-less Scala apis for Producer & Consumer

  2. Makes it easy to use your effect monad with help of cats-effect

  3. Blocking calls are being executed on provided ExecutionContext.

  4. Simple case class based configuration

  5. Support of typesafe config

Producer usage example

  1. val producer = Producer.of[IO](config, ecBlocking)
  2. val metadata: IO[RecordMetadata] = producer.use { producer =>
  3. val record = ProducerRecord(topic = "topic", key = "key", value = "value")
  4. producer.send(record).flatten
  5. }

Consumer usage example

  1. val consumer = Consumer.of[IO, String, String](config, ecBlocking)
  2. val records: IO[ConsumerRecords[String, String]] = consumer.use { consumer =>
  3. for {
  4. _ <- consumer.subscribe(Nel("topic"), None)
  5. records <- consumer.poll(100.millis)
  6. } yield records
  7. }

Java client metrics example

The example below demonstrates creation of Consumer, but same can be done for Producer as well.

:warning: using ConsumerMetricsOf.withJavaClientMetrics (or its alternative metrics.exposeJavaClientMetrics)
registers new Prometheus collector under the hood. Please use unique prefixes for each collector
to avoid duplicated metrics in Prometheus (i.e. runtime exception on registration).
Prefix can be set as parameter in: ConsumerMetricsOf.withJavaClientMetrics(prometheus, Some("the_prefix"))

  1. import ConsumerMetricsOf.*
  2. val config: ConsumerConfig = ???
  3. val prometheus: CollectorRegistry = ???
  4. val metrics: ConsumerMetrics[IO] = ???
  5. for {
  6. metrics <- metrics.exposeJavaClientMetrics(prometheus)
  7. consumerOf = ConsumerOf.apply1(metrics1.some)
  8. consumer <- consumerOf(config)
  9. } yield ???

Setup

  1. addSbtPlugin("com.evolution" % "sbt-artifactory-plugin" % "0.0.2")
  2. libraryDependencies += "com.evolutiongaming" %% "skafka" % "15.0.0"

Notes

While Skafka provides an ability to use ConsumerRebalanceListener
functionality, not all of the method calls are supported.

See the following PRs for more details:
https://github.com/evolution-gaming/skafka/pull/150
https://github.com/evolution-gaming/skafka/pull/122

To our latest knowledge neither FS2 Kafka supports all of the
methods / functionality.

Release process

The release process is based on Git tags and makes use of sbt-dynver to automatically obtain the version from the latest Git tag. The flow is defined in .github/workflows/release.yml.
A typical release process is as follows:

  1. Create and push a new Git tag. The version should be in the format vX.Y.Z (example: v4.1.0). Example: git tag v4.1.0 && git push origin v4.1.0
  2. Create a new release in GitHub. Go to the Releases page, click Draft a new release, select Choose a tag, pick the tag you just created
  3. Press Generate release notes. Release title will be automatically filled with the tag name. Change the description if needed
  4. Press Publish release