Skip to content

Releases: aeron-io/aeron

1.44.7

10 Jul 10:55
7adc2da
Compare
Choose a tag to compare
  • [Archive] Copy ttl URI parameter from the original channel definition when creating recordings and replays. To ensure that the control stream will use the same TTL setting as the incoming data stream.
  • [C/C++ Wrapper] Include <chrono> to fix compilation error on Windows.
  • [Java] Backport releasing to Maven Central Portal.

1.48.4

27 Jun 16:47
120600a
Compare
Choose a tag to compare
  • [C] Fix aeron_cnc_is_file_length_sufficient, i.e. take into account error buffer length. (#1828)
  • [C] Preserve agent_on_start_func override during driver startup procedure. (#1826)
  • [C++ Wrapper] Add methods to get the registrationId of an asynchronously created resource.
  • Breaking: [Cluster] Allow ConsensusModuleExtension to do work while election is in progress. (#1827)
  • [Cluster] Add -ext suffix to ConsensusModuleExtension archive client request and response channel aliases.
  • [Driver] Add fragmentedFrameLength getter to Header. (#1829)
  • [AeronArchive] Allow NOT_CONNECTED to be returned without throwing an exception before client is closed.
  • [Cluster] Fix ClusterToolOperator#queryClusterMembers return value when query times out.
  • [Java] Bump Agrona to 2.2.4.
  • [Java] Bump SBE to 1.35.6.
  • [Java] Bump ByteBuddy to 1.17.6
  • [Java] Bump Shadow to 8.3.7.
  • [Java] Bump JUnit to 5.13.2.
  • [Java] Bump Checkstyle to 10.26.0.
  • [Java] Bump jgit to 7.3.0.202506031305-r.

1.48.3

20 Jun 14:12
9a62bd9
Compare
Choose a tag to compare
  • [Java/C] Use untethered-linger-timeout in IPC publications.
  • [Java] Bump Agrona to 2.2.3.
  • [Java] Bump SBE to 1.35.5.

1.48.2

12 Jun 14:55
Compare
Choose a tag to compare
  • [Java/C/C++] Poll Archive client on slow duty cycle. (#1817)
  • [C] Close send channel endpoints immediately once they have been released by the sender thread, don't wait for a managed resource cycle. (#1816)
  • [Java] Cluster tool exit code bugfix. (#1818)
  • [Java] Fix IndexedReplicatedRecording LIVE_CHANNEL config (#1815)
  • [Java] Bump JUnit to 5.13.1.

1.48.1

06 Jun 15:37
7a5dd15
Compare
Choose a tag to compare
  • [C++] Add warning that C++ API will be removed in 1.50.0.
  • [Java] Publish artifacts to Central Portal using OSSRH Staging API.
  • [Java] Bump Agrona to 2.2.2.
  • [Java] Bump SBE to 1.35.3.
  • [Java] Bump JGit to 7.2.1.202505142326-r.
  • [Java] Bump Checkstyle to 1.25.0.
  • [Java] Bump Gradle 8.14.2.

1.48.0

03 Jun 22:59
d800d60
Compare
Choose a tag to compare

Noteworthy Changes

  • ExclusivePublication#revoke.

    Release publisher and subscriber resources immediately with exclusive publication revoke. Publication will not linger and not allow any trailing loss to be resolved. Subscription will not wait for any data to be received.

    NB: Media driver and client code (publisher and subscriber) must run Aeron 1.48.0 or higher.

    For more information see Publication#revoke wiki page.

  • Image#reject.

    Reject incoming sessions from a publisher. This allows you to quickly stop data flow in scenarios where the data is no longer needed or is invalid.

    For more information see Image#reject wiki page.

  • Track connection status in AeronCluster.

    AeronCluster now contains a state machine to track connection status. The state machine is updated during poll operations (AeronCluster#pollEgress and AeronCluster#controlledPollEgress) and while sending data to the Cluster (i.e. AeronCluster#offer, AeronCluster#tryClaim, AeronCluster#sendKeepAlive). If a break in communication is detected and it lasts for more than AeronCluster.Context#newLeaderTimeoutNs() then AeronCluster will close itself.

    NB: When AeronCluster.Context#newLeaderTimeoutNs() is not set the AeronCluster will wait for double the leadership timeout from an actual Cluster. If that is not available (i.e. Cluster is running on an older Aeron version) then it will fallback to a 10 seconds default value, i.e. will wait for 20 seconds.

    If AeronCluster#ingressPublication or AeronCluster#egressSubscription are used directly then it is a user responsibility to call new APIs in order to update the connection tracking state machine, i.e.:

    • After each invocation of the offer/tryClaim on the AeronCluster#ingressPublication a call to AeronCluster#trackIngressPublicationResult must be made.
    • Every time AeronCluster#egressSubscription is polled a call to AeronCluster#pollStateChanges must be made.
  • Response channels GA.

    Response channels have been promoted from experimental to General Availability. Users no longer need to enable experimental features to use this feature.

  • C & C++ wrapper Archive client APIs GA.

    The APIs have been promoted from experimental to General Availability, achieving feature-completeness and parity with Java. Old C++ APIs will be decommissioned in 1.50.0.

  • Per-stream NAK counters.
    Two new stream-specific NAK counters where added:

    • snd-naks-received (typeId=19) - tracks the number of NAKs received by the sender.
    • rcv-naks-sent (typeId=20) - tracks the number of NAKs sent by the receiver.
  • Affinity setting AERON_DRIVER_ASYNC_EXECUTOR_CPU_AFFINITY for async thread (aeron_executor) was removed.

  • Retransmit Receiver Window Multiple

    To avoid overwhelming receivers in the event of retransmissions, Aeron limits the amount of data sent in a single retransmission to a multiple of the receiver window. Previously, this multiple was 16 for unicast, 16 for min and tagged multicast, and 4 for max multicast. It now defaults to 16 for unicast, 4 for all multicast strategies, and can be configured with the properties aeron.unicast.flow.control.rrwm and aeron.multicast.flow.control.rrwm.

  • Linger timeout

    There is a new option to control how long untethered subscriptions will linger before being removed from flow control. If the new untethered linger timeout is not set, the default timeout is equal to the untethered window limit timeout. Previously, the untethered linger timeout was always equal to the window limit timeout. Now they can be changed independently. The new property name is aeron.untethered.linger.timeout. It can also be set via untethered-linger-timeout URI parameter.

Changelog

  • [Java] Initialize archiveId early using CnC file if Aeron instance is not specified.
  • [Java] Close extension's Archive client.
  • [Java] Close snapshot replication before replay and recording are closed.
  • [Java] Adjust archive client name based on the configuration.
  • [C] Add client name for the implicit Aeron client created by the Archive client.
  • [Java] Name implicit Aeron clients based on their usage.
  • [C] Use untethered-linger-timeout on the receiver side.
  • [Java] Store untethered-linger-timeout in the log buffer metadata.
  • [Java] Fix a bug where untethered-linger-timeout was not added to the resulting URI.
  • [Java] Use untethered-linger-timeout on the receiver side.
  • [Java] Use Publication#revoke and Image#reject to close ControlSession resources.
  • [Java] Use Publication#revoke to abort replay session.
  • [Java] Don't through an exception when failing to copy a file within the data collector. This breaks other parts of the data collection on test failure (e.g. event log capture).
  • [C] Flow control retransmit receiver window multiple for C driver. (#1807)
  • [C] C version of untethered linger timeout. (#1808)
  • [Java] Require ArchiveThreadingMode.INVOKER if MediaDriver is running in the invoker mode.
  • [Java/C] Per stream NAKs. (#1806)
  • [Java] Add separate linger timeout for untethered subscriptions. (#1801)
  • [Java/C] Publication#revoke. (#1781)
  • [Java] Make cluster publish leader heartbeat timeout to clients. (#1805)
  • [Java] Require Aeron client to run in the invoker mode if MediaDriver is running with ThreadingMode.INVOKER, i.e. Aeron.Context.useConductorAgentInvoker(true) must be set when Aeron.Context.driverAgentInvoker() is set.
  • [Java] Add event code type for sequencer.
  • [Java/C] Image#reject. (#1785)
  • [Java] Fsync archive.catalog file to disc when shutting down Archive.
  • [C] Align flow control receiver timeout with Java, i.e. use AERON_FLOW_CONTROL_RECEIVER_TIMEOUT env variable instead of AERON_MIN_MULTICAST_FLOW_CONTROL_RECEIVER_TIMEOUT.
  • [Java] Remove legacy aeron.MinMulticastFlowControl.receiverTimeout config option, i.e. use aeron.flow.control.receiver.timeout directly.
  • [C] Remove experimental feature flag for response channels.
  • [Java] Remove experimental option for response channels for the response channels.
  • [C] MDC short send fix. (#1770)
  • [Java] Flow control retransmit receiver window multiple (#1800)
  • [Java] Prevent potential silent message loss on cluster ingress/egress.
  • [Java] Make AeronCluster track connection status.
  • [Java] Create new Ping message for archive client keepalive. (#1799)
  • [Java] File page aligned mark files. (#1789)
  • [Java] Increment snapshot counter after standby snapshots were successfully replicated.
  • [Config] Update code style to reduce use of '.*' imports.
  • [Java] Improve storage space exception detection.
  • [Java] Properly check for EOS flag. (#1795)
  • [C] Fix issue for untethered slow consumers impacting whole server. (#1792)
  • [C++ Wrapper] Remove 'experimental' indicator for C/C++ wrapper archive APIs. (#1793)
  • [Java] Refactor session liveness check.
  • [C] Use async-executor name for the async thread, i.e. align with the Java impl.
  • [Java] Use async-executor prefix for async threads.
  • [Bash] Simplify thread affinity listing.
  • [Java] Surface method to describe extension snapshot content in ClusterTool. Support printing snapshot entries as hex dumps.
  • [C++] Add #include <cstdint>.
  • [C++ Wrapper] Add missing header. (#1786)
  • [C] Remove affinity settings for the async thread (aeron_executor).
  • [Java] Add TestIdleStrategy.
  • [Java] Synchronize session ids across cluster nodes. (#1774)
  • [CI] Add Clang 20 to the build matrix.
  • [C] Call close_session in archive_close(). (#1778)
  • [Java] Added close reason to consensus module extension call back on session close.
  • [C] Create log buffers sparse by default.
  • [Java] Create log buffers sparse by default.
  • [Java] Add context to the disconnected control session warning message, i.e. show the response streamId/channel pair to help identify client that was disconnected.
  • [Java] Use separate fragment assemblers for IPC and UDP inputs.
  • [C++ Wrapper] Sync addAliasIfAbsent method to ChannelUri. (#1755)
  • [C++ Wrapper] Allow setting the recording events channel. (#1768)
  • [Java] Use MarkFile#timestampRelease.
  • [C++ Wrapper] fix uri_buffer length in Subscription.tryResolveChannelEndpointPort(). (#1767)
  • [Java] Don't report error if the publication is closed or not connected during replay.
  • [Doc] Document the reserved range for Aeron counter typeIds. (#1771)
  • [Java] Update sub-pos iff the image was not closed. Otherwise, the JVM might crash with SIGSEGV while accessing closed Position.
  • [CMake] Only link to client for signal test.
  • [C] Add TERM signal handling to C media driver and supporting test.
  • [C] Add missing header. (#1765)
  • ...
Read more

1.47.5

09 May 15:14
ae451f6
Compare
Choose a tag to compare

[Driver] Check if EOS flag bit is set instead of the entire mask. (#1795)
[Driver] Record bytes lost in the loss report only once when a loss is detected, i.e. do not count the same loss when resending NAKs. (#1796)
[Driver] Prevent NetworkPublication's pub-lmt from wrapping around into the dirty term. (#1794)
[Cluster] Prevent ConsensusModule's state (nextSessionId) diverging between leader and follower nodes when a session is rejected during the authentication phase. (#1774)
[Cluster] Only send TerminationAck to the leader that requested it. (#1797)
[Cluster] Use separate fragment assemblers for IPC and UDP inputs.
[Client: C] Do not update image list change number when retaining/releasing images as those can be called from a client conductor thread.
[Client: C++ Wrapper] Use const on Context.h copy constructor.
[Archive Client: C] Call close_session() in archive_close(). (#1778)

1.47.4

14 Mar 11:30
f797240
Compare
Choose a tag to compare
  • [Driver] Increment retransmit count only if data was actually sent.
  • [Cluster] Fix buffer reference for ClusterMarkFile. (#1753)
  • [Cluster/Archive] Protect against access to the closed mark file.
  • [Cluster/Archive] Prevent JVM crash when opening an old version of the mark file (i.e. without a message header).
  • [C++ Wrapper] Add an AsyncDestination type definition to Subscription.h. (#1749)
  • [C++ Wrapper] Change wrapper version of the Context so that it does not hold a pointer to the underlying C context and track the values directly on the object and pass them through during init. Keep the pointer to the C context on the Aeron object to be properly cleaned up. (#1730)
  • [C] Check that an image exists in the Subscription when retaining/releasing. (#1752)

1.47.3

14 Feb 13:55
73548bb
Compare
Choose a tag to compare
  • [Java] Reset ClusterBackup state if the Cluster node from which ClusterBackup is replaying the log is "not available", i.e. either no longer eligible (i.e. after an election) or the backup query cannot be sent to it (e.g. ConsensusModule is down).
  • [Java] Fix typo in ReplicationSession state change reason.
  • [C] Adding setter/getter methods for CPU affinity to media driver. (#1737)
  • [C] Support use of sendmmsg() without an address (i.e. when connect address is used). (#1742)
  • [C] Close image when it is being removed from a subscription.
  • [C++ Wrapper] Decrement ref count of an Image after it was created, because it was counted twice: once in the C code when looking the aeron_image_t and the second time by invoking aeron_subscription_image_retain inside the Image constructor.
  • [C++ Wrapper] Remove definitions that shadow aeron_logbuffer_descriptor.h definitions. (#1740)
  • [Java] Upgrade to Gradle 8.12.1.
  • [Java] Upgrade to Shadow 8.3.6.
  • [Java] Upgrade to Checkstyle 10.21.2.
  • [Java] Upgrade to ByteBuddy 1.17.1.

1.47.2

30 Jan 16:28
591b1c7
Compare
Choose a tag to compare

Known issues

  • [Java] ClusterBackup might connect to two different Cluster nodes simultaneously whereby one is used to provide the live Raft log replay and to download the snapshots, whereas the other one is used to fetch the latest list of snapshot entries and the recording log metadata. As long as all of the Cluster nodes are "in sync" (i.e. have the same set of snapshots) then everything is ok. However, if the second node from which ClusterBackup fetches the snapshots was down for some time (i.e. does not have all of the snapshots) then the ClusterBackup might end up with a broken recording log whereby recording log entries will have a different log position to the underlying snapshot recordings.
    Fixed in 1.47.3

Changelog

  • [Java] Fix a regression in AeronArchive#listRecording which could return arbitrary recording information when the specified recordingId is not found (does not exist or state is not VALID) instead of sending back ControlResponseCode.RECORDING_UNKNOWN.
  • [C] Apply aeron.conductor.cpu.affinity to the thead in SHARED threading mode and aeron.sender.cpu.affinity to sender/receiver thread in SHARED_NETWORK threading mode.
  • [C] Add support for setting CPU affinity for the async executor thread (aeron.driver.async.executor.cpu.affinity property and AERON_DRIVER_ASYNC_EXECUTOR_CPU_AFFINITY env variable).