Streaming Audio: Apache Kafka® & Real-Time Data

By: Confluent, founded by the original creators of Apache Kafka®
  • Description

  • Streaming Audio features all things Apache Kafka®, Confluent, real-time data, and the cloud. We cover frequently asked questions, best practices, and use cases from the Kafka community—from Kafka connectors and distributed systems to data mesh, data integration, and modern data architectures built with Confluent and cloud Kafka as a service. Join our hosts as they stream through a series of interviews, stories, and use cases with guests from the data streaming industry. Apache®, Apache Kafka, Kafka, and the Kafka logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. No endorsement by The Apache Software Foundation is implied by the use of these marks.

    © 2024 Streaming Audio: Apache Kafka® & Real-Time Data
  • Apache Kafka 3.5 - Kafka Core, Connect, Streams, & Client Updates
    Jun 15 2023

    Apache Kafka® 3.5 is here, with a preview of migration from ZooKeeper clusters to KRaft mode. Follow along as Danica Fine highlights key release updates.

    Kafka Core:

    • KIP-833 provides an updated timeline for KRaft.
    • KIP-866 is now in preview and allows migration from an existing ZooKeeper cluster to KRaft mode.
    • KIP-900 introduces a way to bootstrap the KRaft controllers with SCRAM credentials.
    • KIP-903 prevents a data loss scenario by keeping replicas with stale broker epochs out of the ISR.
    • KIP-915 streamlines the process of downgrading Kafka's transaction and group coordinators by introducing tagged fields.


    Kafka Connect:

    • KIP-710 provides the option for dedicated MirrorMaker 2 clusters to use a REST API for internal communication, enabled by setting `dedicated.mode.enable.internal.rest` to `true`.
    • KIP-875 offers support for native offset management in Kafka Connect. Connect cluster administrators can now read offsets for both source and sink connectors (see the sketch after this list). This KIP also adds a new STOPPED state for connectors, enabling users to shut down connectors while maintaining their configurations without utilizing resources.
    • KIP-894 makes the `IncrementalAlterConfigs` API available for use in MirrorMaker 2 (MM2), adding a new `use.incremental.alter.configs` configuration that takes the values `requested`, `never`, and `required`.
    • KIP-911 adds a new source tag for metrics generated by the `MirrorSourceConnector` to help monitor mirroring deployments.
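
    To make KIP-875 concrete, here is a minimal sketch of reading a connector's committed offsets through the new Connect REST endpoint. The worker URL and connector name are hypothetical; the `GET /connectors/{name}/offsets` path is the one the KIP adds, and the companion `PUT /connectors/{name}/stop` endpoint moves a connector into the new STOPPED state.

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

public class ConnectOffsets {
    public static void main(String[] args) throws Exception {
        // Hypothetical Connect worker URL and connector name.
        String url = "http://localhost:8083/connectors/my-source/offsets";

        // KIP-875 (Kafka 3.5) exposes committed offsets for both source
        // and sink connectors as JSON on this endpoint.
        HttpClient client = HttpClient.newHttpClient();
        HttpRequest request = HttpRequest.newBuilder(URI.create(url)).GET().build();
        HttpResponse<String> response =
            client.send(request, HttpResponse.BodyHandlers.ofString());

        System.out.println(response.body());
    }
}
```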


    Kafka Streams:

    • KIP-399 improves Kafka Streams' error-handling capabilities by addressing serialization errors that occur before message production and extending the interface for custom error handling.
    • KIP-889 introduces versioned state stores in Kafka Streams for temporal join semantics in stream-to-table joins; a short sketch follows this list.
    • KIP-904 simplifies table aggregation in Kafka Streams by changing the serialization format to enable one-step aggregation and reduce noise from events with old and new keys/values.
    • KIP-914 modifies how versioned state stores are used in Kafka Streams. Versioned state stores may impact different DSL processors in varying ways; see the documentation for details.
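
    As a quick illustration of KIP-889, here is a minimal sketch of materializing a KTable with a versioned state store. The topic name, store name, and retention period are hypothetical, and default serdes are assumed to be configured elsewhere.

```java
import java.time.Duration;
import org.apache.kafka.streams.StreamsBuilder;
import org.apache.kafka.streams.kstream.KTable;
import org.apache.kafka.streams.kstream.Materialized;
import org.apache.kafka.streams.state.Stores;

public class VersionedTableExample {
    public static void main(String[] args) {
        StreamsBuilder builder = new StreamsBuilder();

        // Retain old record versions for 5 minutes so a stream-to-table join
        // can look up the table value that was current at each stream
        // record's timestamp (temporal join semantics).
        KTable<String, String> table = builder.table(
            "prices", // hypothetical topic
            Materialized.<String, String>as(
                Stores.persistentVersionedKeyValueStore(
                    "versioned-prices", Duration.ofMinutes(5))));

        // ... join a KStream against `table`, build the topology,
        // and start KafkaStreams as usual ...
    }
}
```

    With the versioned store in place, a stream-to-table join resolves each stream record against the table version at the record's timestamp rather than simply the latest value.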


    Kafka Client:

    • KIP-881 is now complete and introduces client-side assignor logic for rack-aware partition assignment for Kafka consumers.
    • KIP-887 adds the `EnvVarConfigProvider` implementation to Kafka so that custom configurations stored in environment variables can be injected via the map returned by `System.getenv()`; both client updates are combined in the sketch after this list.
    • KIP-641 introduces the `RecordReader` interface to Kafka's clients module, replacing the deprecated `MessageReader` Scala trait.
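
    To ground the client updates, here is a minimal consumer sketch combining KIP-881 and KIP-887: `client.rack` requests rack-aware assignment, and `EnvVarConfigProvider` resolves the bootstrap servers from an environment variable. The group ID, rack ID, topic, and `BOOTSTRAP_SERVERS` variable name are hypothetical.

```java
import java.time.Duration;
import java.util.List;
import java.util.Properties;
import org.apache.kafka.clients.consumer.ConsumerConfig;
import org.apache.kafka.clients.consumer.KafkaConsumer;
import org.apache.kafka.common.serialization.StringDeserializer;

public class RackAwareEnvConsumer {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put(ConsumerConfig.GROUP_ID_CONFIG, "demo-group"); // hypothetical

        // KIP-881: report this consumer's rack so the assignor can place
        // partitions near their replicas.
        props.put(ConsumerConfig.CLIENT_RACK_CONFIG, "us-east-1a"); // hypothetical

        // KIP-887: resolve ${env:...} placeholders from environment variables.
        props.put("config.providers", "env");
        props.put("config.providers.env.class",
            "org.apache.kafka.common.config.provider.EnvVarConfigProvider");
        props.put(ConsumerConfig.BOOTSTRAP_SERVERS_CONFIG, "${env:BOOTSTRAP_SERVERS}");

        props.put(ConsumerConfig.KEY_DESERIALIZER_CLASS_CONFIG,
            StringDeserializer.class.getName());
        props.put(ConsumerConfig.VALUE_DESERIALIZER_CLASS_CONFIG,
            StringDeserializer.class.getName());

        try (KafkaConsumer<String, String> consumer = new KafkaConsumer<>(props)) {
            consumer.subscribe(List.of("demo-topic")); // hypothetical topic
            consumer.poll(Duration.ofSeconds(1));
        }
    }
}
```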


    EPISODE LINKS

    • See release notes for Apache Kafka 3.5
    • Read the blog to learn more
    • Download and get started with Apache Kafka 3.5
    • Watch the video version of this podcast
    11 min.
  • A Special Announcement from Streaming Audio
    Apr 13 2023

    After recording 64 episodes and featuring 58 amazing guests, the Streaming Audio podcast series has amassed over 130,000 plays on YouTube in the last year. We're extremely proud of these achievements and feel that it's time to take a well-deserved break. Streaming Audio will be taking a vacation! We want to express our gratitude to you, our valued listeners, for spending 10,000 hours with us on this incredible journey.

    Rest assured, we will be back with more episodes! In the meantime, feel free to revisit some of our previous episodes. For instance, you can listen to Anna McDonald share her stories about the worst Apache Kafka® bugs she’s ever seen, or listen to Jun Rao offer his expert advice on running Kafka in production. And who could forget the charming backstory behind Mitch Seymour's Kafka storybook, Gently Down the Stream?

    These memorable episodes brought us joy, and we're thrilled to have shared them with you. As we reflect on our accomplishments with pride, we also look forward to an exciting future. Until we meet again, happy listening!

    EPISODE LINKS

    • Top 6 Worst Apache Kafka JIRA Bugs
    • Running Apache Kafka in Production
    • Learn How Stream-Processing Works The Simplest Way Possible
    • Watch the video version of this podcast
    • Streaming Audio Playlist
    • Join the Confluent Community
    • Learn more with Kafka tutorials, resources, and guides at Confluent Developer
    • Live demo: Intro to Event-Driven Microservices with Confluent
    • Use PODCAST100 to get an additional $100 of free Confluent Cloud usage (details)
    1 min.
  • How to use Data Contracts for Long-Term Schema Management
    Mar 21 2023

    Have you ever struggled with managing data long term, especially as the schema changes over time? In order to manage and leverage data across an organization, it’s essential to have well-defined guidelines and standards in place around data quality, enforcement, and data transfer. To get started, Abraham Leal (Customer Success Technical Architect, Confluent) suggests that organizations associate their Apache Kafka® data with a data contract (schema). A data contract is an agreement between a service provider and data consumers. It defines the management and intended usage of data within an organization. In this episode, Abraham talks to Kris about how to use data contracts and schema enforcement to ensure long-term data management.

    When an organization sends and stores critical, valuable data in Kafka, more often than not it will want to leverage that data in multiple ways across multiple business units. Kafka is particularly well suited to this use case, but it can become problematic later on if the governance rules aren't established up front.

    With a schema registry in place, evolution becomes manageable thanks to its compatibility guarantees. When managing data pipelines, you can also use GitOps automation features for an extra control layer. It allows you to be creative with topic versioning, upcast or downcast the data collected, and add quality assurance steps at the end of each run to ensure your project remains reliable.

    Abraham explains that Protobuf and Avro are better formats to use than XML or JSON because they are built to handle schema evolution. They also carry much lower per-record overhead, so adopting them can save bandwidth and data storage costs.
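
    As a rough sketch of a data contract in practice, the snippet below produces Avro records validated against Confluent Schema Registry. It assumes the Confluent `kafka-avro-serializer` dependency on the classpath, and the URLs, topic, and schema are hypothetical.

```java
import java.util.Properties;
import org.apache.avro.Schema;
import org.apache.avro.generic.GenericData;
import org.apache.avro.generic.GenericRecord;
import org.apache.kafka.clients.producer.KafkaProducer;
import org.apache.kafka.clients.producer.ProducerRecord;

public class ContractProducer {
    public static void main(String[] args) {
        // Hypothetical writer schema; the default on `amount` lets consumers
        // on the newer schema still read older records, keeping the
        // contract's evolution backward compatible.
        Schema schema = new Schema.Parser().parse(
            "{\"type\":\"record\",\"name\":\"Order\",\"fields\":["
          + "{\"name\":\"id\",\"type\":\"string\"},"
          + "{\"name\":\"amount\",\"type\":\"double\",\"default\":0.0}]}");

        Properties props = new Properties();
        props.put("bootstrap.servers", "localhost:9092");          // hypothetical
        props.put("schema.registry.url", "http://localhost:8081"); // hypothetical
        props.put("key.serializer",
            "org.apache.kafka.common.serialization.StringSerializer");
        props.put("value.serializer",
            "io.confluent.kafka.serializers.KafkaAvroSerializer");

        GenericRecord order = new GenericData.Record(schema);
        order.put("id", "o-1");
        order.put("amount", 9.99);

        try (KafkaProducer<String, GenericRecord> producer = new KafkaProducer<>(props)) {
            producer.send(new ProducerRecord<>("orders", "o-1", order));
        }
    }
}
```

    Because the serializer registers and checks the schema against the subject's compatibility rules before any record is written, an incompatible schema change fails at the producer instead of surfacing downstream.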

    There’s so much more to consider, but if you are thinking about implementing or integrating with your data quality team, Abraham suggests that you use schema registry heavily from the beginning.

    If you have more questions, Kris invites you to join the conversation. You can also watch the KOR Financial Current talk Abraham mentions or take Danica Fine’s free course on how to use schema registry on Confluent Developer.

    EPISODE LINKS

    • OS project
    • KOR Financial Current Talk
    • The Key Concepts of Schema Registry
    • Schema Evolution and Compatibility
    • Schema Registry Made Simple by Confluent Cloud ft. Magesh Nandakumar
    • Kris Jenkins’ Twitter
    • Watch the video version of this podcast
    • Streaming Audio Playlist
    • Join the Confluent Community
    • Learn more with Kafka tutorials, resources, and guides at Confluent Developer
    • Live demo: Intro to Event-Driven Microservices with Confluent
    • Use PODCAST100 to get an additional $100 of free Confluent Cloud usage (details)
    57 min.
