Pinot Articles

  • image-of-the-overview-of-brooklin

    Load-balanced Brooklin Mirror Maker: Replicating large-scale Kafka clusters at LinkedIn

    April 11, 2022

    At LinkedIn, Apache Kafka is used heavily to store all kinds of data, such as member activity, logs, metrics, and a multitude of inter-service messages. LinkedIn maintains multiple data centers with multiple Kafka clusters per data center, each of which contains an independent set of data. Mirroring (i.e., replicating) Kafka topics across the...

  • a-graph-showing-keyword-search-query-p95-latency-with-increasing-qps-for-different-workloads

    Text analytics on LinkedIn Talent Insights using Apache Pinot

    June 16, 2021

    Co-authors: Siddharth Teotia and Tim Santos. Introduction: LinkedIn Talent Insights (LTI) is a platform that helps organizations understand the external labor market and their internal workforce, and enables the long-term success of their employees. Users of LTI have the flexibility to construct searches using the various facets of the LinkedIn Economic Graph...

  • venn-diagram-showing-overlap-of-three-characteristics

    Solving for the cardinality of set intersection at scale with Pinot and Theta Sketches

    April 16, 2021

    Co-authors: Vincent Wang, Siddharth Teotia, Manoj Thakur, and Mayank Shrivastava. As our LinkedIn Marketing Solutions Blog recently noted, companies and marketers “are once again peering ahead, setting their plans for success in a reshaped business environment.” One of the items businesses rely on to do this is insights, including the estimated reach of an...

  • from-lambda-to-lambdaless-lessons-learned

    From Lambda to Lambda-less: Lessons learned

    December 1, 2020

    Co-authors: Xiang Zhang and Jingyu Zhu. Introduction: The Lambda architecture has become a popular architectural style that promises...

  • mock-screenshot-of-the-recruiter-usage-dashboard

    Bridging batch and stream processing for the Recruiter...

    July 14, 2020

    Co-authors: Khai Tran and Steve Weiss. Batch and streaming computations are often combined in the Lambda architecture, but...

  • building-linkedin-talent-insights-to-democratize-data-driven-decision-making

    Building LinkedIn Talent Insights to democratize...

    June 29, 2020

    Co-authors: Timothy Santos and Jeremy Lwanga. LinkedIn is a mission-driven organization, and we take our mission of “connecting the...