Open Source Articles

  • Open Sourcing Venice – LinkedIn’s Derived Data Platform

    September 26, 2022

    We are proud to announce the open sourcing of Venice, LinkedIn’s derived data platform that powers more than 1800 of our datasets and is leveraged by over 300 distinct applications. Venice is a high-throughput, low-latency, highly-available, horizontally-scalable, eventually-consistent storage system with first-class support for ingesting the output of batch and...

  • Feathr-joins-LF-AI-and-Data-Foundation

    Feathr joins LF AI & Data Foundation

    September 12, 2022

    Co-authors: Hangfei Lin, Jinghui Mo We’re excited to announce today that Feathr is joining LF AI & Data, the Linux Foundation’s umbrella foundation supporting open source innovation in artificial intelligence (AI) and data. Feathr is a feature store that simplifies machine learning (ML) feature serving and improves developer productivity. “We're excited to...

  • image-of-a-feather

    Open sourcing Feathr – LinkedIn’s feature store for productive machine learning

    April 12, 2022

    We are open sourcing Feathr – the feature store we built to simplify machine learning (ML) feature management and improve developer productivity. At LinkedIn, dozens of applications use Feathr to define features, compute them for training, deploy them in production, and share them across teams. With Feathr, users reported significantly reduced time required to...

  • image-of-the-overview-of-brooklin

    Load-balanced Brooklin Mirror Maker: Replicating...

    April 11, 2022

    At LinkedIn, Apache Kafka is used heavily to store all kinds of data, such as member activity, log storage, metrics storage, and a...

  • graph-of-fast-tree-shap-version-comparison

    FastTreeSHAP: Accelerating SHAP value computation for trees

    March 15, 2022

    Co-authors: Jilei Yang, Humberto Gonzalez, Parvez Ahammad In this blog post, we introduce and announce the open sourcing of the...

  • an-example-for-using-the-member-connection-graph-for-a-job-recommendation-task

    Performance-Adaptive Sampling Strategy (PASS) for GNNs:...

    March 7, 2022

    Co-authors: Jaewon Yang, Minji Yoon, Sufeng Niu, Dash Shi, and Qi He Graphs are a universal way to represent relationships among...