Co-authors: Jilei Yang, Humberto Gonzalez, Parvez Ahammad In this blog post, we introduce and announce the open sourcing of the FastTreeSHAP package, a Python package based on the paper Fast TreeSHAP: Accelerating SHAP Value Computation for Trees (presented at the NeurIPS2021 XAI4Debugging Workshop). FastTreeSHAP enables an efficient interpretation of tree-based...
Open Source Articles
-
Co-authors: Jaewon Yang, Minji Yoon, Sufeng Niu, Dash Shi, and Qi He Graphs are a universal way to represent relationships among entities. Social graphs represent how people interact with each other, professional graphs represent how people collaborate, and so on. Graph Neural Networks (GNNs) are deep learning models that are specialized for understanding graphs...
-
Co-authors: Venkata Krishnan Sowrirajan and Min Shen We are excited to announce that push-based shuffle (codenamed Project Magnet) is now available in Apache Spark as part of the 3.2 release. Since the SPIP vote on Project Magnet passed in September 2020, there has been a lot of interest in getting it into Apache Spark. As of March 2021, 100% of LinkedIn’s Spark...
- Topics:
- Spark,
- Open Source
-
Co-authors: Keqiu Hu, Jonathan Hung, Haibo Chen, and Sriram Rao At LinkedIn, we use Hadoop as our backbone for big data analytics and...
- Topics:
- Hadoop,
- Data,
- Open Source
-
Co-authors: Ze Mao, Matt Wise, Casey Getz, Justin Lin, Ashish Singhai, and Rob Block Introduction Ambry is LinkedIn's scalable...
- Topics:
- infrastructure,
- Data,
- Storage,
- Open Source
-
Co-authors: Kirill Talanine, Jeffrey D. Gee, Rohan Ramanath, Konstantin Salomatin, Gungor Polatkan, Onkar Dalal, and Deepak Kumar...