Co-authors: David Lu, Hong Liu, Thomas Kwan, Christopher Harris, Weiping Si Many companies, including LinkedIn, have experienced exponential data growth ever since the Apache Hadoop adoption a decade ago. With a proliferation of self-service data authoring tools and publishing platforms, different teams have created and shared datasets to address business needs...