About a year ago, we open sourced Gobblin, a universal data ingestion framework that aims to solve the data integration challenges faced by people working on big data problems. In many previous blog posts, publications, and talks, we have described how LinkedIn uses Gobblin to ingest data at massive scale from a variety of sources into HDFS. Today, we are very...
Gobblin Articles
- Topics: Hadoop, Big Data, Open Source, Data Ingestion, Distributed Systems, ETL, Gobblin, Kafka

We shared Gobblin with the open source community a year ago. Since then, we’ve seen increasing interest and adoption among engineers, researchers and analysts in using Gobblin to integrate data from a variety of sources into Hadoop. In previous blog posts, publications, and talks, we’ve described our motivations for building a unified ingestion framework that is...
- Topics: Hadoop, Data Ingestion, Gobblin, Kafka, Open Source

Genesis: Less than a year ago, we introduced Gobblin, a unified ingestion framework, to the world of Big Data. Since then, we’ve shared ongoing progress through a talk at Hadoop Summit and a paper at VLDB. Today, we’re announcing the open source release of Gobblin 0.5.0, a big milestone that includes Apache Kafka integration. Our motivations for building Gobblin...
- Topics: Big Data, Hadoop, Open Source, Distributed Systems, ETL, Gobblin, Kafka

When dealing with massive amounts of data at scale, it’s important to have state-of-the-art infrastructure and algorithms that can...
- Topics: Pinot, data analytics, Cubert, infrastructure, Gobblin

Authors: Shirshanka Das, Lin Qiao. The holiday season for gobbling is upon us, and at LinkedIn, we’ve been getting better at gobbling...
- Topics: Big Data, Hadoop, Data Ingestion, Distributed Systems, Gobblin, Open Source