News

Data today is too distributed and too large to move it to where the processing is performed. Now, businesses must find ways to move the processing close to the data—and the closer, the better.
If you haven’t heard of Flink until now, get ready for the deluge. As one of a stream of Apache incubator-to-top-level projects turned commercial effort, the data processing engine’s promise is to ...
Managed from a single control plane, a distributed data cloud enables analytic applications to be provisioned at the point of need.
Today, Samsung Next, the venture capital arm of Samsung Electronics, announced it is making a strategic investment in Seattle-based Expanso, a startup looking to power distributed data processing ...
Apache Flink, a distributed in-memory data processing framework project born out of Germany, this week graduated the Apache Incubator stage and became a Top-Level Project at the open source software ...
Written in Java, Hive is a specialized execution front end for Hadoop. Hive lets you write data queries in an SQL-like language. You're using Hadoop, but it feels like you're talking SQL to a ...
Still, Hive is an ideal express-entry into the large-scale distributed data processing world of Hadoop. All the ease of SQL with all the power of Hadoop — sounds good to me.
It is time to stop the stampede to create capacity to analyze big data and instead pursue a more balanced approach that focuses on finding more data sets and understanding how to use them to ...