MSRG Publication:: Hardware Acceleration Landscape for Distributed Real-time Analytics: Virtues and Limitations

Hardware Acceleration Landscape for Distributed Real-time Analytics: Virtues and Limitations

Mohammadreza Najafi, Kaiwen Zhang, Mohammad Sadoghi, and Hans-Arno Jacobsen.

In ICDCS, 2017.

Abstract

We are witnessing a technological revolution with a broad impact ranging from daily life (e.g., personalized medicine and education) to industry (e.g., data-driven healthcare, commerce, agriculture, and mining). At the core of this transformation lies “data”. This transformation is facilitated by embedded devices, collectively known as Internet of Things (IoT), which produce real-time feeds of sensor data which are collected and processed to produce a dynamic physical model used for optimized real-time decision making. At the infrastructure level, there is a need to develop a scalable architecture for processing massive volumes of present and historical data at an unprecedented velocity to support the IoT paradigm. To cope with such extreme scale, we argue for the need to revisit the hardware and software co-design landscape in light of two key technological advancements. First is the virtualization of computation and storage over highly distributed data centers spanning across continents. Second is the emergence of a variety of specialized hardware accelerators that complement traditional general-purpose processors. Further efforts are required to unify these two trends in order to harness the power of big data. In this paper, we present a formulation and characterization of the hardware acceleration landscape geared towards real-time analytics in the cloud. Our goal is to assist both researchers and practitioners navigating the newly revived field of software and hardware co-design for building next generation distributed systems. We further present a case study to explore software and hardware interplay for designing distributed real-time stream processing.

Download



Tags: fpga, stream joins, big data


Readers who enjoyed the above work, may also like the following:


  • Multi-query Stream Processing on FPGAs.
    Mohammad Sadoghi, Rija Javed, Naif Tarafdar, Rohan Palaniappan, Harshvardhan Pratap Singh, and Hans-Arno Jacobsen.
    In 28th International Conference on Data Engineering, 2012.
    Tags: fpga, content-based matching, composite subscription, publish/subscribe, topss
  • fpga-ToPSS: Line-speed Event Processing on FPGAs.
    Mohammad Sadoghi, Harshvardhan Pratap Singh, and Hans-Arno Jacobsen.
    In The 5th ACM International Conference on Distributed Event-Based Systems (DEBS), pages 373-374, July 2011.
    Tags: content-based matching, content-based publish/subscribe, fpga, event processing
  • Towards Highly Parallel Event Processing Through Reconfigurable Hardware.
    Mohammad Sadoghi, Harshvardhan Pratap Singh, and Hans-Arno Jacobsen.
    In The 7th International Workshop on Data Management on New Hardware (DaMoN, co-located with SIGMOD), pages 27-32, June 2011.
    Tags: content-based publish/subscribe, content-based matching, fpga