Skip to content

Latest commit

 

History

History
91 lines (52 loc) · 4.86 KB

README.md

File metadata and controls

91 lines (52 loc) · 4.86 KB

EVALUATING THE PERFORMANCE OF COMPUTING PLATFORMS USING A SET OF COMPUTE KERNELS

This project describes the implementation of the standard benchmarks on the FPGA overlay for the performance analysis and comparing the performance measure such as the throughput (Gops/sec) and the speedup against the other embedded hard processors such as ARM coterx A9, SIMD NEON and INTEL I3. MXP VectorBlox is the FPGA overlay that we have used in our work. For detailed information on the power of the MXP soft-vector processor checkout MXP.

Table of Contents

1) General Information

1.1: MXP

The VectorBlox Matrix Processor is a scalable soft core processor designed on FPGAs. It implements classic massively parallel vector processor algorithms traditionally used in scientific super-computers.

1.2: Xilinux Distribution

This project is implemented and tested using Xilinux distribution. Xilinux Distribution is a demo kit with ubuntu 12.04 for Processing System (PS). We have provided the linux support for accessing the MXP overlay on the ZedBoard.

2) Pre-Requisites

2.1: Zedboard Pre-Requisites

  • Booting Xilinux on zedboard: In order to install the xilinux on zedboard with the base xillybus bitstream please follow the instructions given in the Mxp Linux set up file mxplinuxsetup.doc that tells about configuring the MXP overlay with the Xilinx Zynq.

2.2: Repositories

In this project, we used following repositories:

3) Throughput Analysis

  • PERFORMANCE GRAPHS:

FilterGraphs:

  • FILTERGRAPHS consist of the performance graphs for the filters CHEBYSHEV, MIBENCH and QSPLINE

Alt text

Alt text

Alt text

KernelGraphs:

  • KERNELGRAPHS consist of the FFT, KMEANS, MM, SPMV, STENCIL and MRI.

Alt text

Alt text

Alt text

PolytestGraphs:

  • POLYTESTGRAPHS consist of the polynomial performance results.

Alt text

Alt text

Alt text

PolynomialGraphs:

  • Number of Samples vs time taken plots for the polynomial (quadratic and cubic samples) in different platforms present in polynomial_graphs_of_samplesvstimetaken(msec).

Alt text

Alt text

4) Speedup Analysis

Speedup Analysis with respect to ARM cortex A9 at byte, halfword and word level

Alt text

Alt text

Alt text