摘要:Summary In this paper, we proposed superscalar pipelined inner product computation unit for signed-unsigned number operating at 16GHz. This is designed using five stage pipelined operation with four 8×8 multipliers operating in parallel. Superscalar pipelined is designed to compute four 8×8 products in parallel in three clock cycles. In the fourth clock cycle of the pipeline operation, two inner products are computed using two adders in parallel. Fifth stage of the pipeline is designed to compute the final product by adding two inner partial products. Upon the pipeline is filled up, every clock cycle the new product of 16×16-bit signed unsigned number is obtained. The worst delay measured among the pipeline stage is 0.062ns, and this delay is considered as the clock cycle period. With the delay of 0.062ns clock cycle period, the pipeline stage can be operated with 16GHz synchronous clock signal. Each superscalar pipeline stage is implemented using 45nm CMOS process technology, and the comparison of results shows that the delay is decreased by 38%, area is reduced by 45% and power dissipation is saved by 32%.