How do you implement a FIR filter?
Structurally, FIR filters consist of just two things: a sample delay line and a set of coefficients. To implement the filter: Put the input sample into the delay line. Multiply each sample in the delay line by the corresponding coefficient and accumulate the result.
Why is implementing a parallel FIR filter necessary?
Parallel processing is a powerful technique because it can be used to increase the throughput of a FIR filter or reduce the power consumption of a FIR filter. However, a traditional block filter implementation causes a linear increase in the hardware cost (area) by a factor of L, the block size.