Improved GHASH pclmul implementation (parallel processing of four blocks, +70% speed).