Parallelizing a program can improve its performance considerably when the work divides well across processes. Here is an implementation of Fox's algorithm, one of the parallel algorithms for matrix multiplication, using MPI.
The Message Passing Interface (MPI) is a specification for an API that allows many computers to communicate with one another.
MPI implementations are provided as libraries that support parallel programming. The main idea is passing messages between processes that run in parallel on different cores or even different CPUs. For background on Fox's algorithm, I recommend reading this PDF file.
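To give a feel for the data movement in Fox's algorithm before bringing in MPI, here is a minimal serial sketch in plain C. It is not the MPI implementation itself: the "processes" are simulated as entries of a small array, and the names `Proc`, `block_mul_add`, `fox_multiply`, `Q`, and `BS` are all hypothetical, chosen for illustration. It assumes a square Q x Q process grid where each process owns one BS x BS block of A, B, and C. In each stage, one process per row shares its A block with its row (a row broadcast in MPI), every process multiplies that block with its current B block, and then the B blocks shift up by one row with wraparound.

```c
#include <stdio.h>
#include <string.h>

#define Q 2          /* simulated process grid is Q x Q (assumed size) */
#define BS 2         /* each process owns a BS x BS block (assumed size) */
#define N (Q * BS)   /* the full matrices are N x N */

/* Local data held by one simulated process. */
typedef struct {
    double a[BS][BS];
    double b[BS][BS];
    double c[BS][BS];
} Proc;

/* c += a * b for a single BS x BS block. */
static void block_mul_add(double a[BS][BS], double b[BS][BS], double c[BS][BS]) {
    for (int i = 0; i < BS; i++)
        for (int j = 0; j < BS; j++)
            for (int k = 0; k < BS; k++)
                c[i][j] += a[i][k] * b[k][j];
}

/* Serial simulation of Fox's algorithm: computes C = A * B. */
void fox_multiply(double A[N][N], double B[N][N], double C[N][N]) {
    Proc grid[Q][Q];

    /* Scatter: process (i, j) gets block (i, j) of A and B; C starts at zero. */
    for (int i = 0; i < Q; i++)
        for (int j = 0; j < Q; j++)
            for (int r = 0; r < BS; r++)
                for (int s = 0; s < BS; s++) {
                    grid[i][j].a[r][s] = A[i * BS + r][j * BS + s];
                    grid[i][j].b[r][s] = B[i * BS + r][j * BS + s];
                    grid[i][j].c[r][s] = 0.0;
                }

    for (int stage = 0; stage < Q; stage++) {
        /* "Broadcast" step: in row i, the process in column (i + stage) % Q
           shares its A block; every process in the row multiplies it with
           the B block it currently holds. */
        for (int i = 0; i < Q; i++) {
            int root = (i + stage) % Q;
            for (int j = 0; j < Q; j++)
                block_mul_add(grid[i][root].a, grid[i][j].b, grid[i][j].c);
        }

        /* Shift step: each B block moves up one row, wrapping around,
           so row i receives the block that row i + 1 was holding. */
        for (int j = 0; j < Q; j++) {
            double tmp[BS][BS];
            memcpy(tmp, grid[0][j].b, sizeof tmp);
            for (int i = 0; i + 1 < Q; i++)
                memcpy(grid[i][j].b, grid[i + 1][j].b, sizeof tmp);
            memcpy(grid[Q - 1][j].b, tmp, sizeof tmp);
        }
    }

    /* Gather: copy each local C block back into the full result. */
    for (int i = 0; i < Q; i++)
        for (int j = 0; j < Q; j++)
            for (int r = 0; r < BS; r++)
                for (int s = 0; s < BS; s++)
                    C[i * BS + r][j * BS + s] = grid[i][j].c[r][s];
}
```

Calling `fox_multiply(A, B, C)` on two N x N arrays fills `C` with their product. In an MPI version, the broadcast step would typically map to `MPI_Bcast` on a row communicator and the shift step to a send/receive pair along each column, but that mapping is only a suggestion here and is not taken from the implementation described above.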