Fast, multi-threaded, memory-optimized tool for computing cosine similarity on very large matrices imported from NumPy.
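
For orientation, here is a minimal sketch of the underlying computation in plain NumPy. It is not this tool's implementation or API. The function name `cosine_similarity_blocks` and the `block_rows` parameter are hypothetical, chosen only for illustration. The sketch normalizes each row to unit length, then computes the similarity matrix one block of rows at a time, so the full N x N result never has to be held in memory at once.

```python
import numpy as np


def cosine_similarity_blocks(matrix, block_rows=1024):
    """Yield (start, block) pairs for a row-wise cosine similarity matrix.

    Hypothetical helper for illustration only. `block` holds the similarity
    between rows [start, start + block_rows) and every row of `matrix`.
    """
    matrix = np.asarray(matrix, dtype=np.float64)
    # Euclidean norm of each row, kept as a column vector for broadcasting.
    norms = np.linalg.norm(matrix, axis=1, keepdims=True)
    # Assumption: all-zero rows get similarity 0 rather than NaN.
    norms[norms == 0.0] = 1.0
    unit = matrix / norms
    # The dot product of two unit vectors is their cosine similarity.
    for start in range(0, unit.shape[0], block_rows):
        yield start, unit[start:start + block_rows] @ unit.T


# Small usage example: three 2-D row vectors, processed two rows at a time.
rows = np.array([[1.0, 0.0], [0.0, 2.0], [3.0, 3.0]])
full = np.vstack(
    [block for _, block in cosine_similarity_blocks(rows, block_rows=2)]
)
```

In practice each yielded block would be consumed or written to disk rather than stacked; the `np.vstack` call here only reassembles the small example for inspection. Peak memory for the result is roughly `block_rows * N` values per step instead of `N * N`.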