Commit ea06e4e
[SPARK-16469] enhanced simulate multiply
## What changes were proposed in this pull request?
We have a use case of multiplying very big sparse matrices. we have about 1000x1000 distributed block matrices multiplication and the simulate multiply goes like O(n^4) (n being 1000). it takes about 1.5 hours. We modified it slightly with classical hashmap and now run in about 30 seconds O(n^2).
## How was this patch tested?
We have added a performance test and verified the reduced time.
Author: oraviv <[email protected]>
Closes #14068 from uzadude/master.1 parent 51ade51 commit ea06e4e
File tree
1 file changed
+9
-4
lines changed- mllib/src/main/scala/org/apache/spark/mllib/linalg/distributed
1 file changed
+9
-4
lines changedLines changed: 9 additions & 4 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
426 | 426 | | |
427 | 427 | | |
428 | 428 | | |
| 429 | + | |
| 430 | + | |
429 | 431 | | |
430 | | - | |
431 | | - | |
| 432 | + | |
| 433 | + | |
432 | 434 | | |
433 | 435 | | |
| 436 | + | |
| 437 | + | |
434 | 438 | | |
435 | | - | |
436 | | - | |
| 439 | + | |
| 440 | + | |
437 | 441 | | |
438 | 442 | | |
| 443 | + | |
439 | 444 | | |
440 | 445 | | |
441 | 446 | | |
| |||
0 commit comments