================================================================================================
Dataset Benchmark
================================================================================================

OpenJDK 64-Bit Server VM 21.0.10+7-LTS on Linux 6.14.0-1017-azure
AMD EPYC 7763 64-Core Processor
back-to-back map long:                    Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                6470           6651         255         15.5          64.7       1.0X
DataFrame                                          1215           1298         117         82.3          12.2       5.3X
Dataset                                            1782           1841          84         56.1          17.8       3.6X

OpenJDK 64-Bit Server VM 21.0.10+7-LTS on Linux 6.14.0-1017-azure
AMD EPYC 7763 64-Core Processor
back-to-back map:                         Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                7504           7521          24         13.3          75.0       1.0X
DataFrame                                          2805           2813          12         35.6          28.1       2.7X
Dataset                                            7538           7570          46         13.3          75.4       1.0X

OpenJDK 64-Bit Server VM 21.0.10+7-LTS on Linux 6.14.0-1017-azure
AMD EPYC 7763 64-Core Processor
back-to-back filter Long:                 Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                4369           4441         101         22.9          43.7       1.0X
DataFrame                                           724            759          42        138.1           7.2       6.0X
Dataset                                            2397           2416          27         41.7          24.0       1.8X

OpenJDK 64-Bit Server VM 21.0.10+7-LTS on Linux 6.14.0-1017-azure
AMD EPYC 7763 64-Core Processor
back-to-back filter:                      Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                2111           2113           3         47.4          21.1       1.0X
DataFrame                                           112            122           8        893.3           1.1      18.9X
Dataset                                            2413           2420           9         41.4          24.1       0.9X

OpenJDK 64-Bit Server VM 21.0.10+7-LTS on Linux 6.14.0-1017-azure
AMD EPYC 7763 64-Core Processor
aggregate:                                Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD sum                                            1416           1427          15         70.6          14.2       1.0X
DataFrame sum                                        68             84          11       1473.9           0.7      20.9X
Dataset sum using Aggregator                       1965           2047         115         50.9          19.7       0.7X
Dataset complex Aggregator                         5152           5313         227         19.4          51.5       0.3X


