You can add one option to your build for production:
-localWorkers 8
–
Where 8 is the number of concurrent threads that calculate permutations. All you have to do is to adjust this number to the number that is more convenient to you. See GWT compilation performance (thanks to Dennis Ich comment).
If you are compiling to the testing environment, you can also use:
-draftCompile
which enables faster, but less-optimized compilations
-optimize 0
which does not optimize your code (9 is the max optimization value)
Another thing that more than doubled the build and hosted mode performance was the use of an SSD disk (now hostedmode works like a charm). It's not an cheap solution, but depending on how much you use GWT and the cost of your time, it may worth it!
Hope this helps you!