Efficient Algorithms And Optimizations For Scientific Computing On Many-Core Processors