Based on the performance characteristics of the multilevel memory architectures of the most popular microprocessors, this paper summarizes and discusses key techniques for parallelizing and optimizing seven typical application codes and model physical problems under both the message-passing (MPI) and shared-memory (OpenMP) standard parallel programming paradigms. Representative benchmark results on six parallel computers are also reported in detail.