在同一时间执行多个进程时速度急剧减慢

program main implicit none real*8 begin, end, Ht(2, 2), ls(4) integer i, j, k, ii, jj, kk integer,parameter::N_tiles = 20 integer,parameter::N_tilings = 100 integer,parameter::max_t_steps = 50 real*8,dimension(N_tiles*N_tilings,max_t_steps,5)::test_e, test_theta real*8 rand_val call random_seed() do i = 1, N_tiles*N_tilings do j = 1, max_t_steps do k = 1, 5 call random_number(rand_val) test_e(i, j, k) = rand_val call random_number(rand_val) test_theta(i, j, k) = rand_val end do end do end do call CPU_TIME(begin) do i = 1, 1001 do j = 1, 50 test_theta = test_theta+0.5d0*test_e end do end do call CPU_TIME(end) write(*, *) 'total time cost is : ', end-begin end program main

import numpy as np import time N_tiles = 20 N_tilings = 100 max_t_steps = 50 theta = np.ones((N_tiles*N_tilings, max_t_steps, 5), dtype=np.float64) e = np.ones((N_tiles*N_tilings, max_t_steps, 5), dtype=np.float64) begin = time.clock() for i in range(1001): for j in range(50): theta += 0.5*e end = time.clock() print('total time cost is {} s'.format(end-begin))

1条回答

网友

1楼 · 发布于 2024-09-27 21:33:31

代码在多次运行时可能很慢，因为必须通过有限带宽内存总线的内存越来越多。你知道吗

如果您只运行一个进程，一次只运行一个阵列，但启用OpenMP线程，则可以使其更快：

integer*8 :: begin, end, rate
...

call system_clock(count_rate=rate)
call system_clock(count=begin)

!$omp parallel do
do i = 1, 1001
  do j = 1, 50
    test_theta = test_theta+0.5d0*test_e
  end do
end do
!$omp end parallel do

call system_clock(count=end)
write(*, *) 'total time cost is : ', (end-begin)*1.d0/rate

在四核CPU上：

> gfortran -O3 testperformance.f90 -o result
> ./result 
 total time cost is :    15.135917384000001
> gfortran -O3 testperformance.f90 -fopenmp -o result
> ./result 
 total time cost is :    3.9464441830000001

相关问题更多 >

编程相关推荐

热门问题

热门文章