CUDA is a general-purpose parallel computing architecture that leverages the parallel compute engine in NVIDIA graphics processing units (GPUs) to solve many complex computational problems in a fraction of the time required on a CPU. It includes the CUDA Instruction Set Architecture (ISA) and the parallel compute engine in the GPU. To program to the CUDA™ architecture, developers can, today, use C, one of the most widely used high-level programming languages, which can then be run at great performance on a CUDA™-enabled processor. Other languages will be supported in the future, including FORTRAN and C++.
shelxh_omp.f:15100.72:
52 IF(T.GT.AQtmp)GOTO 50
1
shelxh_omp.f:15104.72:
50 end do
2
Warning: Deleted feature: GOTO at (1) jumps to END of construct at (2)
shelxh_omp.f:15101.72:
IF(N.GT.LX)GOTO 50
1
shelxh_omp.f:15104.72:
50 end do
2
Warning: Deleted feature: GOTO at (1) jumps to END of construct at (2)

gfortran -fopenmp -O3 -ffast-math -funroll-all-loops -march=native -mtune=native -ftree-vectorize -fvect-cost-model shelxh_omp.f shelxlv_omp.f -o shelxlifc3

real 0m1.147s
user 0m2.594s
sys 0m0.109s

real 0m3.354s
user 0m11.618s
sys 0m0.073s
real 0m8.552s
user 0m8.501s
sys 0m0.050s
real 0m8.314s
user 0m8.263s
sys 0m0.040s

real 0m8.330s
user 0m8.300s
sys 0m0.027s

real 0m3.011s
user 0m2.999s
sys 0m0.012s

real 0m3.163s
user 0m2.963s
sys 0m0.033s

-fstrict-aliasing -pipe -fomit-frame-pointer

g77 shelxl.f shelxlv.f etime_.c fdate_.c -O3 -ffast-math -fstrict-aliasing -pipe -fomit-frame-pointer -o bench/shelxl10
real 0m4.352s
user 0m4.325s
sys 0m0.026s
gfortran shelxl.f shelxlv.f etime_.c fdate_.c -O3 -ffast-math -fstrict-aliasing -pipe -fomit-frame-pointer -o bench/shelxl11

real 0m8.469s
user 0m8.426s
sys 0m0.038s

time shelxl 6rxn
........................
real 0m3.220s
user 0m3.018s
sys 0m0.035s

time ./shelxlcuda 6rxn
.......................
real 0m1.646s
user 0m1.533s
sys 0m0.106s
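
To put the last two runs side by side, here is a minimal sketch that turns the `real` figures printed by `time` into a speedup ratio. The helper names `parse_time` and `speedup` are made up for this illustration, and it assumes the `NmS.SSSs` format shown in the output above.

```python
import re


def parse_time(stamp: str) -> float:
    """Convert a `time` stamp such as '0m3.220s' into seconds."""
    match = re.fullmatch(r"(\d+)m(\d+(?:\.\d+)?)s", stamp.strip())
    if match is None:
        raise ValueError(f"unrecognised time stamp: {stamp!r}")
    minutes, seconds = match.groups()
    return int(minutes) * 60 + float(seconds)


def speedup(baseline: str, candidate: str) -> float:
    """Return how many times faster `candidate` ran than `baseline`."""
    return parse_time(baseline) / parse_time(candidate)


# 'real' values copied from the two runs above
cpu_real = "0m3.220s"   # time shelxl 6rxn
cuda_real = "0m1.646s"  # time ./shelxlcuda 6rxn

print(f"CUDA build vs. plain build: {speedup(cpu_real, cuda_real):.2f}x")
```

With those two numbers the ratio comes out at roughly 1.96x in wall-clock time. This is a single run of each binary, so treat it as a rough indication rather than a proper benchmark.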