T113-S3 ARM 及 DSP Benchmark
-
ARM 双核SMP ,系统 rtems5,主频 1.08G
[/] # coremark 2K performance run parameters for coremark. CoreMark Size : 666 Total ticks : 12680 Total time (secs): 12.680000 Iterations/Sec : 3154.574132 Iterations : 40000 Compiler version : GCC7.5.0 20191114 (RTEMS 5, RSB 5.not_released, Newlib 7947581) Compiler flags : -O2 Memory location : Please put data memory location here (e.g. code in flash, data on heap etc) seedcrc : 0xe9f5 [0]crclist : 0xe714 [0]crcmatrix : 0x1fd7 [0]crcstate : 0x8e3a [0]crcfinal : 0x25b5 Correct operation validated. See README.md for run and reporting rules. CoreMark 1.0 : 3154.574132 / GCC7.5.0 20191114 (RTEMS 5, RSB 5.not_released, Newlib 7947581) -O2 / Heap [/] # dhry 10000000 Dhrystone Benchmark, Version 2.1 (Language: C) Program compiled without 'register' attribute Execution starts, 10000000 runs through Dhrystone Execution ends Final values of the variables used in the benchmark: Int_Glob: 5 should be: 5 Bool_Glob: 1 should be: 1 Ch_1_Glob: A should be: A Ch_2_Glob: B should be: B Arr_1_Glob[8]: 7 should be: 7 Arr_2_Glob[8][7]: 10000010 should be: Number_Of_Runs + 10 Ptr_Glob-> Ptr_Comp: 1080941968 should be: (implementation-dependent) Discr: 0 should be: 0 Enum_Comp: 2 should be: 2 Int_Comp: 17 should be: 17 Str_Comp: DHRYSTONE PROGRAM, SOME STRING should be: DHRYSTONE PROGRAM, SOME STRING Next_Ptr_Glob-> Ptr_Comp: 1080941968 should be: (implementation-dependent), same as above Discr: 0 should be: 0 Enum_Comp: 1 should be: 1 Int_Comp: 18 should be: 18 Str_Comp: DHRYSTONE PROGRAM, SOME STRING should be: DHRYSTONE PROGRAM, SOME STRING Int_1_Loc: 5 should be: 5 Int_2_Loc: 13 should be: 13 Int_3_Loc: 7 should be: 7 Enum_Loc: 1 should be: 1 Str_1_Loc: DHRYSTONE PROGRAM, 1'ST STRING should be: DHRYSTONE PROGRAM, 1'ST STRING Str_2_Loc: DHRYSTONE PROGRAM, 2'ND STRING should be: DHRYSTONE PROGRAM, 2'ND STRING Microseconds for one run through Dhrystone: 0.3 Dhrystones per Second: 3527884.4 DMIPS: 2007.90 [/] # linpack 100 Rolled Double Precision Linpack Benchmark - PC Version in 'C/C++' Compiler rtems arm-rtems5-gcc 7.5.0 20191114 Optimisation -O2 norm resid resid machep x[0]-1 x[n-1]-1 1.7 7.41628980e-14 2.22044605e-16 -1.49880108e-14 -1.89848137e-14 Times are reported for matrices of order 100 1 pass times for array with leading dimension of 201 dgefa dgesl total Mflops unit ratio 0.00519 0.00016 0.00535 128.35 0.0156 0.0955 Calculating matgen overhead 100 times 0.04 seconds 1000 times 0.37 seconds 2000 times 0.74 seconds 4000 times 1.48 seconds 8000 times 2.96 seconds 16000 times 5.92 seconds Overhead for 1 matgen 0.00037 seconds Calculating matgen/dgefa passes for 5 seconds 100 times 0.55 seconds 200 times 1.09 seconds 400 times 2.18 seconds 800 times 4.36 seconds 1600 times 8.72 seconds Passes used 917 Times for array with leading dimension of 201 dgefa dgesl total Mflops unit ratio 0.00508 0.00016 0.00524 131.12 0.0153 0.0935 0.00508 0.00016 0.00524 131.12 0.0153 0.0935 0.00508 0.00016 0.00524 131.11 0.0153 0.0935 0.00508 0.00016 0.00524 131.13 0.0153 0.0935 0.00508 0.00016 0.00524 131.13 0.0153 0.0935 Average 131.12 Calculating matgen2 overhead Overhead for 1 matgen 0.00037 seconds Times for array with leading dimension of 200 dgefa dgesl total Mflops unit ratio 0.00508 0.00016 0.00523 131.22 0.0152 0.0934 0.00508 0.00016 0.00523 131.22 0.0152 0.0934 0.00508 0.00016 0.00523 131.21 0.0152 0.0935 0.00508 0.00016 0.00523 131.21 0.0152 0.0935 0.00508 0.00016 0.00523 131.21 0.0152 0.0935 Average 131.21 Rolled Double Precision 131.12 Mflops **hifi4 dsp ,主频600M,系统 freertos** Dhrystone Benchmark, Version 2.1 (Language: C) Program compiled without 'register' attribute Execution starts, 10000000 runs through Dhrystone Execution ends Final values of the variables used in the benchmark: Int_Glob: 5 should be: 5 Bool_Glob: 1 should be: 1 Ch_1_Glob: A should be: A Ch_2_Glob: B should be: B Arr_1_Glob[8]: 7 should be: 7 Arr_2_Glob[8][7]: 10000010 should be: Number_Of_Runs + 10 Ptr_Glob-> Ptr_Comp: 933400000 should be: (implementation-dependent) Discr: 0 should be: 0 Enum_Comp: 2 should be: 2 Int_Comp: 17 should be: 17 Str_Comp: DHRYSTONE PROGRAM, SOME STRING should be: DHRYSTONE PROGRAM, SOME STRING Next_Ptr_Glob-> Ptr_Comp: 933400000 should be: (implementation-dependent), same as above Discr: 0 should be: 0 Enum_Comp: 1 should be: 1 Int_Comp: 18 should be: 18 Str_Comp: DHRYSTONE PROGRAM, SOME STRING should be: DHRYSTONE PROGRAM, SOME STRING Int_1_Loc: 5 should be: 5 Int_2_Loc: 13 should be: 13 Int_3_Loc: 7 should be: 7 Enum_Loc: 1 should be: 1 Str_1_Loc: DHRYSTONE PROGRAM, 1'ST STRING should be: DHRYSTONE PROGRAM, 1'ST STRING Str_2_Loc: DHRYSTONE PROGRAM, 2'ND STRING should be: DHRYSTONE PROGRAM, 2'ND STRING Microseconds for one run through Dhrystone: 0.8 Dhrystones per Second: 1243935.8 DMIPS: 707.99 Rolled Double Precision Linpack Benchmark - PC Version in 'C/C++' Compiler xtensa hifi4 dsp xtensa-elf-gcc 10.2 Optimisation -O2 norm resid resid machep x[0]-1 x[n-1]-1 1.7 7.41628980e-14 2.22044605e-16 -1.49880108e-14 -1.89848137e-14 Times are reported for matrices of order 100 1 pass times for array with leading dimension of 201 dgefa dgesl total Mflops unit ratio 0.07600 0.00300 0.07900 8.69 0.2301 1.4107 Calculating matgen overhead 100 times 0.43 seconds 200 times 0.85 seconds 400 times 1.70 seconds 800 times 3.41 seconds 1600 times 6.81 seconds Overhead for 1 matgen 0.00426 seconds Calculating matgen/dgefa passes for 5 seconds 100 times 8.05 seconds Passes used 62 Times for array with leading dimension of 201 dgefa dgesl total Mflops unit ratio 0.07621 0.00231 0.07851 8.75 0.2287 1.4021 0.07621 0.00231 0.07851 8.75 0.2287 1.4021 0.07621 0.00231 0.07851 8.75 0.2287 1.4021 0.07621 0.00231 0.07851 8.75 0.2287 1.4021 0.07621 0.00231 0.07851 8.75 0.2287 1.4021 Average 8.75 Calculating matgen2 overhead Overhead for 1 matgen 0.00426 seconds Times for array with leading dimension of 200 dgefa dgesl total Mflops unit ratio 0.07629 0.00231 0.07860 8.74 0.2289 1.4035 0.07629 0.00231 0.07860 8.74 0.2289 1.4035 0.07629 0.00231 0.07860 8.74 0.2289 1.4035 0.07627 0.00232 0.07860 8.74 0.2289 1.4035 0.07629 0.00231 0.07860 8.74 0.2289 1.4035 Average 8.74 Rolled Double Precision 8.74 Mflops 2K performance run parameters for coremark. CoreMark Size : 666 Total ticks : 16258 Total time (secs): 16.258000 Iterations/Sec : 1414.688154 Iterations : 23000 Compiler version : GCC10.2.0 Compiler flags : -O2 Memory location : STACK seedcrc : 0xe9f5 [0]crclist : 0xe714 [0]crcmatrix : 0x1fd7 [0]crcstate : 0x8e3a [0]crcfinal : 0xd340 Correct operation validated. See README.md for run and reporting rules. CoreMark 1.0 : 1414.688154 / GCC10.2.0 -O2 / STACK
目前hifi dsp采用的自编译的gcc编译器,采用R528 SDK提供的指令集,对linpack的跑分存疑,谁有candence explore能否跑个分比对下,谢谢
-
目前hifi dsp采用的自编译的gcc编译器
这个赞!
dsp的gcc可以拿来玩吗?只有基本指令还是有部分DSP指令?
-
玩没问题,可以直接链接xcc的库文件(包含在R528的SDK中),这样可以解决math库中TIE指令和SIMD的问题,,用GCC直接编译的newlib的math库没有SIMD指令,具体版权问题需要咨询全志SDK中的二进制库文件在GCC中使用是否侵权
-
xtensa-hifidsp-gcc.tar.bz2
仅用于学习及研究目的,不提供任何保证,商业用途请联系全志购买candence exploreC库的头文件已经被替换为SDK中XCLIB的头文件,如要使用newlib请自行重新编译安装newlib
-
好耶,又可以学习新知识了!
-
请问大佬是怎么把 rtems 搬到 T113 上去的?
-
自己写个RTEMS的bsp就可以了,大部分驱动在freebsd中有,稍微修改下添加到libbsd中可以直接工作
-
@yao0718 提供的这个环境,是直接带工具链的吗?有没有简易的使用说明?
-
@yao0718 个人用途的HIFI4 DSP GCC是怎么搞到的?联系candence还是联系全志?
-
@adaofu01 人家大佬自己搓的
-
@yuzukitsuru 全志
Copyright © 2024 深圳全志在线有限公司 粤ICP备2021084185号 粤公网安备44030502007680号