Comments (5)
纯C版本只是当成参考实现,目前只做了 arm neon 优化,其他架构还没有。。
from ncnn.
但是SIMD优化是基于纯C实现的,SIMD的优化倍数是有限的,C的效率不高SIMD并行优化之后效率本身也是有限的,同时我在ARM V7的单核CPU上编译测试了TensorFlow和NCNN,均开了neon优化,squeezenet demo的性能差距也是蛮大的
from ncnn.
能否提供一下你们在ARM参考平台上TensorFlow和NCNN测试的性能对比数据?? 谢谢!
from ncnn.
我测试了海思安防芯片,HI3536平台(4核A17 1.4Ghz),开一个核心450ms。已经不错了
from ncnn.
纯C实现是没有优化的,速度可能比 arm 优化版本慢10倍以上。。
from ncnn.
Related Issues (20)
- MacOS下 new ncnn::Net() 崩溃 HOT 2
- Tile 算子似乎没有起作用
- yolov8训练完成的模型转成ncnn的模型后,推理不出结果,网上查了说要进行前后处理,确实不会,能发个cpp的例子看看么! HOT 4
- ncnn-20240410使用-DNCNN_BF16=OFF编译报错 HOT 3
- rnn(lstm,gru),解卷积的量化以及增加weight only的量化 HOT 1
- 可以给生成的pnnx.py中的pytorch model 增加初始化权重默认入参么,方便迁移到其他地方使用
- 预编译库中,希望能增加macOS下的动态库版本供下载
- [pnnx]:torch.clamp_min convert failed HOT 2
- 目前RISCV版本中用到的RVV intrinsic代码已经不是最新版本了,有升级riscv intrinsic代码的计划吗?
- Android下的build,为什么默认关闭exception呢? HOT 5
- openmp冲突引起crash
- iPhone创建ncnn::net崩溃 HOT 2
- benchmark测试占用率低
- 手动创建的net,推理慢了很多
- yolov8n模型在鲲鹏ARM机器的检测结果和pytorch结果不一样 HOT 6
- 我有3个GPU,但get_gpu_count()=1 HOT 8
- pnnx和ncnn输出不一致
- I convert onnx to ncnn successfully, but all my inference is all nan. Eg, the output of net.extract() is all nan.
- 鲲鹏920环境,yolov8n模型int8量化速度比默认的fp16慢了50% HOT 5
- 在对mtcnn模型第二层进行./ncnn2table过程中出现了段错误
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ncnn.