Code Monkey home page Code Monkey logo

Comments (6)

jan-wassenberg avatar jan-wassenberg commented on June 8, 2024

Just to confirm, do you have backslashes in that command line so that all are indeed passed in?
Does the binary print num_threads : 2?
Even two threads should help, but it depends on the platform. Maybe one core is already enough to saturate memory bandwidth?

from gemma.cpp.

windmaple avatar windmaple commented on June 8, 2024
threads

Yes, pretty sure gemma.cpp accepted the param. See the screenshot (was using 6 this time).

You are probably right; it may have hit the memory bottleneck even w/ 1 thread. Not sure how to check though.

Btw, this runs on an Android phone.

from gemma.cpp.

austinvhuang avatar austinvhuang commented on June 8, 2024

it won't always be monotonically increasing with # threads, can be quite system dependent so takes a bit of experimentation. You might want to try 2b-it-sfp which should be faster in general and may be less mem bandwidth bound.

Neat to hear it's running on an android phone! what model?

from gemma.cpp.

windmaple avatar windmaple commented on June 8, 2024

Running on Xiaomi 14

from gemma.cpp.

jan-wassenberg avatar jan-wassenberg commented on June 8, 2024

Good, so it's getting the argument value correctly. You can run STREAM to benchmark bandwidth, it also supports threading.

+1 to the SFP suggestion.

from gemma.cpp.

austinvhuang avatar austinvhuang commented on June 8, 2024

Closing for now, if there's anything that's not addressed above, feel free to chime in. Also added a small note to the README "What are some easy ways to make the model run faster?" here https://github.com/google/gemma.cpp?tab=readme-ov-file#troubleshooting-and-faqs

from gemma.cpp.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.