Code Monkey home page Code Monkey logo

Comments (6)

GabeAl avatar GabeAl commented on June 15, 2024 2

This certainly made sense when this testing likely happened. There have been a few things that have changed recently, including much more RAM bandwidth (in cases where that was bottlenecking the system), and much more efficient AVX512 hardware (including more on-die resources and optimizations) which might warrant re-running some of them. I guess it's a next-next generation RAxML idea, perhaps? I know ExaML was at least 3 times faster on KNL back in the day, but since then no system has treated double-precision AVX512 as a first class citizen again until Zen 4 and IceLake/Sapphire Rapids.

https://www.phoronix.com/review/intel-sapphirerapids-avx512

The above link shows over time the profound differences (including reduced penalties and proportionately greater performance) over time for essentially the same AVX512 code. A modern 12-channel 4800MHz DDR5 Genoa system or a new 60-core Sapphire Rapids system seems to net proportionately more performance from AVX512 in double-precision floating-point math than the sad old Skylake/Cascade Lake Xeons (which were indeed bottlenecked in many ways).

from raxml-ng.

amkozlov avatar amkozlov commented on June 15, 2024 1

Despite extensive experiments, we never managed to get reasonable speedups with AVX512 compared to AVX2. Hence, AVX512 support was never integrated to RAxML-NG.

from raxml-ng.

amkozlov avatar amkozlov commented on June 15, 2024

@GabeAl: you're right, it mostly concerns libpll, and we do have it on our radar anyway. Still, please feel free to open a corresponding issue in the (new) libpll repository:

https://github.com/xflouris/libpll

from raxml-ng.

GrassW avatar GrassW commented on June 15, 2024

May I know how exactly to compile with AVX512? eg: cmake -DENABLE_AVX512=TRUE ?
Thanks.

from raxml-ng.

amkozlov avatar amkozlov commented on June 15, 2024

@GabeAl thanks for the heads up!

Good to know that AMD added AVX512 support as well, and Genoa benchmark results look really impressive!

from raxml-ng.

GrassW avatar GrassW commented on June 15, 2024

@amkozlov @GabeAl thanks a lot, both of you. I see. I will consider AMD's CPU next time if my budget can cover them. :)

from raxml-ng.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.