Comments (6)
Hi,
thanks for your interest to the lock-free queue and the article. Actually, the queue in the repository is developed further than the original implementation in the article. In particular, since volatile is considered harmful and bad practice the queue in the repository uses compiler barriers where necessary instead of defining the queue properties as volatile.
builtin_expect
is just a compiler hint how to place the code, e.g. in the code at the below
if (builtin_expect(cond, 0)) {
A;
} else {
B;
}
while A
expression is placed before B
, compiler moves it after more likely B
. This is bit different optimization than volatile
specifier.
from blog.
Thanks for your response. I'm looking at the code from the article and the n consumer m producer lock free implementation and it seems like they are both the same. Could you point me to where the differences are ? I apologize since it is not immediately obvious where the differences are. IT appears that the code in the repo simply discards volatile and it still works.
from blog.
To be more clear, my question comes from the while loop in the push and pop methods from lines 250 and 297. what prevents the compiler from generating code that loads tail and last_head_ into registers and use them in the while loop "while (_builtin_expect(thr_pos().tail >= last_head, 0))," ?
from blog.
The exact change which you are looking for is 307a6e9 : initially the queue was developed with too strong memory ordering requirements while x86-64 already provides strong enough guarantees. So the commit references Intel documentation about the memory ordering guarantees. The commit removes volatile
and introduces memory and compiler barriers. Please follow the latest commits (git log lockfree_rb_q.cc
) - some of the barriers also were optimized out.
what prevents the compiler from generating code that loads tail and last_head_ into registers and use them in the while loop "while (_builtin_expect(thr_pos().tail >= last_head, 0))," ?
thr_pos().tail
is our thread local variable, nobody changes it but us, so we're completely OK with letting the compiler cache it. Meantime latst_head_
is calculated inside the loop body, so compiler isn't allowed to ignore the variable update in the loop and just cache some old value.
from blog.
@krizhanovsky Thanks! Makes perfect sense now! Appreciate your prompt responses.
from blog.
Nice to read that my answers helped you!
from blog.
Related Issues (5)
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from blog.