Comments (3)
There are no plans to include this in the immediate future. To limit the scope of the project, we want to focus solely on asynchronously storing distributed parameters. I should definitely clarify this in the documentation to prevent misunderstanding.
It is always possible to use Glint's asynchronous methods and impose blocks or waits to make it behave synchronously or bounded-synchronously (SSP). However, this is actually quite difficult and without taking proper care, such blocks can deadlock the scala execution context...
In general, if the dataset RDD is partitioned sufficiently well, Spark should already load balance the tasks based on data locality (and use a fallback timeout mechanism in case of stragglers). I have found that Spark's own methods for dealing with stragglers work surprisingly well.
from glint.
Can I add iteration information in pull message, servers record the latest iteration number, and send stop or start signal back to workers?
from glint.
It is possible but not easy. It would require rewriting some of the core code base of Glint. In particular, you'd have to change the information that is send over the network, which requires modifying the very low level serialization routines. Recording iteration numbers and sending start/stop signals would be completely new functionality that would need implementing.
I am not planning on doing that soon, but if you wish to try that, I'd be happy to help with any questions or problems that arise.
from glint.
Related Issues (20)
- Actor Not Found
- PullFailedException in large dataset HOT 8
- Implementation bug in ColumnIterator? HOT 15
- A question about glint HOT 4
- Rework of Glint internals HOT 7
- Struggling with data transfer / actor disassociated HOT 5
- Not able to pull matrix slice with rows != cols HOT 5
- Look into Akka Artery
- Random init HOT 1
- why make "push" as an “accumulator” rather than “replacer”? HOT 6
- BigMatrix should support push(rows)
- Can glint support BigInt type? HOT 1
- Does glint support (key, value) store ? HOT 1
- BigMatrix push and pull is high network consumption HOT 1
- Can not create Glint Client in Apache Spark HOT 4
- Akka Actor Error when initializing Glint Client HOT 2
- cluster conf example
- Got runtime issues when using spark-shell HOT 2
- Need Save Operation to store the big vector/matrix into HDFS HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from glint.