Comments (6)
The error is coming from queue producers using your ecto pool (not the notifier, which uses a dedicated postgrex connection). The second error could be coming from anywhere really, but it seems to be well after the fact when it has gathered too many connections.
Oh well! Thanks for digging through those logs.
I'm going to release v0.11 today. You won't need to run from master for long.
from oban.
Thanks for the report! That is a nasty bug.
I'm fairly sure this has to do with the circuit breaker getting into a loop and recreating the connection. The problem is that I can't recreate it so farβI haven't seen this issue anywhere else so far. Working on recreation and a fix currently.
Out of curiosity, how many nodes are you running?
from oban.
The circuit breaker logs a structured message with the error message that tripped the circuit. Do you have any logs with those error reports?
from oban.
We are running 3 nodes.
Also now, after restarting all of them, the issue seems to have disappeared (for now).
What might also be interesting is that our underlying infrastructure provider seemed to have some network issues today, so I assume the issue only happens when the connections to postgres break.
Now that our infrastructure provider has fixed the issue and after the server restart, we are back to a normal amount of connections.
About the logs I will go and see what I find, but I can't promise anything, since our system is not yet live on production and we are still working on our kibana setup π.
from oban.
@andreasknoepfle Glad to hear it has resolved itself. Hopefully this isn't something you run into again!
In any case, I've made a change to how the circuit breaker handles reconnection. It should prevent this from happening in the future. I say should here because I can't replicate the issue with tests π€·ββ
from oban.
So I looked through the logs and just found lots and lots of those:
{source=oban, error=connection not available and request was dropped from queue after 2997ms. You can configure how long requests wait in the queue using :queue_target and :queue_interval. See DBConnection.start_link/2 for more information, message=Circuit temporarily tripped}
And then afterwards it is a sea of those, where I can't say if there is anything valuable in between:
"Postgrex.Protocol (#PID<0.5139.0>) failed to connect: ** (Postgrex.Error) FATAL 53300 (too_many_connections) remaining connection slots are reserved for non-replication superuser connections"
Thanks for looking into this β€οΈ π
Going to update to your latest commit now and will report back when this ever happens again :)
from oban.
Related Issues (20)
- Oban.resume_all_queues/_ issues HOT 1
- Oban Met during app shutdown HOT 2
- [Questions] Oban Jobs management HOT 1
- Oban bringing app down on Prod with error (expected :repo to implement Ecto.Repo, got: MyApp.Repo) HOT 2
- Chain worker options ignored when passed through use HOT 1
- How to use Oban.Pro.Relay with Oban.Pro.Workers.Chunk HOT 2
- No config registered for [Oban, Oban.Met] instance HOT 9
- Can't update queue limits manually in Oban Web HOT 2
- ArgumentError when opening Oban Web before Oban executes a job and/or the jobs table is empty HOT 1
- Regression in `all_enqueued` handling of timestamp time zones HOT 3
- Oban Crontab : make sure that the module exist and is a job HOT 1
- Lazy Loading in Environments Results in Unknown Worker HOT 1
- Flooded with Oban errors in Sentry since upgrading (see description for versions) HOT 1
- Prevent Postgres notifier from crashing without connectivity HOT 1
- Replace pg_notify insert trigger with in-app notifications
- Starting postgres and sqlite correctly for development HOT 2
- Unstable test durations when using `Oban.Pro.Testing.run_workflow/2` HOT 1
- Test helpers don't work with SQLite3 unless prefix: false is explicitely used HOT 1
- ** (UndefinedFunctionError) function Ecto.Query.Builder.OrderBy.apply/2 is undefined or private HOT 3
- Global Partitioning Docs HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
π Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. πππ
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google β€οΈ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from oban.