Comments (5)
Thanks for opening your first issue here! Be sure to follow the issue template! If you are willing to raise PR to address this issue please do so, no need to wait for approval.
from airflow.
This happen due to log deduplication, which might happen when logs streaming from remote logging
airflow/airflow/utils/log/file_task_handler.py
Lines 130 to 140 in 3938f71
from airflow.
As I understood, the main problem is in log.splitlines(), which split log-string by simple lines and not by log-messages. Then function analyzes line by line and deduplicates lines, but we need to analyze and deduplicate log-messages.
As Airflow can be configured with a custom log-format, then we need to store the pattern in config (custom patterns) to split the whole log into log-messages.
from airflow.
If you have a suggestion how improve logging feel free to raise a PR which will work with any type of existed loggers without breaking changes.
from airflow.
I had the same multiple times, for example using DockerOperator which logs all stdout of the container upon failure. Also the logs are messed up not only because of the split lines but also because the file log handler per default tries to sort messages. This not only causes a lot of overhead on the server, it also changes the order any makes a confusion.
Looking forward that somebody raises a PR allowing log sorting and merging to be turned off :-)
from airflow.
Related Issues (20)
- Ineffiction dag_run endpoint HOT 1
- Airflow 2 to 3 auto migration rules HOT 5
- Evaluate DB migration restriction when upgrading to AF 3 HOT 2
- Confirmation text "Wait a minute" is confusing HOT 1
- Incorrect link to task detail from task instance list HOT 1
- S3ToGCSOperator broken in deferable mode HOT 7
- Unable to specify env variables using fieldPath HOT 3
- can't scroll task tries HOT 6
- Success callback not being executed in Custom kubernetesPodOperator HOT 1
- Logs issue HOT 2
- Bad display of xcom values HOT 2
- "cannot pickle '_cffi_backend.FFI' object" error in python/branch operator processing HOT 10
- OpenLineage - dag_run.update_state() - AssertionError: daemonic processes are not allowed to have children
- Simple auth manager HOT 5
- Airflow DAG access control permissions are not working HOT 4
- worker: Warm shutdown (MainProcess) HOT 4
- ONNX Model Inference Operator HOT 8
- GKEStartJobOperator's job_poll_interval parameter is not used by its GKEJobTrigger HOT 2
- kubernetes connection defined via env variable does not work with deferrable=True HOT 4
- post_execute() method does not work correctly when used along with a trigger HOT 7
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from airflow.