iof_hnp finalizes before all processes reach IOF COMPLETE, so the read handlers of some processes have not been released when the HNP starts to finalize.
This happens when running an MPI application compiled with mpicc. ompi and prrte use the same external pmix and libevent.
I included verbose output from iof and state. Sorry for the amount of information, but I think it is helpful.
[saturn.icl.utk.edu:81156] [[6128,0],0] orted_cmd: received add_local_procs
[saturn.icl.utk.edu:81156] [[6128,0],0] iof:hnp pulling fd 38 for process [[6128,14],0]
[saturn.icl.utk.edu:81156] defining endpt: file iof_hnp.c line 366 fd 38
[saturn.icl.utk.edu:81156] [[6128,0],0] iof:hnp pushing fd 32 for process [[6128,14],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] defining read event for [[6128,14],0]: iof_hnp.c 187
[saturn.icl.utk.edu:81156] [[6128,0],0] iof:hnp pushing fd 39 for process [[6128,14],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] defining read event for [[6128,14],0]: iof_hnp.c 190
[saturn.icl.utk.edu:81156] [[6128,0],0] iof:hnp pushing fd 41 for process [[6128,14],1]
[saturn.icl.utk.edu:81156] [[6128,0],0] defining read event for [[6128,14],1]: iof_hnp.c 187
[saturn.icl.utk.edu:81156] [[6128,0],0] iof:hnp pushing fd 43 for process [[6128,14],1]
[saturn.icl.utk.edu:81156] [[6128,0],0] defining read event for [[6128,14],1]: iof_hnp.c 190
[saturn.icl.utk.edu:81156] [[6128,0],0] iof:hnp pushing fd 45 for process [[6128,14],2]
[saturn.icl.utk.edu:81156] [[6128,0],0] defining read event for [[6128,14],2]: iof_hnp.c 187
[saturn.icl.utk.edu:81156] [[6128,0],0] iof:hnp pushing fd 47 for process [[6128,14],2]
[saturn.icl.utk.edu:81156] [[6128,0],0] defining read event for [[6128,14],2]: iof_hnp.c 190
[saturn.icl.utk.edu:81156] [[6128,0],0] iof:hnp pushing fd 49 for process [[6128,14],3]
[saturn.icl.utk.edu:81156] [[6128,0],0] defining read event for [[6128,14],3]: iof_hnp.c 187
[saturn.icl.utk.edu:81156] [[6128,0],0] iof:hnp pushing fd 51 for process [[6128,14],3]
[saturn.icl.utk.edu:81156] [[6128,0],0] defining read event for [[6128,14],3]: iof_hnp.c 190
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATE PROC [[6128,14],0] STATE RUNNING AT base/odls_base_default_fns.c:1185
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATING PROC [[6128,14],0] STATE RUNNING PRI 4
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATE PROC [[6128,14],1] STATE RUNNING AT base/odls_base_default_fns.c:1185
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATING PROC [[6128,14],1] STATE RUNNING PRI 4
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATE PROC [[6128,14],2] STATE RUNNING AT base/odls_base_default_fns.c:1185
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATING PROC [[6128,14],2] STATE RUNNING PRI 4
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATE PROC [[6128,14],3] STATE RUNNING AT base/odls_base_default_fns.c:1185
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATING PROC [[6128,14],3] STATE RUNNING PRI 4
[saturn.icl.utk.edu:81156] [[6128,0],0] state:base:track_procs called for proc [[6128,14],0] state RUNNING
[saturn.icl.utk.edu:81156] [[6128,0],0] state:base:track_procs called for proc [[6128,14],1] state RUNNING
[saturn.icl.utk.edu:81156] [[6128,0],0] state:base:track_procs called for proc [[6128,14],2] state RUNNING
[saturn.icl.utk.edu:81156] [[6128,0],0] state:base:track_procs called for proc [[6128,14],3] state RUNNING
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATE JOB [6128,14] STATE RUNNING AT base/state_base_fns.c:683
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATING JOB [6128,14] STATE RUNNING PRI 4
[saturn.icl.utk.edu:81156] [[6128,0],0] iof:hnp pushing fd 0 for process [[6128,14],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATE PROC [[6128,14],0] STATE SYNC REGISTERED AT orted/pmix/pmix_server_gen.c:89
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATING PROC [[6128,14],0] STATE SYNC REGISTERED PRI 4
[saturn.icl.utk.edu:81156] [[6128,0],0] state:base:track_procs called for proc [[6128,14],0] state SYNC REGISTERED
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATE PROC [[6128,14],1] STATE SYNC REGISTERED AT orted/pmix/pmix_server_gen.c:89
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATING PROC [[6128,14],1] STATE SYNC REGISTERED PRI 4
[saturn.icl.utk.edu:81156] [[6128,0],0] state:base:track_procs called for proc [[6128,14],1] state SYNC REGISTERED
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATE PROC [[6128,14],3] STATE SYNC REGISTERED AT orted/pmix/pmix_server_gen.c:89
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATING PROC [[6128,14],3] STATE SYNC REGISTERED PRI 4
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATE PROC [[6128,14],2] STATE SYNC REGISTERED AT orted/pmix/pmix_server_gen.c:89
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATING PROC [[6128,14],2] STATE SYNC REGISTERED PRI 4
[saturn.icl.utk.edu:81156] [[6128,0],0] state:base:track_procs called for proc [[6128,14],3] state SYNC REGISTERED
[saturn.icl.utk.edu:81156] [[6128,0],0] state:base:track_procs called for proc [[6128,14],2] state SYNC REGISTERED
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATE JOB [6128,14] STATE SYNC REGISTERED AT base/state_base_fns.c:693
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATING JOB [6128,14] STATE SYNC REGISTERED PRI 4
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATE JOB [6128,14] STATE READY FOR DEBUGGERS AT base/plm_base_launch_support.c:731
[saturn.icl.utk.edu:81156] ACTIVATE: ANY STATE NOT FOUND
[saturn.icl.utk.edu:81156] [[6128,0],0] sending data from proc [[6128,14],2] of size 31 via PMIx to tool [[6128,13],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] read 31 bytes from stdout of [[6128,14],2]
[saturn.icl.utk.edu:81156] [[6128,0],0] sending data from proc [[6128,14],0] of size 31 via PMIx to tool [[6128,13],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] read 31 bytes from stdout of [[6128,14],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] sending data from proc [[6128,14],1] of size 146 via PMIx to tool [[6128,13],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] read 146 bytes from stdout of [[6128,14],1]
[saturn.icl.utk.edu:81156] [[6128,0],0] sending data from proc [[6128,14],3] of size 31 via PMIx to tool [[6128,13],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] read 31 bytes from stdout of [[6128,14],3]
[saturn.icl.utk.edu:81156] [[6128,0],0] sending data from proc [[6128,14],0] of size 116 via PMIx to tool [[6128,13],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] read 116 bytes from stdout of [[6128,14],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] sending data from proc [[6128,14],3] of size 116 via PMIx to tool [[6128,13],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] read 116 bytes from stdout of [[6128,14],3]
[saturn.icl.utk.edu:81156] [[6128,0],0] sending data from proc [[6128,14],2] of size 346 via PMIx to tool [[6128,13],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] read 346 bytes from stdout of [[6128,14],2]
[saturn.icl.utk.edu:81156] [[6128,0],0] sending data from proc [[6128,14],1] of size 346 via PMIx to tool [[6128,13],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] read 346 bytes from stdout of [[6128,14],1]
[saturn.icl.utk.edu:81156] [[6128,0],0] sending data from proc [[6128,14],0] of size 231 via PMIx to tool [[6128,13],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] read 231 bytes from stdout of [[6128,14],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] sending data from proc [[6128,14],3] of size 243 via PMIx to tool [[6128,13],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] read 243 bytes from stdout of [[6128,14],3]
[saturn.icl.utk.edu:81156] [[6128,0],0] iof:hnp:read handler [[6128,14],3] Error on connection:49
[saturn.icl.utk.edu:81156] [[6128,0],0] sending data from proc [[6128,14],3] of size 0 via PMIx to tool [[6128,13],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] read 0 bytes from stdout of [[6128,14],3]
[saturn.icl.utk.edu:81156] [[6128,0],0] sending data from proc [[6128,14],3] of size 0 via PMIx to tool [[6128,13],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] read 0 bytes from stderr of [[6128,14],3]
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATE PROC [[6128,14],3] STATE IOF COMPLETE AT iof_hnp_read.c:328
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATING PROC [[6128,14],3] STATE IOF COMPLETE PRI 4
[saturn.icl.utk.edu:81156] [[6128,0],0] state:base:track_procs called for proc [[6128,14],3] state IOF COMPLETE
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATE PROC [[6128,14],3] STATE ABORTED BY SIGNAL AT base/odls_base_default_fns.c:1897
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATING PROC [[6128,14],3] STATE ABORTED BY SIGNAL PRI 0
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATE PROC [[6128,14],3] STATE WAITPID FIRED AT errmgr_default_hnp.c:647
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATING PROC [[6128,14],3] STATE WAITPID FIRED PRI 4
[saturn.icl.utk.edu:81156] [[6128,0],0] state:base:track_procs called for proc [[6128,14],3] state WAITPID FIRED
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATE PROC [[6128,14],3] STATE NORMALLY TERMINATED AT base/state_base_fns.c:715
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATING PROC [[6128,14],3] STATE NORMALLY TERMINATED PRI 4
[saturn.icl.utk.edu:81156] [[6128,0],0] state:base:track_procs called for proc [[6128,14],3] state NORMALLY TERMINATED
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATE PROC [[6128,14],3] STATE NORMALLY TERMINATED AT errmgr_default_hnp.c:206
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATING PROC [[6128,14],3] STATE NORMALLY TERMINATED PRI 4
[saturn.icl.utk.edu:81156] [[6128,0],0] state:base:cleanup_node on proc [[6128,14],3]
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATE PROC [[6128,14],3] STATE ABORTED BY SIGNAL AT base/plm_base_receive.c:352
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATING PROC [[6128,14],3] STATE ABORTED BY SIGNAL PRI 0
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATE PROC [[6128,14],3] STATE WAITPID FIRED AT errmgr_default_hnp.c:647
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATING PROC [[6128,14],3] STATE WAITPID FIRED PRI 4
[saturn.icl.utk.edu:81156] [[6128,0],0] state:base:track_procs called for proc [[6128,14],3] state NORMALLY TERMINATED
[saturn.icl.utk.edu:81156] [[6128,0],0] state:base:cleanup_node on proc [[6128,14],3]
[saturn.icl.utk.edu:81156] [[6128,0],0] state:base:track_procs called for proc [[6128,14],3] state WAITPID FIRED
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATE PROC [[6128,14],3] STATE NORMALLY TERMINATED AT base/state_base_fns.c:715
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATING PROC [[6128,14],3] STATE NORMALLY TERMINATED PRI 4
[saturn.icl.utk.edu:81156] [[6128,0],0] state:base:track_procs called for proc [[6128,14],3] state NORMALLY TERMINATED
[saturn.icl.utk.edu:81156] [[6128,0],0] state:base:cleanup_node on proc [[6128,14],3]
[saturn.icl.utk.edu:81156] [[6128,0],0] sending data from proc [[6128,14],0] of size 97 via PMIx to tool [[6128,13],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] read 97 bytes from stdout of [[6128,14],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] sending data from proc [[6128,14],1] of size 106 via PMIx to tool [[6128,13],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] read 106 bytes from stdout of [[6128,14],1]
[saturn.icl.utk.edu:81156] [[6128,0],0] sending data from proc [[6128,14],2] of size 106 via PMIx to tool [[6128,13],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] read 106 bytes from stdout of [[6128,14],2]
[saturn.icl.utk.edu:81156] [[6128,0],0] sending data from proc [[6128,14],0] of size 97 via PMIx to tool [[6128,13],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] read 97 bytes from stdout of [[6128,14],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] iof:hnp:read handler [[6128,14],1] Error on connection:41
[saturn.icl.utk.edu:81156] [[6128,0],0] sending data from proc [[6128,14],1] of size 0 via PMIx to tool [[6128,13],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] read 0 bytes from stdout of [[6128,14],1]
[saturn.icl.utk.edu:81156] [[6128,0],0] sending data from proc [[6128,14],1] of size 0 via PMIx to tool [[6128,13],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] read 0 bytes from stderr of [[6128,14],1]
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATE PROC [[6128,14],1] STATE IOF COMPLETE AT iof_hnp_read.c:328
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATING PROC [[6128,14],1] STATE IOF COMPLETE PRI 4
[saturn.icl.utk.edu:81156] [[6128,0],0] state:base:track_procs called for proc [[6128,14],1] state IOF COMPLETE
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATE PROC [[6128,14],1] STATE WAITPID FIRED AT base/odls_base_default_fns.c:1897
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATING PROC [[6128,14],1] STATE WAITPID FIRED PRI 4
[saturn.icl.utk.edu:81156] [[6128,0],0] state:base:track_procs called for proc [[6128,14],1] state WAITPID FIRED
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATE PROC [[6128,14],1] STATE NORMALLY TERMINATED AT base/state_base_fns.c:715
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATING PROC [[6128,14],1] STATE NORMALLY TERMINATED PRI 4
[saturn.icl.utk.edu:81156] [[6128,0],0] state:base:track_procs called for proc [[6128,14],1] state NORMALLY TERMINATED
[saturn.icl.utk.edu:81156] [[6128,0],0] state:base:cleanup_node on proc [[6128,14],1]
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATE JOB [6128,14] STATE NORMALLY TERMINATED AT base/state_base_fns.c:775
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATING JOB [6128,14] STATE NORMALLY TERMINATED PRI 4
[saturn.icl.utk.edu:81156] [[6128,0],0] state:dvm:check_job_complete on job [6128,14]
[saturn.icl.utk.edu:81156] [[6128,0],0] state:dvm releasing procs from node saturn
[saturn.icl.utk.edu:81156] [[6128,0],0] state:dvm releasing proc [[6128,14],0] from node saturn
[saturn.icl.utk.edu:81156] [[6128,0],0] state:dvm releasing proc [[6128,14],2] from node saturn
[saturn.icl.utk.edu:81156] [[6128,0],0] state:dvm:check_job_completed state is terminated - activating notify
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATE JOB [6128,14] STATE NOTIFY COMPLETED AT state_dvm.c:588
[saturn.icl.utk.edu:81156] [[6128,0],0] ACTIVATING JOB [6128,14] STATE NOTIFY COMPLETED PRI 4
[saturn.icl.utk.edu:81156] [[6128,0],0] iof:hnp:read handler [[6128,14],2] Error on connection:45
[saturn.icl.utk.edu:81156] [[6128,0],0] sending data from proc [[6128,14],2] of size 0 via PMIx to tool [[6128,13],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] read 0 bytes from stdout of [[6128,14],2]
[saturn.icl.utk.edu:81156] [[6128,0],0] sending data from proc [[6128,14],2] of size 0 via PMIx to tool [[6128,13],0]
[saturn.icl.utk.edu:81156] [[6128,0],0] read 0 bytes from stderr of [[6128,14],2]
prte: base/iof_base_frame.c:195: orte_iof_base_proc_destruct: Assertion `((0xdeafbeedULL << 32) + 0xdeafbeedULL) == ((opal_object_t *) (ptr->revstderr))->obj_magic_id' failed.
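
To make the ordering problem easier to spot in the output above, here is a small Python sketch that scans the verbose log and lists the procs that had IOF fds pushed but never reached IOF COMPLETE before the job was activated as NORMALLY TERMINATED. This is not part of prrte: the function name `procs_missing_iof_complete` and the regexes are assumptions based only on the log format shown here, and the sample is a shortened excerpt of the lines above with the hostname prefix dropped.

```python
import re

# "ACTIVATE PROC [[6128,14],3] STATE IOF COMPLETE AT iof_hnp_read.c:328"
PROC_STATE = re.compile(r"ACTIVATE PROC (\[\[\d+,\d+\],\d+\]) STATE (.+?) AT ")
# "iof:hnp pushing fd 32 for process [[6128,14],0]"
PUSH = re.compile(r"iof:hnp pushing fd \d+ for process (\[\[\d+,\d+\],\d+\])")
# "ACTIVATE JOB [6128,14] STATE NORMALLY TERMINATED AT base/state_base_fns.c:775"
JOB_DONE = re.compile(r"ACTIVATE JOB \[\d+,\d+\] STATE NORMALLY TERMINATED AT ")


def procs_missing_iof_complete(lines):
    """Return procs with pushed fds but no IOF COMPLETE activation
    seen before the job is activated as NORMALLY TERMINATED."""
    pushed, completed = set(), set()
    for line in lines:
        m = PUSH.search(line)
        if m:
            pushed.add(m.group(1))
            continue
        m = PROC_STATE.search(line)
        if m and m.group(2) == "IOF COMPLETE":
            completed.add(m.group(1))
            continue
        if JOB_DONE.search(line):
            # Anything that happens after this point is too late.
            break
    return sorted(pushed - completed)


# Shortened excerpt of the log above (hostname prefix removed).
sample = """\
[[6128,0],0] iof:hnp pushing fd 32 for process [[6128,14],0]
[[6128,0],0] iof:hnp pushing fd 41 for process [[6128,14],1]
[[6128,0],0] iof:hnp pushing fd 45 for process [[6128,14],2]
[[6128,0],0] iof:hnp pushing fd 49 for process [[6128,14],3]
[[6128,0],0] ACTIVATE PROC [[6128,14],3] STATE IOF COMPLETE AT iof_hnp_read.c:328
[[6128,0],0] ACTIVATE PROC [[6128,14],1] STATE IOF COMPLETE AT iof_hnp_read.c:328
[[6128,0],0] ACTIVATE JOB [6128,14] STATE NORMALLY TERMINATED AT base/state_base_fns.c:775
[[6128,0],0] iof:hnp:read handler [[6128,14],2] Error on connection:45
""".splitlines()

print(procs_missing_iof_complete(sample))
```

On this excerpt the sketch reports procs `[[6128,14],0]` and `[[6128,14],2]`, which matches the full log: only procs 1 and 3 reach IOF COMPLETE before the job is marked NORMALLY TERMINATED, and the read handler for proc 2 fires afterwards, right before the assertion in `orte_iof_base_proc_destruct`.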