executor-hnsw-postgres's Issues

performance(HNSWPSQL): syncing is slow

Right now syncing is slow:

  • we iterate over the changes and apply them one at a time; they should be batched per sync operation type (index, update, delete)
  • on a rebuild, every operation is an index, so we should optimize for that case. Done in #5
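A minimal sketch of the batching idea, not the executor's actual code. It assumes each pending change is a hypothetical `(op_type, doc_id)` tuple and groups consecutive changes of the same type into batches, so ordering between different operation types is preserved. The name `batch_sync_ops` and the `batch_size` default are illustrative only.

```python
from itertools import groupby, islice
from typing import Iterable, Iterator, List, Tuple

# Hypothetical shape of one pending change: (operation type, document id).
Op = Tuple[str, str]


def batch_sync_ops(
    ops: Iterable[Op], batch_size: int = 1000
) -> Iterator[Tuple[str, List[str]]]:
    """Yield (op_type, doc_ids) batches while preserving the original order.

    Consecutive operations of the same type are merged into one run, and each
    run is split into chunks of at most `batch_size` ids.
    """
    for op_type, run in groupby(ops, key=lambda op: op[0]):
        ids = (doc_id for _, doc_id in run)
        while True:
            chunk = list(islice(ids, batch_size))
            if not chunk:
                break
            yield op_type, chunk
```

On a rebuild, where every operation is an index, the whole stream collapses into a single run that is only split by `batch_size`. Each yielded batch could then be applied with one bulk call instead of one call per document.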

Numbers before any perf refactoring

Performance

indexing 1000 ...       indexing 1000 takes 0 seconds (0.22s)
rolling update 3 replicas x 2 shards ...            psq_handler@19733[I]:Using existing table
    psq_handler@19738[I]:Using existing table
    psq_handler@19751[I]:Using existing table
    psq_handler@19759[I]:Using existing table
    psq_handler@19769[I]:Using existing table
    psq_handler@19779[I]:Using existing table
rolling update 3 replicas x 2 shards takes 0 seconds (0.82s)
search with 10 ...      search with 10 takes 0 seconds (0.23s)
indexing 10000 ...      indexing 10000 takes 0 seconds (0.75s)
rolling update 3 replicas x 2 shards ...            psq_handler@20547[I]:Using existing table
    psq_handler@20552[I]:Using existing table
    psq_handler@20564[I]:Using existing table
    psq_handler@20574[I]:Using existing table
    psq_handler@20626[I]:Using existing table
    psq_handler@20636[I]:Using existing table
rolling update 3 replicas x 2 shards takes 9 seconds (9.08s)
search with 10 ...      search with 10 takes 0 seconds (0.22s)
indexing 100000 ...     indexing 100000 takes 7 seconds (7.59s)
rolling update 3 replicas x 2 shards ...            psq_handler@24546[I]:Using existing table
    psq_handler@24551[I]:Using existing table
    psq_handler@24736[I]:Using existing table
    psq_handler@24746[I]:Using existing table
    psq_handler@24827[I]:Using existing table
    psq_handler@24837[I]:Using existing table
rolling update 3 replicas x 2 shards takes 7 minutes and 17 seconds (437.44s)
search with 10 ...      search with 10 takes 0 seconds (0.22s)

fail to connect to PostgreSQL with docker-compose

  • start a PostgreSQL service with docker:

docker run -e POSTGRES_PASSWORD=123456 -p 127.0.0.1:5432:5432/tcp postgres:13.2

  • build a flow with one executor: HNSWPostgresIndexer

  • run the flow locally; it works well

  • export the flow to a docker-compose YAML file and run it with docker-compose; this produces an error:

[image: error output]

jina version info:


- jina 3.3.19
- docarray 0.12.2
- jina-proto 0.1.8
- jina-vcs-tag (unset)
- protobuf 3.20.0
- proto-backend cpp
- grpcio 1.43.0
- pyyaml 6.0
- python 3.10.2
- platform Linux
- platform-release 4.4.0-186-generic
- platform-version #216-Ubuntu SMP Wed Jul 1 05:34:05 UTC 2020
- architecture x86_64
- processor x86_64
- uid 48710637999860
- session-id 906abcd2-c797-11ec-b1df-2c4d544656f4
- uptime 2022-04-29T16:37:11.758133
- ci-vendor (unset)
* JINA_DEFAULT_HOST (unset)
* JINA_DEFAULT_TIMEOUT_CTRL (unset)
* JINA_DEFAULT_WORKSPACE_BASE /home/chenhao/.jina/executor-workspace
* JINA_DEPLOYMENT_NAME (unset)
* JINA_DISABLE_UVLOOP (unset)
* JINA_FULL_CLI (unset)
* JINA_GATEWAY_IMAGE (unset)
* JINA_GRPC_RECV_BYTES (unset)
* JINA_GRPC_SEND_BYTES (unset)
* JINA_HUBBLE_REGISTRY (unset)
* JINA_HUB_CACHE_DIR (unset)
* JINA_HUB_NO_IMAGE_REBUILD (unset)
* JINA_HUB_ROOT (unset)
* JINA_LOG_CONFIG (unset)
* JINA_LOG_LEVEL (unset)
* JINA_LOG_NO_COLOR (unset)
* JINA_MP_START_METHOD (unset)
* JINA_RANDOM_PORT_MAX (unset)
* JINA_RANDOM_PORT_MIN (unset)
* JINA_VCS_VERSION (unset)
* JINA_CHECK_VERSION True
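
One possible cause, offered as an assumption since the error itself is only in the image: inside a container, `127.0.0.1` refers to the container itself, and the `docker run` command above publishes port 5432 only on the host's loopback interface. A small reachability check like the sketch below, run from inside the executor container, could confirm whether the database host and port are reachable at all. The function name `can_connect` is hypothetical.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to (host, port) succeeds within `timeout`."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name-resolution failures.
        return False
```

Calling `can_connect` with the host and port the indexer is configured to use shows whether the failure is at the network level, before looking at credentials or table setup.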
