Comments (4)
Hi @anantzoid,
Thanks for raising an issue!
All of these forms work for me:
tritonserver --model-store models --cache-config local,size=1048576
tritonserver --model-store models --cache-config "local,size=1048576"
tritonserver --model-store models --cache-config=local,size=1048576
- Can you share the full command and corresponding full error/log you're getting for this format?
- Also, can you share which shell you're using (e.g., the output of echo ${SHELL})?
from server.
Thanks for the quick reply @rmccorm4!
Here's the error:
=============================
== Triton Inference Server ==
=============================
NVIDIA Release 24.01 (build 80100513)
Triton Server Version 2.42.0
Copyright (c) 2018-2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
Various files include modifications (c) NVIDIA CORPORATION & AFFILIATES. All rights reserved.
This container image and its contents are governed by the NVIDIA Deep Learning Container License.
By pulling and using the container, you accept the terms and conditions of this license:
https://developer.nvidia.com/ngc/nvidia-deep-learning-container-license
NOTE: CUDA Forward Compatibility mode ENABLED.
Using CUDA 12.3 driver version 545.23.08 with kernel driver version 535.129.03.
See https://docs.nvidia.com/deploy/cuda-compatibility/ for details.
tritonserver: unrecognized option '--cache-config local,size=10485760'
I'm running it inside a Kubernetes pod, so it's not an interactive shell.
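For context on why only some forms work in a pod: a Kubernetes `args:` list passes each list item to the container as a single argv entry, with no shell word splitting. A minimal sketch of the difference, using a hypothetical printargs helper (not tritonserver itself) that just prints each argv entry it receives:

```shell
# printargs is a stand-in option parser: it prints each argv entry it
# receives, one bracketed entry per line, to show how arguments split.
printargs() { for a in "$@"; do printf '[%s]\n' "$a"; done; }

# A single quoted string (like one Kubernetes args: item containing
# "--cache-config local,size=10485760") arrives as ONE argv entry,
# which an option parser rejects as one unknown flag:
printargs '--cache-config local,size=10485760'

# Two separate entries, or the --flag=value form (still one entry,
# but well-formed on its own), parse correctly:
printargs --cache-config local,size=10485760
printargs --cache-config=local,size=10485760
```

In a pod spec that corresponds to either two separate `args:` entries ("--cache-config", "local,size=10485760") or one entry in the `--cache-config=local,size=10485760` form, rather than one entry containing the space.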
I see, thanks!
Would you like to contribute the quick doc change? (You'd need to sign and email the CLA outlined in CONTRIBUTING.md)
Otherwise I can make the quick change.
Thanks,
Ryan
Seems like it'll be quicker if you do it on your end. Thanks!