Comments (10)

noahc1510 commented on September 3, 2024 14

This repo is entirely Python code. I have already run it on Ubuntu 22.04; you only need to fix a few dependencies manually.

from chatrtx.

flefevre commented on September 3, 2024 12

It would definitely be worthwhile to have a version compatible with Ubuntu / Linux, since there are scientific workstations running Linux.

noahc1510 commented on September 3, 2024 12

> Hi @noahc1510, can you describe the process you used to fix the dependencies?
>
> Would be nice to add what you did to the readme so people can use it on linux.

You can follow the guide below to fix the requirements. I tested it with the official installer and created a repo for it: https://github.com/noahc1510/trt-llm-rag-linux

System Requirements

  • Nvidia Driver: sudo apt install nvidia-driver-535
  • CUDA: sudo apt install nvidia-cuda-toolkit
  • NCCL:
    wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2004/x86_64/cuda-keyring_1.0-1_all.deb
    sudo dpkg -i cuda-keyring_1.0-1_all.deb
    sudo apt-get update
    sudo apt install libnccl2
  • libmpi: sudo apt install libopenmpi-dev
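
A quick way to confirm that these system packages are visible before continuing might look like the sketch below. It assumes the driver and CUDA toolkit packages put `nvidia-smi` and `nvcc` on PATH and that the NCCL and Open MPI libraries are discoverable under the names `nccl` and `mpi`. `check_system_requirements` is a hypothetical helper, not part of the repo.

```python
import ctypes.util
import shutil

# Executables the driver and CUDA packages are assumed to put on PATH.
REQUIRED_TOOLS = ("nvidia-smi", "nvcc")
# Shared libraries assumed to come from libnccl2 and libopenmpi-dev.
REQUIRED_LIBS = ("nccl", "mpi")


def check_system_requirements(tools=REQUIRED_TOOLS, libs=REQUIRED_LIBS):
    """Return a dict mapping each requirement to True if it was found."""
    found = {}
    for tool in tools:
        found[tool] = shutil.which(tool) is not None
    for lib in libs:
        found["lib" + lib] = ctypes.util.find_library(lib) is not None
    return found


if __name__ == "__main__":
    for name, ok in check_system_requirements().items():
        print(f"{name}: {'found' if ok else 'MISSING'}")
```

A MISSING entry only means the name was not found on this machine's search paths; it does not prove the package is absent.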

Installation

  1. Install miniconda, create a new environment, and install pytorch=2.1.0, mpi4py=3.1.5, and tensorrt-llm
    conda create -n trtllm python=3.10
    conda activate trtllm
    conda install pytorch torchvision torchaudio pytorch-cuda=12.1 -c pytorch -c nvidia
    conda install -c conda-forge mpi4py mpich
    pip install --no-cache-dir --extra-index-url https://pypi.nvidia.com tensorrt-llm
    In China, you can use the commands below without a VPN:
    conda create -n trtllm python=3.10
    conda activate trtllm
    conda install pytorch torchvision torchaudio pytorch-cuda=12.1 -c pytorch -c nvidia
    conda install -c conda-forge mpi4py mpich
    pip install -i https://pypi.tuna.tsinghua.edu.cn/simple --extra-index-url https://pypi.nvidia.com tensorrt-llm
  2. Install the requirements
    pip install -r requirements.txt
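
After the installation steps, one way to see which versions ended up in the environment is sketched below. It assumes the distributions are registered under the names `torch`, `mpi4py`, and `tensorrt-llm`; `installed_versions` is a hypothetical helper, and the version numbers to compare against are the ones listed in step 1.

```python
from importlib import metadata

# Distribution names assumed from the installation steps above.
EXPECTED = ("torch", "mpi4py", "tensorrt-llm")


def installed_versions(names=EXPECTED):
    """Map each distribution name to its installed version, or None if absent."""
    versions = {}
    for name in names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    for name, version in installed_versions().items():
        print(f"{name}: {version or 'not installed'}")
```

Run it inside the activated `trtllm` environment so that it inspects the same packages the app will use.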

IfrKonv commented on September 3, 2024 9

I believe that the majority of users interested in a similar aspect are Linux users, and it's indeed disheartening to note that it's released without API and support for Linux OS. I hope they provide a Linux version soon.

anujj commented on September 3, 2024 8

The current application targets Windows only.

cdelv commented on September 3, 2024 5

Hi @noahc1510, can you describe the process you used to fix the dependencies?

Would be nice to add what you did to the readme so people can use it on linux.

REALERvolker1 commented on September 3, 2024 4

@noahc1510 seconded. I would consider automating it and making a PR to add it. Basically, throw it against the wall and see if it sticks.

metal3d commented on September 3, 2024 3

There are two points on the readme that border on insulting.

Firstly, it's aimed solely at Windows, when the vast majority of servers used to train ML models are on Linux. That's a bit much for a community that's been fighting for 30 years to explain why we use this OS.

It's also getting very irritating to see Ubuntu, which is a distribution (and not the best one at all), perpetually conflated with GNU/Linux.

It's time for a change of mentality once and for all.
Thank you, therefore, for making sure that these models and this kind of project are offered on GNU/Linux in general from the start.

Sorry for the irritation, but I've been in the business for 22 years and it's starting to get a bit desperate.

ninono12345 commented on September 3, 2024

Hi @noahc1510. I have Ubuntu 23.10 running on an RTX 4090. I followed your instructions, and after pip install -r requirements.txt I downloaded the engine and tokenizer files.
Now I'm getting a different error:
    (trtllm) vgtu@vgtu-Default-string:~/Tomo/trt-llm-rag-windows$ python app.py --trt_engine_path model/ --trt_engine_name llama_float16_tp1_rank0.engine --tokenizer_dir_path model/ --data_dir dataset/
    Traceback (most recent call last):
      File "/home/vgtu/Tomo/trt-llm-rag-windows/app.py", line 26, in <module>
        from trt_llama_api import TrtLlmAPI #llama_index does not currently support TRT-LLM. The trt_llama_api.py file defines a llama_index compatible interface for TRT-LLM.
      File "/home/vgtu/Tomo/trt-llm-rag-windows/trt_llama_api.py", line 24, in <module>
        from llama_index.bridge.pydantic import Field, PrivateAttr
    ModuleNotFoundError: No module named 'llama_index.bridge'
At first only llama_index was missing, so I installed it with pip install llama_index, and now llama_index.bridge is missing. I also tried conda install llama_index; it installed, but the same error keeps showing.
Can you please help?
Thank you very much.

ninono12345 commented on September 3, 2024

Yeah... I just found out that I had to change llama_index.bridge to llama_index.legacy.bridge, and the same for some other imports.
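
Instead of editing each import by hand, a fallback import is one possible approach. The sketch below tries the old module path from the traceback first and then the `llama_index.legacy` path reported above. `import_first` and `load_pydantic_bridge` are hypothetical helpers, and whether either path exists depends on the installed llama_index version.

```python
import importlib


def import_first(*module_names):
    """Import and return the first module in module_names that can be found."""
    for name in module_names:
        try:
            return importlib.import_module(name)
        except ModuleNotFoundError:
            continue
    raise ModuleNotFoundError(f"none of {module_names} could be imported")


def load_pydantic_bridge():
    # Old path first (from the traceback), then the path reported to work.
    return import_first(
        "llama_index.bridge.pydantic",
        "llama_index.legacy.bridge.pydantic",
    )
```

In trt_llama_api.py this could replace the failing line with `bridge = load_pydantic_bridge()` followed by `Field, PrivateAttr = bridge.Field, bridge.PrivateAttr`. Note that a ModuleNotFoundError raised inside an existing module (for example, a missing dependency) is also treated as "not found" here.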

