It's been running all night with no results def fetch_GPT_response(instruction, sy

Isn't it convenient for you to release this file? <span class="ema

run time,about baranzinilab/kg_rag

karthiksoman commented on August 19, 2024

Hi @yangyangyang-github
This is a typical openai API call.
If the function is going in an endless loop, can you please double check your openai credentials that was provided in the config.yaml file?
Also, we have already put guardrails in the api call function to stop making calls after some sufficient time using retry module. Please refer here. Hence, if you are using the same functionality, it should not go to an endless loop.
Let me know how things turn out for you.

from kg_rag.

DayanaYuan commented on August 19, 2024

Hi @yangyangyang-github This is a typical openai API call. If the function is going in an endless loop, can you please double check your openai credentials that was provided in the config.yaml file? Also, we have already put guardrails in the api call function to stop making calls after some sufficient time using retry module. Please refer here. Hence, if you are using the same functionality, it should not go to an endless loop. Let me know how things turn out for you.

I have added openai.api_key parameters to the file. What exactly is the openai credentials that was provided in the config.yaml file? May I have a look, please?

from kg_rag.

karthiksoman commented on August 19, 2024

You should have a file named '.gpt_config.env' and store it in your $HOME path. Content of the file should be in the following format:

API_KEY='openai api key'
API_VERSION='this is optional'
RESOURCE_ENDPOINT='this is optional'

from kg_rag.

DayanaYuan commented on August 19, 2024

Isn't it convenient for you to release this file?

…

------------------ 原始邮件 ------------------ 发件人: "BaranziniLab/KG_RAG" ***@***.***>; 发送时间: 2024年2月28日(星期三) 中午1:38 ***@***.***>; ***@***.******@***.***>; 主题: Re: [BaranziniLab/KG_RAG] run time (Issue #18) You should have a file named '.gpt_config.env' and store it in your $HOME path. Content of the file should be in the following format: API_KEY= API_VERSION= RESOURCE_ENDPOINT= — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you were mentioned.Message ID: ***@***.***>

from kg_rag.

karthiksoman commented on August 19, 2024

The file contains API credentials, which, like any other sensitive information, should ideally not be shared publicly. Hope you get it :)
Feel free to reach out if you need further assistance!

from kg_rag.

DayanaYuan commented on August 19, 2024

For Llama coda, when run to llm = llama_model(MODEL_NAME, BRANCH_NAME, CACHE_DIR, stream=True, method=METHOD), model = AutoModelForCausalLM.from_pretrained(model_name, device_map='auto', torch_dtype=torch.float16, revision=branch_name, cache_dir=cache_dir), the program simply exits.

…

------------------ 原始邮件 ------------------ 发件人: "BaranziniLab/KG_RAG" ***@***.***>; 发送时间: 2024年2月28日(星期三) 中午1:54 ***@***.***>; ***@***.******@***.***>; 主题: Re: [BaranziniLab/KG_RAG] run time (Issue #18) The file contains API credentials, which, like any other sensitive information, should ideally not be shared publicly. Hope you get it :) Feel free to reach out if you need further assistance! — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you were mentioned.Message ID: ***@***.***>

from kg_rag.

karthiksoman commented on August 19, 2024

@yangyangyang-github
Did you check if this is a memory issue? We are not using quantized versions of llama here, hence it could take a good chunk of memory. If you see here, you can see the size of the tensors for llama-13b and compare it with the memory of the machine that you are using.

I tried using llama-13b on p3.8x.large AWS instance which has following specs:
4 Tesla V100 GPU
64 GB GPU memory
32 vCPU
244 GB RAM

from kg_rag.

run time about kg_rag HOT 7 CLOSED

Comments (7)

Related Issues (17)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent