
lingoose's Introduction

πŸ‘¨β€πŸ’» Senior Backend Developer at UFirst | ☁️ Cloud Adept | 🐧Linux/IoT Expert | 🏝️ Full-remote Addicted | ✍️ Content Creator

Blog - simonevellei.com

lingoose's People

Contributors

flyingduck, henomis

lingoose's Issues

v0.0.13

  • #150
  • add drop and delete index methods #153
  • switch to openai tools and update models #156
  • Add HF text to image models support #157
  • implement lingoose thread using openai as LLM backend #158

Implement method for Vector Databases to delete collection/payloads

Is your feature request related to a problem? Please describe.
Sometimes you want to remove a collection or payload from a vector database; these methods are exposed by the vector databases' APIs.

Describe the solution you'd like
Add a method to the index interface to allow for deletion of documents

release v0.0.3

Tasks

  • use new documentation template
  • add linter to github actions #26
  • Refactor pipeline input and output #27
  • Add splitter #28
  • move docs to a different branch #29
  • refactor prompt interface #30

v0.0.8

Tasks

  • support Qdrant #89
  • implement dalle transformer #84
  • support custom openai client (discussion here) #86
  • support new openai models #88
  • support for openai functions see here #90

Fun fact: I started integrating the official Qdrant client, which uses gRPC. To avoid huge imports, a custom qdrant-go package has been developed, based on my restclientgo (already imported in lingoose).

Build issue with index package (using Go 1.21)

Hi! Thank you for this library, very excited to give it a try. When trying to build your quickstart, I run into these issues on go build (Go 1.21).

./main.go:18:8: undefined: index.NewSimpleVectorIndex
./main.go:19:27: undefined: index.NewSimpleVectorIndex
./main.go:19:127: undefined: index.WithTopK

All other packages seem to be correctly installed. Only "index" is preventing the build. Any idea what could be wrong here?

Where do I put my keys?

I understand that OpenAI does not allow shared API keys, but I have been unable to find, in any example, where to put an API key. For example, the quickstart in the docs gives no instructions on where to put my key, despite the example saying that "this is literally it". I also haven't been able to locate anything at all about OpenAI API keys in the documentation.

qapipeline.WithPrompt returns a new QAPipeline instance

Describe the bug
When using qapipeline.WithPrompt, it returns a new QAPipeline instead of modifying the current one. Later, when calling Run or Query, it panics with a nil pointer dereference because the LLMEngine is nil.

To Reproduce

res, err := qapipeline.
	New(openaiClient).
	WithPrompt(chatConv).
	WithIndex(a.index).
	Query(context.Background(), query, option.WithTopK(1))

Expected behavior
The WithPrompt method should modify the current instance and return that one.

Lingoose version: 0.0.12

v0.0.7

Tasks

  • support openai completion and chat streams #73
  • support hugging face conversational #74
  • support hugging face text generation model ref #75
  • add tesseract loader #76
  • add hugging face image to text loader #77
  • add hugging face speech recognition loader #78
  • add hugging face sentence transformer embedder #79
  • summarize #80

release v0.0.4

Tasks

  • 🐞 fix metadata deep copy #32
  • Don't store documents content as metadata, return ID. #33
  • include more unit test #34
  • check all New constructors; if they initialize default values, consider unexporting the struct (easy if all the struct properties are unexported) #35
  • refactor vector upsert & index creation: create a batch upsert (100?) and check if the index already exists #36
    • batchSize = 25 default, index shouldn't be created by lingoose.
    • Document content can be inserted (as done in v0.0.3) into vectors as metadata ref: here
  • provide package errors #37
  • llm metadata callback #38
  • add github star button
  • refactor documentation following the changes of this PR and all related to this issue
  • use String() in prompt #39
  • Add types.Meta for metadata #40
  • add pdf loader #41
  • refactor float64 embeddings #42
  • Support whisper audio output format #43
  • change embedder interface #44
  • use this to chunk and normalize openai embeddings #45
  • refactor loader and add PubMed #46

Why go?

Hi henomis,
Thanks for making this cool project.
I am hesitating over which framework to use: the Go version or the Python version.
Is the only reason to choose a Go framework that it's more friendly to gophers?
Are there any other strong reasons for choosing a Go framework?
Any insight would be much appreciated!

SimpleVectorIndex.load() can easily be called more than once

index/simpleVectorIndex

func (s *Index) IsEmpty() (bool, error) {
	...      
	err := s.load()
	...
}

func (s *Index) SimilaritySearch(ctx context.Context, query string, opts ...option.Option) (index.SearchResponses, error) {
	...
	err := s.load()
	...
}

The invocation of the load() function is subtle, which easily leads to repeated calls, such as in the example above.

We'd better make sure it's loaded only once.

v0.0.11

  • fix README example #117
  • change logo and refactor README #118
  • custom HTTP client support for Huggingface API #122
  • fix multiple load() in simpleVectorIndex #123
  • fix huggingface llm verbosity #124
  • fix directoryLoader validator #119
  • Refactor simpleVectorIndex internal structure: see here #126
  • indexes must be able to work with raw data structures not linked to documents. #127
  • refactor indexes methods Search and Query. #128
  • add a method to append a vector to an index #129
  • implement index retriever #130
  • implement cache #131
  • retriever implementation has the following issues: #132
    • it is strictly linked to documents. A retriever is a helper to access an index.
    • who is in charge of loading documents: the index or the retriever?
    • remove retriever
  • lint code #133
  • Add new QA pipeline mode refine #134
  • Fix: indexes may confuse cosine distance with cosine similarity #135
  • Update docs

release 0.0.6

  • add context to loaders #59
  • audio loader whispercpp #60
  • audio loader whisper api #62
  • csv loader #65
  • libreoffice loader #61
  • llamacpp llm #63
  • llamacpp embeddings #64
  • add pipeline step callback. #67 #68
  • add sql tube #66
  • add support for mysql #69

Inquiry about support for custom HTTP Client of Hugging Face API

Background:

Currently, in the doRequest function of the HuggingFace client, HTTP requests are made using http.DefaultClient. While this works for most scenarios, there is a need for more flexibility when it comes to customizing the behavior of the HTTP client.

Request:

I would like to request an enhancement that allows users to specify their own HTTP client when making requests through the library. This feature would provide users with the ability to configure custom settings for the HTTP client, such as timeouts, custom transport options, or any other client-specific configurations.

Proposed Implementation:

One possible implementation approach could involve modifying the doRequest function to accept an http.Client as an argument. This change would allow users to pass their own pre-configured HTTP client when making requests, as follows:

func (h *HuggingFace) doRequest(ctx context.Context, jsonBody []byte, model string, httpClient *http.Client) ([]byte, error) {
    // Use the provided httpClient for making the request.
    // ...
}

By making this modification, users would have the flexibility to tailor the HTTP client to their specific requirements.

release v0.0.5

Tasks

  • add stop sequence to the openai llm constructor #48
  • Constructors shouldn't return an error, otherwise they will not be composable: a := pkg.New().WithValue().WithSome(). #49
  • t := NewTube(llm, decoder).WithMemory(name, memory): in general, extend with methods to set optional parameters. #50
  • pinecone add option to create index. consider constructor composability. #51
  • add concurrent embedding using goroutines #53
  • A generic loader should have an optional text splitter: compose a loader with WithSplitter(textsplitter); textsplitter must be an interface inside loaders. The result should be something like: loader.NewPDFToTextLoader("/usr/bin/pdftotext", "./kb").WithSplitter(textsplitter.NewRecursiveCharacterTextSplitter(2000, 200)).Load() #54
  • are pipe templates useful? https://twitter.com/matchaman11/status/1655622928535523328?s=46&t=StJvFDFYoKhmJGuu1e_cbA #52
  • refactor prompt templates #55
  • refactor splitter #57
  • refactor readme #58

v0.0.12

  • implement index engines #138
  • implement Milvus as engine #139
  • add index insert data callback #144
  • implement redis vector storage (see here and here) #145
  • implement PostgreSQL engine #146
  • Misc #147

Knowledge Base example can't be run

Describe the bug
When trying to run the Knowledge base example (https://github.com/henomis/lingoose/blob/main/examples/embeddings/knowledge_base/main.go) I got an error about the github.com/henomis/lingoose/index/vectordb/jsondb package.

% go mod tidy
go: finding module for package github.com/henomis/lingoose/index/vectordb/jsondb
go: example.com/lingoosedb imports
	github.com/henomis/lingoose/index/vectordb/jsondb: module github.com/henomis/lingoose@latest found (v0.0.11), but does not contain package github.com/henomis/lingoose/index/vectordb/jsondb

Release v0.0.1-alpha2

Tasks

  • implement github pages lingoose homepage
  • add to the README the concept lingoose = lingo + go + goose
  • request a ⭐

Refactor API

tasks

  • remove examples
  • remove partials
  • remove langchain?
  • New(input interface{}, outputDecoderFn DecoderFn, template string)
  • Add new code
type Decoder interface {
	Decode(interface{}) error
}

type OutputHandler func(string) Decoder

type Template struct {
	Input         interface{}
	Output        interface{}
	OutputHandler OutputHandler
	Template      string

	templateEngine *texttemplate.Template
}


func New(
	input interface{},
	output interface{},
	outputHandler OutputHandler,
	template string,
) (*Template, error) {
	// validate input struct using go struct validator
	// validate template
	templateEngine, err := texttemplate.New("prompt").Parse(template)
	if err != nil {
		return nil, err
	}

	return &Template{
		Input:          input,
		Output:         output,
		OutputHandler:  outputHandler,
		Template:       template,
		templateEngine: templateEngine,
	}, nil
}

func (p *Template) Format() (string, error) {

	var output bytes.Buffer
	err := p.templateEngine.Execute(&output, p.Input)
	if err != nil {
		return "", err
	}

	return output.String(), nil
}

type Llm struct {}

func (l *Llm) Completion(promptTemplate *Template) (interface{}, error) {
	// prompt
	prompt, err := promptTemplate.Format()
	if err != nil {
		return nil, err
	}
	_ = prompt

	var output string
	_ = output // call llm(prompt) -> output

	var llmResponse interface{}
	_ = llmResponse // llm response

	// decode output
	err = promptTemplate.OutputHandler(output).Decode(promptTemplate.Output)
	if err != nil {
		return nil, err
	}

	return llmResponse, nil
}

func (l *Llm) Chat(chat *chat.Chat) interface{} {
	// chat prompt

	messages := chat.ToMessages()
	_ = messages

	// call llm(messages) -> output
	// add message to chat messages?

	return nil
}

type Pipeline struct{}

func (p *Pipeline) Run(llm *Llm, prompt *Template) (interface{}, error) {
	llm.Completion(prompt)

	return prompt.Output, nil
}

Long term plan

