Code Monkey home page Code Monkey logo

gitrjaa / gpt-document-trained-chatbot-builder Goto Github PK

View Code? Open in Web Editor NEW

This project forked from shamspias/gpt-document-trained-chatbot-builder

0.0 0.0 0.0 4 KB

GPT-Document-Trained-Chatbot-Builder is a powerful, user-friendly tool that lets you create AI chatbots trained on your uploaded documents. Extract text from various file formats, train a GPT-3.5 or 4 based chatbot, and enjoy seamless interactions with efficient Pinecone storage.

gpt-document-trained-chatbot-builder's Introduction

GPT-Document-Trained-Chatbot-Builder

GPT-Document-Trained-Chatbot-Builder is a powerful tool that enables users to create AI chatbots capable of answering questions based on the content of provided documents. The chatbots are built using OpenAI's GPT-3.5 and text-embedding-ada-002 models for text understanding and processing. The embeddings generated by text-embedding-ada-002 are stored using Pinecone, a high-performance vector search and storage service.

This tool allows users to upload documents in various formats (such as PDF, Word, or Markdown), extract the text, and train the AI chatbot to understand and respond accurately based on the information in those documents.

Technology Used

  • OpenAI GPT-3.5: A powerful language model for generating text and answering questions.
  • Text-embedding-ada-002: An embedding model used for converting text into numerical vectors.
  • Pinecone: A scalable, high-performance vector storage and search service.
  • Python: The backend programming language for processing and training the chatbot.
  • Flask: A lightweight web framework used for creating the web interface for users to interact with the chatbot.

Features

  • Document upload: Users can upload documents in various formats like PDF, Word, Markdown, etc.
  • Text extraction: The tool automatically extracts text from the uploaded documents.
  • AI chatbot training: The extracted text is used to train the AI chatbot based on GPT-3.5 and text-embedding-ada-002 models.
  • Pinecone storage: The embeddings generated by text-embedding-ada-002 are stored in Pinecone for efficient retrieval and matching.
  • Multilingual support: The chatbot can be trained and answer questions in multiple languages.
  • Customizable chatbot: Users can provide specific instructions for the chatbot to follow when answering questions.

Future Scope

  • Integration with popular chat platforms like Discord, WhatsApp, and Telegram.
  • Support for more file formats and content sources for training the chatbot.
  • API access for developers to easily integrate the chatbot into their applications.
  • Enhanced customization options for the chatbot, including adjusting response style and tone.
  • Improved performance and scalability for handling large numbers of documents and users.

Requirements

  • Python 3.7 or newer
  • Flask web framework
  • OpenAI API key
  • Pinecone API key
  • Additional Python libraries: pandas, numpy, pdfplumber, docx2txt, markdown2, and requests

To get started, clone the repository and follow the installation instructions in the provided documentation.

gpt-document-trained-chatbot-builder's People

Contributors

shamspias avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.