Code Monkey home page Code Monkey logo

generativeai_and_llm_odia's Introduction

Generative AI and LLM Initiative for the Odia Language

HuggingFace badge License: CC BY-NC-SA 4.0

Twitter Discord

Table of contents

Latest Updates

  • [12thApril2023] We released our first experiment Odia LLM odiagenAI-model-v0. Please go through our Blog for more details.

About

The Odia Generative AI (in short, OdiaGenAI) is an initiative to research Generative AI and Large Language Models (LLMs) for the low-resource Odia language.

Objective

The OdiaGenAI aims to

  1. Build pre-trained Odia LLM,
  2. Fine-tuned Odia LLM, and
  3. Instruct LLM (Odia).

The data, code, and models will be available to the public for research and non-commercial purposes.

Why OdiaGenAI

  • First: Though many LLMs support multilingual, including Odia language, the performance for various tasks (e.g., content generation, question-answering) is limited due to the amount of ingested data for Odia.

  • Second: There is subscription or fees associated with the high-performing LLMs.

  • Third: The usage (privacy) and bias of data input to these LLMs are in question.

What are the focus research areas of OdiaGenAI

We have divided the primary focus areas into three parts.

1. Literature Survey: Investigate the latest developments in Generative AI and LLMs and analyze current methods to support the Odia language for different tasks.

2. Development: Developing pre-trained and fine-tuned Odia LLM, which includes dataset preparation, model training, evaluation, prompt engineering, and API development.

3. Deployment: Deploy the Odia LLM models for public access for research and non-commercial purposes.

Who can use OdiaGenAI LLMs

The models (pre-trained/fine-tuned) will be available through Hugging Face for research and non-commercial purposes. Feel free to contact us for a domain-specific application or particular use cases.

What are the use cases of OdiaGenAI LLMs

There are several use cases of OdiaGenAI LLMs. Three primary domains relating to Odisha which we are focusing to use the developed LLM are:

  • Education
  • Healthcare
  • Governance
  • Tourism
  • Agriculture
  • Industrial Application

Apps

Contributors

About our logo: The critically endangered Olive Ridley sea turtle is the world's smallest and most prevalent marine turtle. Travel thousands of kilometers in the ocean for nesting. The Gahirmatha Marine Sanctuary in Odisha is the largest known mass nesting rookery for olive ridley sea turtles worldwide.

Contact

Please contact Shantipriya Parida ([email protected]) for any contribution/support/usage.

Supporters

Odias in Machine Learning

Citation

If you find this repository useful, please consider giving โญ and citing:

@misc{OdiaGenAI,
  author = {Shantipriya Parida and Sambit Sekhar and Subhadarshi Panda and Soumendra Kumar Sahoo and Swateek Jena and Abhijeet Parida and Arghyadeep Sen and Satya Ranjan Dash and Deepak Kumar Pradhan},
  title = {OdiaGenAI: Generative AI and LLM Initiative for the Odia Language},
  year = {2023},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/shantipriyap/OdiaGenAI}},
}

License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

CC BY-NC-SA 4.0

generativeai_and_llm_odia's People

Contributors

a-parida12 avatar sam-ai avatar shantipriyap avatar shantipriyaparida avatar soumendrak avatar swateek avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.