Comments (6)

BenWilson2 avatar BenWilson2 commented on June 10, 2024

cc @prithvikannan @dbczumar

from mlflow.

dbczumar avatar dbczumar commented on June 10, 2024

Hi @alena-m, thank you for raising this. We agree that it would be valuable, and we would really appreciate a contribution for it. I'll add the help wanted label for now. Let us know if you'd like to reconsider and take this on.

dbczumar avatar dbczumar commented on June 10, 2024

cc @sunishsheth2009

prithvikannan avatar prithvikannan commented on June 10, 2024

Agreed, this would be super useful. Most of the components needed for this are already present: make_metric() to create an arbitrary EvaluationMetric, and the deployment client's predict() to call an LLM. We would need to figure out an approach for users to define their own grading prompt and the corresponding parsing logic.
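
To make the "custom grading prompt plus parsing logic" idea concrete, here is a minimal sketch in plain Python. It does not use the MLflow API: `build_judge`, `parse_score`, and `fake_llm` are hypothetical names invented for illustration, and `fake_llm` stands in for whatever actually calls the model (for example, a deployment client's predict()). The resulting scoring function is the kind of callable one might then wrap in a custom metric.

```python
import re
from typing import Callable, Optional


def parse_score(response_text: str) -> Optional[int]:
    """Hypothetical parser: pull an integer out of a 'Score: N' line."""
    match = re.search(r"score:\s*(\d+)", response_text, flags=re.IGNORECASE)
    return int(match.group(1)) if match else None


def build_judge(
    grading_prompt: str,
    parse_fn: Callable[[str], Optional[int]],
    call_llm: Callable[[str], str],
) -> Callable[[str, str], Optional[int]]:
    """Combine a user-defined prompt template, parser, and LLM caller."""

    def judge(question: str, answer: str) -> Optional[int]:
        # The template is assumed to use {input} and {output} placeholders.
        prompt = grading_prompt.format(input=question, output=answer)
        return parse_fn(call_llm(prompt))

    return judge


def fake_llm(prompt: str) -> str:
    """Stand-in for a real model call; always returns the same grade."""
    return "Score: 4\nJustification: mostly correct."


judge = build_judge(
    "Grade the answer from 1 to 5.\nQuestion: {input}\nAnswer: {output}",
    parse_score,
    fake_llm,
)
print(judge("What is MLflow?", "An open source ML platform."))
```

Keeping the prompt template and the parser as separate arguments lets a user change how the judge responds without touching the code that calls the model.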

github-actions avatar github-actions commented on June 10, 2024

@mlflow/mlflow-team Please assign a maintainer and start triaging this issue.

Cokral avatar Cokral commented on June 10, 2024

I'd like to work on this issue.


My understanding is that make_genai_metric instantiates an EvaluationModel with the grading_system_prompt_template hardcoded.
So the user could pass an optional grading_system_prompt_template argument that is used in place of the built-in template when provided. That keeps the option simple, and it doesn't break the existing approach!

Would that fit the requirements?
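
A rough sketch of the optional-argument pattern described above, in plain Python rather than the actual MLflow code. `DEFAULT_TEMPLATE` and `make_grading_prompt` are hypothetical names, and the template text is made up; only the `grading_system_prompt_template` argument name comes from the proposal.

```python
from typing import Optional

# Hypothetical stand-in for the template that is currently hardcoded.
DEFAULT_TEMPLATE = (
    "You are an impartial judge. Grade the output for {name} on a scale of 1 to 5."
)


def make_grading_prompt(
    name: str,
    grading_system_prompt_template: Optional[str] = None,
) -> str:
    """Use the caller's template if given, otherwise fall back to the default."""
    if grading_system_prompt_template is not None:
        template = grading_system_prompt_template
    else:
        template = DEFAULT_TEMPLATE
    return template.format(name=name)


# Existing callers pass nothing and get the default behaviour.
print(make_grading_prompt("correctness"))
# New callers can override the template.
print(make_grading_prompt("tone", "Grade {name} strictly, from 1 to 5."))
```

Because the new argument defaults to None, existing calls keep working unchanged.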
