
phoenix's People

Contributors

a-s-gorski, amank94, anticorrelator, arizedatngo, axiomofjoy, camyoung93, cjunkin, davidgmonical, dependabot[bot], fjcasti1, github-actions[bot], gregwchase, hakantekgul, harrisonchu, jgilhuly, jlopatec, kryskirkland, lou-k, matthewsh, michaelschiff, mikeldking, mkhludnev, nate-mar, parker-stafford, pbadhe, rogerhyang, sallyannarize, shashankvsg, tammy37, trevor-laviale-arize

phoenix's Issues

gitignore compiled js bundle

Make the compiled JS bundles gitignored and find a way to include them as package assets when `pip install` runs. Right now the bundle fails to be mounted in the Python package if it is gitignored.
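
One possible direction, sketched with setuptools (the bundle path and package name are assumptions): declare the built bundle as package data so the wheel ships it even though git ignores it. An sdist would additionally need the files listed in MANIFEST.in, or the JS build wired into a build hook.

from setuptools import find_packages, setup

setup(
    name="phoenix",
    packages=find_packages(),
    # Hypothetical bundle location: ship the built JS with the wheel even
    # though .gitignore excludes it from version control.
    package_data={"phoenix": ["server/static/*.js"]},
)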

[metrics] Add support for getting timeseries data of any calculation

As a developer, I want to be able to compose a metric calculation with one or two datasets so that the metric is calculated over a specified time interval, at a granularity of my choosing (or automatically).

Pseudo-Code

metricAggregate = Calculate(df, metrics=[EuclideanDistance])  # retrieves the metric aggregated over the entire dataframe
metricOverTime = TimeSeries(df, metrics=[EuclideanDistance], granularity="hourly")  # retrieves the metric over time at the specified granularity

Calculate

psi_value = Calculate([primary_df, baseline_df], psi)

Returns

print(psi_value) # 2.73

Calculate Over Time

psiOverTime = CalculateOverTime([primary_df, baseline_df], psi, granularity="1hour")

Returns

print(psiOverTime) # [{ timestamp: "12-12-2022", v: 1.27 }, { timestamp: "12-12-2022", v: 1.27 }] 
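
Below is a minimal sketch, assuming pandas, of how CalculateOverTime could be implemented with time-bucketing; the psi implementation, the "timestamp" column, and the function names are illustrative assumptions, not the proposed API.

import numpy as np
import pandas as pd

def psi(primary: pd.Series, baseline: pd.Series, bins: int = 10) -> float:
    # Population Stability Index, binned on the baseline distribution.
    edges = np.histogram_bin_edges(baseline.dropna(), bins=bins)
    p, _ = np.histogram(primary.dropna(), bins=edges)
    b, _ = np.histogram(baseline.dropna(), bins=edges)
    p = np.clip(p / max(p.sum(), 1), 1e-6, None)
    b = np.clip(b / max(b.sum(), 1), 1e-6, None)
    return float(np.sum((p - b) * np.log(p / b)))

def calculate_over_time(primary_df, baseline_df, column, granularity="1h"):
    # Bucket the primary dataframe by time and compute the metric per
    # bucket against the full baseline.
    buckets = primary_df.set_index("timestamp").resample(granularity)
    return [
        {"timestamp": ts, "v": psi(group[column], baseline_df[column])}
        for ts, group in buckets
        if not group.empty
    ]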

[metrics] CSV parsing for embeddings

Vector columns are not parsed appropriately from CSV files and are not type safe.

Acceptance Criteria

  • Properly validate that the dataframe columns match the expected types (see the validation sketch below)
  • Remove the cast and make the column retrieval methods on `Dataset` type safe
  • Add tests for parsing
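
A minimal validation sketch, assuming the Dataset wraps a pandas dataframe (the function name, column handling, and error type are illustrative):

import numpy as np
import pandas as pd

def validate_vector_column(df: pd.DataFrame, column: str) -> None:
    # Fail fast if any cell is not a non-empty list or array of floats.
    is_bad = df[column].map(
        lambda v: not (isinstance(v, (list, np.ndarray)) and len(v) > 0)
    )
    if is_bad.any():
        raise TypeError(f"{column!r}: {int(is_bad.sum())} rows are not float vectors")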

pip install fails due to HDF5Close

Not sure how to avoid this:

(notebook) ➜  phoenix git:(47-lasso-select) ✗ (⎈|dev:arize-dev)pip install .
Processing /Users/mikeldking/work/phoenix
  Installing build dependencies ... done
  Getting requirements to build wheel ... done
  Installing backend dependencies ... done
  Preparing metadata (pyproject.toml) ... done
Collecting pandas
  Downloading pandas-1.5.2-cp310-cp310-macosx_11_0_arm64.whl (10.8 MB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 10.8/10.8 MB 18.7 MB/s eta 0:00:00
Collecting umap-learn
  Using cached umap_learn-0.5.3-py3-none-any.whl
Collecting numpy
  Using cached numpy-1.23.5-cp310-cp310-macosx_11_0_arm64.whl (13.4 MB)
Collecting hdbscan
  Using cached hdbscan-0.8.29-cp310-cp310-macosx_12_0_arm64.whl
Collecting tables
  Using cached tables-3.7.0.tar.gz (8.2 MB)
  Installing build dependencies ... done
  Getting requirements to build wheel ... error
  error: subprocess-exited-with-error

  × Getting requirements to build wheel did not run successfully.
  │ exit code: 1
  ╰─> [12 lines of output]
      /var/folders/1s/4vdv59n15b1ghg42frdd8f480000gn/T/H5close646cmm8d.c:2:5: error: implicit declaration of function 'H5close' is invalid in C99 [-Werror,-Wimplicit-function-declaration]
          H5close();
          ^
      1 error generated.
      cpuinfo failed, assuming no CPU features: No module named 'cpuinfo'
      * Using Python 3.10.3 (main, Apr 14 2022, 13:44:37) [Clang 13.1.6 (clang-1316.0.21.2.3)]
      * Found cython 0.29.32
      * USE_PKGCONFIG: True
      .. ERROR:: Could not find a local HDF5 installation.
         You may need to explicitly state where your local HDF5 headers and
         library can be found by setting the ``HDF5_DIR`` environment
         variable or by using the ``--hdf5`` command-line option.
      [end of output]

  note: This error originates from a subprocess, and is likely not a problem with pip.
error: subprocess-exited-with-error

× Getting requirements to build wheel did not run successfully.
│ exit code: 1
╰─> See above for output.

note: This error originates from a subprocess, and is likely not a problem with pip.
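
A common workaround on macOS (untested here) is to install HDF5 via Homebrew and point the tables build at it before installing, e.g. export HDF5_DIR=$(brew --prefix hdf5), as the build output above suggests.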

[metrics] Bottom out computation limits using pandas.

Metrics

  • Accuracy
  • Percent Empty (percent of each column that's NaN)
  • AUC
  • 60M predictions - drift calculation on 200 features using PSI
  • Embedding average 60M embeddings with 1K dimensions
  • Embedding average 60M embeddings with 10K dimensions
  • NDCG metric and precision metric on 60M predictions
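
A rough timing harness for these scale tests might look like the sketch below; the row count is scaled down so it runs on a laptop, and percent-empty stands in for the other metrics.

import time

import numpy as np
import pandas as pd

# Scale n_rows up toward the 60M target for the real benchmark.
n_rows, n_features = 1_000_000, 200
rng = np.random.default_rng(0)
df = pd.DataFrame(
    rng.standard_normal((n_rows, n_features)),
    columns=[f"f{i}" for i in range(n_features)],
)

start = time.perf_counter()
percent_empty = df.isna().mean()  # fraction of each column that is NaN
elapsed = time.perf_counter() - start
print(f"percent-empty over {n_rows:,} rows x {n_features} cols: {elapsed:.2f}s")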

Improve docstring coverage GitHub action

The current (commented-out) GitHub action that checks for docstring coverage needs to:

  • Ignore `__init__.py` files
  • Allow a verbose option so we can see in the action console what went wrong, instead of having to run interrogate locally
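
For what it's worth, interrogate appears to expose flags covering both items (--ignore-init-module to skip `__init__.py` files, -v/-vv for per-file output); whether they slot cleanly into the action is untested here.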

Embeddings are read as full strings from csv files

Ideally, reading a table would give us an array of floats in every cell of the embedding vector column. This is not the case when reading data from a CSV file: the whole embedding is read as the string '[1.31,-0.46,...,-.108]' instead of the array [1.31,-0.46,...,-.108]. Other file formats, like HDF5, preserve the data structure better and don't have this problem.

This happens with essentially any field that expects an iterable inside a cell, e.g., it also happens with the list of token arrays.
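
A minimal workaround sketch for the CSV case, using a read_csv converter (the file and column names are assumptions):

import ast

import numpy as np
import pandas as pd

def parse_vector(cell: str) -> np.ndarray:
    # The CSV cell arrives as a string like "[1.31,-0.46,-.108]"; parse it
    # safely into a float array.
    return np.array(ast.literal_eval(cell), dtype=float)

df = pd.read_csv("inferences.csv", converters={"embedding_vector": parse_vector})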

Add flexibility on the number of points per dataset in UMAP

Users should have some control over the number of points they want from each dataset.

In addition, splitting the primary and reference datasets at a fixed number of points per dataset, as here, will error:

primary_dataset_points = construct_dataset_points(
    projections[:points_per_dataset], sampled_primary_dataset, embedding_feature
)
reference_dataset_points = construct_dataset_points(
    projections[points_per_dataset:], sampled_reference_dataset, embedding_feature
)
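
One possible fix, sketched under the assumption that the sampled datasets expose their length, is to split the projections at the actual primary sample count instead of a fixed constant:

n_primary = len(sampled_primary_dataset)  # actual number of sampled primary points
primary_dataset_points = construct_dataset_points(
    projections[:n_primary], sampled_primary_dataset, embedding_feature
)
reference_dataset_points = construct_dataset_points(
    projections[n_primary:], sampled_reference_dataset, embedding_feature
)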

Add a `Dataset` module

Add an `arize_toolbox.dataset` module that keeps track of inference records in a performant, normalized format. It must support features, actuals, and embeddings, and be:

  • serializable to a pandas dataframe
  • serializable to JSON
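
A minimal sketch of what the module could look like; the class shape and method names are assumptions, not a spec.

from dataclasses import dataclass

import pandas as pd

@dataclass
class Dataset:
    # One row per inference record; features, actuals, and embedding
    # columns live in the normalized dataframe.
    dataframe: pd.DataFrame

    def to_dataframe(self) -> pd.DataFrame:
        return self.dataframe

    def to_json(self) -> str:
        return self.dataframe.to_json(orient="records")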
