maaikeb / elasticsearch-benchmarking Goto Github PK

View Code? Open in Web Editor NEW

0.0 0.0 0.0 2.64 MB

A tool to measure ElasticSearch search performance over time, created for the OpenBudget project

Home Page: https://next.obudget.org/

Python 100.00%

elasticsearch-benchmarking's People

Contributors

Watchers

elasticsearch-benchmarking's Issues

Add abstraction for more generic use

Add abstraction for more generic use, so that it can easily be used for search engines other than ElasticSearch.

The code could be according to the below pseudo-code:

class Result(dict):

    def position(self):
        raise NotImplementedError()


class ElasticSearchResult(Result):
    pass


class Query(object):
    pass


class ElasticSearchQuery(Query):

    def __init__(self, term, filters={})
        pass


class BaseResultOracle(object):
    def query(self, query: Query) -> Result[]:
        raise NotImplementedError()


class ElasticSearchOracle(BaseResultOracle):

    def query(self, query: ElasticSearchQuery):
        # returns ElasticSearchResult[]

class BaseScorer(object):
    def calculate_score(self, matcher_results) -> float:
        raise NotImplementedError()

class FirstPageScorer(BaseScorer):
    pass
    
    
b = Benchmark(
    oracle=ElasticSearchOracle(hostname, port),
    scorer=FirstPageScorer(page_size=10, factor=0.87),
)

test_b = Benchmark(
    oracle=FixedResultsOracle(test_results),
    scorer=ConstScorer(5),
)

test_suite = [
    (ElasticSearchQuery('JK Rowling'), lambda r: r['author'] == 'JK Rowling'),
    (ElasticSearchQuery('Harry Potter 1'), lambda r: r['isbn'] == '12331234657465'),
]

score = b.execute(test_suite)

Add tests

Description

Add more tests, both queries that return expected results correctly, as well as cases of queries that don't return all the expected results.

Examples of queries on the entities index that dont return all the expected results:

שיקום
returns results with הקמה instead of שיקום

קוד פתוח
returns results with only פתוח

This happens probably because of the analyzer_hebrew

Can ask @noamoss for more cases.

Build the ElasticSearchQuery Class

Description

Right now the ElasticsearchQuery class doesn't do much, and it should be expanded to be something similar to the Query class in the apies repo:
https://github.com/OpenBudget/apies/blob/master/apies/query.py

Enhance scoring function(s)

Description

Currently there is a default scoring function - the FirstPageScorer.
It returns the ratio of the results that meet the requirement to the total amount of results.
This is not very sophisticated, and can probably use improvement.
Also, add ideas for other scoring methods.

maaikeb / elasticsearch-benchmarking Goto Github PK

elasticsearch-benchmarking's People

Contributors

Watchers

elasticsearch-benchmarking's Issues

Add abstraction for more generic use

Add tests

Build the ElasticSearchQuery Class

Enhance scoring function(s)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent