arthur_bench.client.local#

Submodules#

class arthur_bench.client.local.client.LocalBenchClient(root_dir: str | Path | None = None)#

Bases: BenchClient

Client for managing local file system test suites and runs

create_new_test_run(test_suite_id: str, json_body: CreateRunRequest) CreateRunResponse#

Create a new run for a test suite.

Parameters:
  • test_suite_id – the uuid of the test suite to log a run for

  • json_body – run request containing run_metadata and scored model generations

create_test_suite(json_body: TestSuiteRequest) PaginatedTestSuite#

Create a new test suite.

Parameters:

json_body – test suite request object consisting of test suite metadata and test cases

delete_test_run(test_suite_id: str, test_run_id: str)#

Delete a test run from a suite.

delete_test_suite(test_suite_id: str)#

Delete a test suite. All associated runs will also be deleted

get_runs_for_test_suite(test_suite_id: str, sort: CommonSortEnum | TestRunSortEnum = CommonSortEnum.CREATED_AT_ASC, page: int = 1, page_size: int = 5) PaginatedRuns#

Get runs for a given test suite.

Parameters:
  • test_suite_id – the uuid of the test suite

  • sort – optional sort key. possible values are ‘name’, ‘avg_score’, and ‘ created_at’. use ‘-’ prefix for descending sort. defaults to ‘created_at’

  • page – the page to fetch

  • page_size – page size to fetch

get_summary_statistics(test_suite_id: str, run_ids: list[str] | None = None, page: int = 1, page_size: int = 5) TestSuiteSummary#

Fetch aggregate statistics of a test suite. Returns averages and score distributions for runs in test suite.

Parameters:
  • test_suite_id – uuid of the test suite

  • run_id – optional run id. run will be included in response regardless of page information if provided

  • page – the page to fetch

  • page_size – page size to fetch

get_test_run(test_suite_id: str, test_run_id: str, page: int = 1, page_size: int = 5, sort: TestCaseSortEnum | None = None) PaginatedRun#

Get a test run by id.

Parameters:
  • test_suite_id – uuid of the test suite

  • test_run_id – uuid of the test run

  • page – the page to fetch, pagination refers to the test cases

  • page_size – page size to fetch, pagination refers to the test cases

  • sort – sort key to sort the retrieved results

get_test_suite(test_suite_id: str, page: int = 1, page_size: int = 5) PaginatedTestSuite#

Get a test suite by id.

Parameters:
  • test_suite_id – the uuid of the test suite to fetch

  • page – the page to fetch, pagination refers to the test cases

  • page_size – page size to fetch, pagination refers to the test cases

get_test_suite_by_name(test_suite_name: str) PaginatedTestSuite#

Additional getter to maintain backwards compatibility with non-identified local files

get_test_suites(name: str | None = None, sort: CommonSortEnum | TestSuiteSortEnum = TestSuiteSortEnum.LAST_RUNTIME_ASC, scoring_method: List[str] | None = None, page: int = 1, page_size: int = 5) PaginatedTestSuites#

Get metadata for all test suites.

Parameters:
  • name – filter test suites by name if provided

  • sort – optional sort key. possible values are ‘name’, ‘last_run_time’, ‘created_at’, use ‘-’ prefix for descending sort. defaults to ‘last_run_time’

  • method (scoring) – optional filter on scoring method name, multiple names may be provided

  • page – the page to fetch

  • page_size – page size to fetch

class arthur_bench.client.local.client.PageInfo(sorted_pages: List, start: int, end: int, page: int, page_size: int, total_pages: int, total_count: int)#

Bases: object

end: int#
page: int#
page_size: int#
sorted_pages: List#
start: int#
total_count: int#
total_pages: int#